Perceptual Hashing Using Pretrained Vision Transformers

被引:0
|
作者
De Geest, Jelle [1 ]
De Smet, Patrick [2 ]
Bonetto, Lucio [2 ]
Lambert, Peter [1 ]
Van Wallendael, Glenn [1 ]
Mareen, Hannes [1 ]
机构
[1] Univ Ghent, Imec, Dept Elect & Informat Syst, Technol Pk Zwijnaarde 122, B-9052 Ghent, Belgium
[2] Natl Inst Criminalist & Criminol NICC, Vilvoordsesteenweg 100, B-1120 Brussels, Belgium
来源
2024 IEEE GAMING, ENTERTAINMENT, AND MEDIA CONFERENCE, GEM 2024 | 2024年
关键词
Perceptual Hashing; Vision Transformer; Image Forensics;
D O I
10.1109/GEM61861.2024.10585453
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The rapid evolution of digital image circulation has necessitated robust techniques for image identification and comparison, particularly for sensitive applications such as detecting Child Sexual Abuse Material (CSAM) and preventing the spread of harmful content online. Traditional perceptual hashing methods, while useful, fall short when exposed to some common image transformations, or when images are doctored to avoid detection, rendering them ineffective for nuanced comparisons. Addressing this challenge, this paper introduces a novel pretrained vision transformer artificial intelligence (AI) model approach that enhances the robustness and accuracy of perceptual hashing. Leveraging a pretrained Vision Transformer (ViT-L/14), our approach integrates visual and textual data processing to generate feature arrays that represent perceptual image hashes. Through a comprehensive evaluation using a dataset of 50,000 images, we demonstrate that our method offers significant improvements in detecting similarities for certain complex image transformations, aligning more closely with human visual perception than conventional methods. While our method presents certain initial drawbacks such as larger hash sizes and high computational complexity, its ability to better handle perceptual nuances presents a forward step in the realm of image forensics. The potential applications of this research extend to law enforcement, digital media management, and the broader domain of content verification, setting the stage for more secure and efficient digital content analysis.
引用
收藏
页码:19 / 24
页数:6
相关论文
共 50 条
  • [21] Securing Biometric Systems by using Perceptual Hashing Techniques
    Hamadouche, Maamar
    Zebbiche, Khalil
    Zafoune, Youcef
    2023 20TH ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, AICCSA, 2023,
  • [22] PERCEPTUAL HASHING OF COLOR IMAGES USING HYPERCOMPLEX REPRESENTATIONS
    Laradji, Issam H.
    Ghouti, Lahouari
    Khiari, El-Hebri
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 4402 - 4406
  • [23] Robust perceptual image hashing using SIFT and SVD
    Singh, Kh. Motilal
    Neelima, Arambam
    Tuithung, T.
    Singh, Kh. Manglem
    CURRENT SCIENCE, 2019, 117 (08): : 1340 - 1344
  • [24] Secure Perceptual Hashing Scheme for Image using Encryption
    Sahana, M. S.
    Geetha, M. N.
    2017 INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN COMPUTER, ELECTRICAL, ELECTRONICS AND COMMUNICATION (CTCEEC), 2017, : 534 - 538
  • [25] WASTE CLASSIFICATION USING VISION TRANSFORMERS
    Puchianu, Dan Constantin
    SCIENTIFIC PAPERS-SERIES E-LAND RECLAMATION EARTH OBSERVATION & SURVEYING ENVIRONMENTAL ENGINEERING, 2024, 13 : 727 - 733
  • [26] Distilling Vision Transformers for no-reference Perceptual CT Image Quality Assessment
    Baldeon-Calisto, Maria G.
    Rivera-Velastegui, Francisco
    Lai-Yuen, Susana K.
    Riofrio, Daniel
    Perez-Perez, Noel
    Benitez, Diego
    Flores-Moyano, Ricardo
    MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
  • [27] Integrating Multimodal Information in Large Pretrained Transformers
    Rahman, Wasifur
    Hasan, Md Kamrul
    Lee, Sangwu
    Zadeh, Amir
    Mao, Chengfeng
    Morency, Louis-Philippe
    Hoque, Ehsan
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2359 - 2369
  • [28] Pretrained Transformers for Text Ranking: BERT and Beyond
    Yates, Andrew
    Nogueira, Rodrigo
    Lin, Jimmy
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2666 - 2668
  • [29] Generating Accurate Assert Statements for Unit Test Cases using Pretrained Transformers
    Tufano, Michele
    Drain, Dawn
    Svyatkovskiy, Alexey
    Sundaresan, Neel
    3RD ACM/IEEE INTERNATIONAL CONFERENCE ON AUTOMATION OF SOFTWARE TEST (AST 2022), 2022, : 54 - 64
  • [30] Pseudo Outlier Exposure for Out-of-Distribution Detection using Pretrained Transformers
    Kim, Jaeyoung
    Jung, Kyuheon
    Na, Dongbin
    Jang, Sion
    Park, Eunbin
    Choi, Sungchul
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1469 - 1482