Deep Learning-Based CNN Multi-Modal Camera Model Identification for Video Source Identification

被引:0
|
作者
Singh S. [1 ]
Sehgal V.K. [1 ]
机构
[1] Department of Computer Science and Engineering and Information Technology, Jaypee University of Information Technology, Solan, Himachal Pradesh, Waknaghat
来源
Informatica (Slovenia) | 2023年 / 47卷 / 03期
关键词
audio forensics; camera model identification; convolutional neural networks; video forensics;
D O I
10.31449/inf.v47i3.4392
中图分类号
学科分类号
摘要
Here is a high demand for multimedia forensics analysts to locate the original camera of photographs and videos that are being taken nowadays. There has been considerable progress in the technology of identifying the source of data, which has enabled conflict resolutions involving copyright infringements and identifying those responsible for serious offenses to be resolved. Video source identification is a challenging task nowadays due to easily available editing tools. This study focuses on the issue of identifying the camera model used to acquire video sequences used in this research that is, identifying the type of camera used to capture the video sequence under investigation. For this purpose, we created two distinct CNN-based camera model recognition techniques to be used in an innovative multi-modal setting. The proposed multi-modal methods combine audio and visual information in order to address the identification issue, which is superior to mono-modal methods which use only the visual or audio information from the investigated video to provide the identification information. According to legal standards of admissible evidence and criminal procedure, Forensic Science involves the application of science to the legal aspects of criminal and civil law, primarily during criminal investigations, in line with the standards of admissible evidence and criminal procedure in the law. It is responsible for collecting, preserving, and analyzing scientific evidence in the course of an investigation. It has become a critical part of criminology as a result of the rapid rise in crime rates over the last few decades. Our proposed methods were tested on a well-known dataset known as the Vision dataset, which contains about 2000 video sequences gathered from various devices of varying types. It is conducted experiments on social media platforms such as YouTube and WhatsApp as well as native videos directly obtained from their acquisition devices by the means of their acquisition devices. According to the results of the study, the multimodal approaches suggest that they greatly outperform their mono-modal equivalents in addressing the challenge at hand, constituting an effective approach to address the challenge and offering the possibility of even more difficult circumstances in the future. © 2023 Slovene Society Informatika. All rights reserved.
引用
收藏
页码:417 / 430
页数:13
相关论文
共 50 条
  • [1] CNN-Based Multi-Modal Camera Model Identification on Video Sequences
    Dal Cortivo, Davide
    Mandelli, Sara
    Bestagini, Paolo
    Tubaro, Stefano
    JOURNAL OF IMAGING, 2021, 7 (08)
  • [2] DeepMEF: A Deep Model Ensemble Framework for Video Based Multi-modal Person Identification
    Dong, Chuanqi
    Gu, Zheng
    Huang, Zhonghao
    Ji, Wen
    Huo, Jing
    Gao, Yang
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2531 - 2534
  • [3] Effective deep learning-based multi-modal retrieval
    Wang, Wei
    Yang, Xiaoyan
    Ooi, Beng Chin
    Zhang, Dongxiang
    Zhuang, Yueting
    VLDB JOURNAL, 2016, 25 (01): : 79 - 101
  • [4] Effective deep learning-based multi-modal retrieval
    Wei Wang
    Xiaoyan Yang
    Beng Chin Ooi
    Dongxiang Zhang
    Yueting Zhuang
    The VLDB Journal, 2016, 25 : 79 - 101
  • [5] An efficient deep learning-based video captioning framework using multi-modal features
    Varma, Soumya
    James, Dinesh Peter
    EXPERT SYSTEMS, 2021,
  • [6] RGB-D BASED MULTI-MODAL DEEP LEARNING FOR FACE IDENTIFICATION
    Lin, Tzu-Ying
    Chiu, Ching-Te
    Tang, Ching-Tung
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1668 - 1672
  • [7] A deep learning-based automated tool for the identification of referable retinal pathology from multi-modal imaging sources
    Gao, Qitong
    Amason, Joshua
    Cousins, Scott
    Pajic, Miroslav
    Hadziahmetovic, Majda
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2021, 62 (08)
  • [8] Applying deep learning-based multi-modal for detection of coronavirus
    Rani, Geeta
    Oza, Meet Ganpatlal
    Dhaka, Vijaypal Singh
    Pradhan, Nitesh
    Verma, Sahil
    Rodrigues, Joel J. P. C.
    MULTIMEDIA SYSTEMS, 2022, 28 (04) : 1251 - 1262
  • [9] Applying deep learning-based multi-modal for detection of coronavirus
    Geeta Rani
    Meet Ganpatlal Oza
    Vijaypal Singh Dhaka
    Nitesh Pradhan
    Sahil Verma
    Joel J. P. C. Rodrigues
    Multimedia Systems, 2022, 28 : 1251 - 1262
  • [10] Multi-modal learning-based algae phyla identification using image and particle modalities
    Kwon, Do Hyuck
    Lee, Min Jun
    Jeong, Heewon
    Park, Sanghun
    Cho, Kyung Hwa
    WATER RESEARCH, 2025, 275