Message-Driven Generative Music Steganography Using MIDI-GAN

Cited by: 0
Authors
Su, Zhaopin [1, 2, 3, 4]
Zhang, Guofu [1, 2, 3, 4]
Shi, Zhiyuan [1, 2, 3, 4]
Hu, Donghui [1, 2, 3, 4]
Zhang, Weiming [5]
Affiliations
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230601, Peoples R China
[2] Minist Educ, Engn Res Ctr Safety Crit Ind Measurement & Control, Hefei 230009, Peoples R China
[3] Hefei Univ Technol, Intelligent Interconnected Syst Lab Anhui Prov, Hefei 230009, Peoples R China
[4] Hefei Univ Technol, Anhui Prov Key Lab Ind Safety & Emergency Technol, Hefei 230601, Peoples R China
[5] Univ Sci & Technol China, Sch Cyber Sci & Technol, Hefei 230026, Peoples R China
Keywords
Steganography; Generators; Videos; Speech recognition; Multiple signal classification; Adversarial machine learning; Music steganography; generative adversarial networks; MIDI; chord numbers; statistical distribution; QUANTIZATION INDEX MODULATION; STEGANALYSIS; INFORMATION
DOI
10.1109/TDSC.2024.3372139
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Generative steganography has become a popular research topic in the field of generative AI, with work on generative image steganography and synthetic speech steganography. However, music files differ from image and speech files in their statistical properties and knowledge representation, and building a reversible transform between a secret message and music is also challenging. Therefore, existing generative steganographic methods that are effective for images or speech may not be directly effective for music. In this article, we propose a generative music steganography method, named MIDI-GAN, that uses generative adversarial networks (GANs) to generate an artificial stego MIDI file from a secret message. The resulting stego MIDI file is small, melodically pleasing, and undetectable by deep learning-based steganalyzers. Unlike previous generative image/speech steganography, the stego MIDI can also be represented as a sequence of chord numbers, which makes it difficult for an observer to detect or to find grounds for suspicion. Moreover, these chord numbers can be transmitted over any other digital or physical medium to evade detection. Specifically, MIDI-GAN comprises a generator, a discriminator, and an extractor. The generator synthesizes a stego MIDI file from the secret message, while the discriminator pushes the statistical distribution of the stego MIDI file as close as possible to that of authentic, rather than synthetic, MIDI files. The extractor recovers the secret message from the stego MIDI file or the chord sequence. Experimental results demonstrate that MIDI-GAN offers high concealment and security: the stego MIDI files it generates closely resemble authentic MIDI files and maintain strong resistance to deep learning-based steganalysis.
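The abstract notes that a message can be carried as a sequence of chord numbers and recovered again by an extractor. The round trip can be sketched with a toy example in Python. This is not MIDI-GAN: it replaces the learned generator and extractor with a fixed lookup, and the 16-entry chord vocabulary and the function names `message_to_chords` and `chords_to_message` are assumptions made purely for illustration. A fixed mapping like this offers none of the statistical concealment the abstract claims for the GAN-based method.

```python
# Toy illustration only: a fixed nibble-to-chord lookup, NOT the learned
# generator/extractor of MIDI-GAN. The 16-chord vocabulary is an assumption.
NUM_CHORDS = 16  # hypothetical vocabulary size: 4 bits per chord number


def message_to_chords(message: bytes) -> list[int]:
    """Split each byte into two 4-bit values and treat each as a chord number."""
    chords = []
    for byte in message:
        chords.append(byte >> 4)    # high nibble
        chords.append(byte & 0x0F)  # low nibble
    return chords


def chords_to_message(chords: list[int]) -> bytes:
    """Invert message_to_chords by recombining consecutive chord numbers."""
    if len(chords) % 2 != 0:
        raise ValueError("chord sequence must have even length")
    if any(not 0 <= c < NUM_CHORDS for c in chords):
        raise ValueError("chord number out of range")
    return bytes((hi << 4) | lo for hi, lo in zip(chords[0::2], chords[1::2]))
```

Calling `chords_to_message(message_to_chords(secret))` returns `secret` unchanged for any byte string, which is the reversibility property the abstract describes as challenging to obtain when the carrier must also sound like authentic music.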
Pages: 5196-5207
Page count: 12