Message-Driven Generative Music Steganography Using MIDI-GAN

Cited by: 0

Authors:

Su, Zhaopin ^{[1,2,3,4]}

Zhang, Guofu ^{[1,2,3,4]}

Shi, Zhiyuan ^{[1,2,3,4]}

Hu, Donghui ^{[1,2,3,4]}

Zhang, Weiming ^{[5]}

Affiliations:

[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230601, Peoples R China

[2] Minist Educ, Engn Res Ctr Safety Crit Ind Measurement & Control, Hefei 230009, Peoples R China

[3] Hefei Univ Technol, Intelligent Interconnected Syst Lab Anhui Prov, Hefei 230009, Peoples R China

[4] Hefei Univ Technol, Anhui Prov Key Lab Ind Safety & Emergency Technol, Hefei 230601, Peoples R China

[5] Univ Sci & Technol China, Sch Cyber Sci & Technol, Hefei 230026, Peoples R China

Source:

IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING | 2024 / Vol. 21 / No. 06

Keywords:

Steganography; Generators; Videos; Speech recognition; Multiple signal classification; Adversarial machine learning; Music steganography; generative adversarial networks; MIDI; chord numbers; statistical distribution; QUANTIZATION INDEX MODULATION; STEGANALYSIS; INFORMATION;

DOI:

10.1109/TDSC.2024.3372139

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Generative steganography has become a popular research topic in generative AI, with existing work covering generative image and synthetic speech steganography. However, music files differ from image and speech files in their statistical properties and knowledge representation, and the reversible transform between a secret message and music is itself challenging. Existing generative steganographic methods that are effective for images or speech may therefore not carry over directly to music. In this article, we propose a generative music steganography method, named MIDI-GAN, that uses generative adversarial networks (GANs) to turn a secret message into an artificial stego MIDI file. The resulting stego MIDI file is small, melodically pleasing, and undetectable to deep learning-based steganalyzers. Unlike previous generative image/speech steganography, the stego MIDI can also be represented as a sequence of chord numbers, which gives an observer little ground for suspicion. Moreover, these chord numbers can be transmitted over any other digital or physical medium to evade detection. Specifically, MIDI-GAN comprises a generator, a discriminator, and an extractor. The generator synthesizes a stego MIDI file from the secret message, while the discriminator pushes the statistical distribution of the stego MIDI file as close as possible to that of authentic, rather than synthetic, MIDI files. The extractor recovers the secret message from the stego MIDI file or chord sequence. Experimental results demonstrate that MIDI-GAN offers high concealment and security: the stego MIDI files it generates closely resemble authentic MIDI files and maintain excellent resistance to deep learning-based steganalysis.
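
The abstract requires that the mapping from secret message to chord sequence be reversible, so that an extractor can recover the message exactly. The sketch below illustrates only that round-trip property. It is not the MIDI-GAN generator or extractor: a fixed 4-bit lookup is swapped in for the learned networks, and the names `embed`, `extract`, and `BITS_PER_CHORD` are hypothetical and do not come from the paper.

```python
# Toy stand-in, NOT the MIDI-GAN networks: a fixed-width mapping between
# message bits and chord numbers, used only to show a lossless round trip.
BITS_PER_CHORD = 4  # assumption: 16 possible chord numbers (0..15)


def message_to_bits(message: bytes) -> list:
    """Expand each byte into 8 bits, most significant bit first."""
    return [(byte >> shift) & 1 for byte in message for shift in range(7, -1, -1)]


def bits_to_message(bits: list) -> bytes:
    """Pack groups of 8 bits (most significant first) back into bytes."""
    out = bytearray()
    for i in range(0, len(bits), 8):
        byte = 0
        for bit in bits[i:i + 8]:
            byte = (byte << 1) | bit
        out.append(byte)
    return bytes(out)


def embed(message: bytes) -> list:
    """Hypothetical 'generator' stand-in: message -> sequence of chord numbers."""
    bits = message_to_bits(message)
    chords = []
    for i in range(0, len(bits), BITS_PER_CHORD):
        value = 0
        for bit in bits[i:i + BITS_PER_CHORD]:
            value = (value << 1) | bit
        chords.append(value)
    return chords


def extract(chords: list) -> bytes:
    """Hypothetical 'extractor' stand-in: chord numbers -> original message."""
    bits = []
    for chord in chords:
        bits.extend((chord >> shift) & 1 for shift in range(BITS_PER_CHORD - 1, -1, -1))
    return bits_to_message(bits)


if __name__ == "__main__":
    chords = embed(b"hi")
    print(chords)           # chord-number sequence carrying the message
    print(extract(chords))  # recovered message
```

A fixed mapping like this gives no protection against steganalysis. In the method the abstract describes, the discriminator is what pushes the generated MIDI toward the statistics of authentic files.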


Pages: 5196-5207

Number of pages: 12

7 items in total

[1] Using shared arrays in message-driven parallel programs
Miller, Phil
Becker, Aaron
Kale, Laxmikant
PARALLEL COMPUTING, 2012, 38 (1-2) : 66 - 74
[2] Spectrally Efficient Anti-jamming System Design using Message-Driven Frequency Hopping
Zhang, Lei
Ren, Jian
Li, Tongtong
2009 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-8, 2009, : 580 - 584
[3] BEHM-GAN: Bandwidth Extension of Historical Music Using Generative Adversarial Networks
Moliner, Eloi
Valimaki, Vesa
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 943 - 956
[4] Deep Learning-driven Explainable AI using Generative Adversarial Network (GAN)
Maan, Jitendra
2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
[5] Developing a data-driven technology roadmapping method using generative adversarial network (GAN)
Kim, Sunhye
Jang, Hyejin
Yoon, Byungun
COMPUTERS IN INDUSTRY, 2023, 145
[6] Real-Time Emotion-Based Piano Music Generation Using Generative Adversarial Network (GAN)
Zheng, Lijun
Li, Chenglong
IEEE ACCESS, 2024, 12 : 87489 - 87500
[7] GAN computers generate arts? A survey on visual arts, music, and literary text generation using generative adversarial network
Shahriar, Sakib
DISPLAYS, 2022, 73
