A DUAL-PATH FRAMEWORK WITH FREQUENCY-AND-TIME EXCITED NETWORK FOR ANOMALOUS SOUND DETECTION

被引:0
|
作者
Zhang, Yucong [1 ,2 ]
Liu, Juan [1 ]
Tian, Yao [3 ]
Liu, Haifeng [4 ]
Li, Ming [1 ,2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Duke Kunshan Univ, Suzhou Municipal Key Lab Multimodal Intelligent, Kunshan, Peoples R China
[3] OPPO, Data & AI Engn Syst, Beijing, Peoples R China
[4] Univ Sci & Technol China, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
Anomalous sound detection; squeeze and excitation; frequency pattern analysis; temporal periodicity analysis;
D O I
10.1109/ICASSP48485.2024.10448126
中图分类号
学科分类号
摘要
In contrast to human speech, machine-generated sounds of the same type often exhibit consistent frequency characteristics and discernible temporal periodicity. However, leveraging these dual attributes in anomaly detection remains relatively under-explored. In this paper, we propose an automated dual-path framework that learns prominent frequency and temporal patterns for diverse machine types. One pathway uses a novel Frequency-and-Time Excited Network (FTE-Net) to learn the salient features across frequency and time axes of the spectrogram. It incorporates a Frequency-and-Time Chunkwise Encoder (FTC-Encoder) and an excitation network. The other pathway uses a 1D convolutional network for utterance-level spectrum. Experimental results on the DCASE 2023 task 2 dataset show the state-of-the-art performance of our proposed method. Moreover, visualizations of the intermediate feature maps in the excitation network are provided to illustrate the effectiveness of our method.
引用
收藏
页码:1266 / 1270
页数:5
相关论文
共 50 条
  • [21] Intracranial Aneurysm Segmentation with a Dual-Path Fusion Network
    Wang, Ke
    Zhang, Yong
    Fang, Bin
    BIOENGINEERING-BASEL, 2025, 12 (02):
  • [22] Dual-path multi-branch feature residual network for salient object detection
    Chen, Zhensen
    Lu, Yaosheng
    Long, Shun
    Bai, Jieyun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [23] DPNET: DUAL-PATH NETWORK FOR EFFICIENT OBJECT DETECTION WITH LIGHTWEIGHT SELF-ATTENTION
    Shi, Huimin
    Zhou, Quan
    Ni, Yinghao
    Wu, Xiaofu
    Latecki, Longin Jan
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 771 - 775
  • [24] A Digital Implementation of a Dual-Path Time-to-Time Integrator
    Ali-Bakhshian, Mohammad
    Roberts, Gordon W.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2012, 59 (11) : 2578 - 2591
  • [25] Unsupervised Detection via Artificial Dual-Path Transmission
    Hussain, Ahmed
    Ibrahim, Mohamed Salah
    Sidiropoulos, Nicholas D.
    2022 56TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2022, : 538 - 542
  • [26] Dual-Path Recurrent Network for Image Super-Resolution
    Li, Xinyao
    Zhang, Dongyang
    Shao, Jie
    NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V, 2019, 1143 : 99 - 108
  • [27] A lightweight framework for unsupervised anomalous sound detection based on selective learning of time-frequency domain features
    Wang, Yawei
    Zhang, Qiaoling
    Zhang, Weiwei
    Zhang, Yi
    APPLIED ACOUSTICS, 2025, 228
  • [28] Single Image Dehazing via Dual-Path Recurrent Network
    Zhang, Xiaoqin
    Jiang, Runhua
    Wang, Tao
    Luo, Wenhan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5211 - 5222
  • [29] A DEEP DUAL-PATH NETWORK FOR IMPROVED MAMMOGRAM IMAGE PROCESSING
    Li, Heyi
    Chen, Dongdong
    Nailon, William H.
    Davies, Mike E.
    Laurenson, Dave
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1224 - 1228
  • [30] DMPNet: dual-path and multi-scale pansharpening network
    Kaur, Gurpreet
    Malhotra, Manisha
    Singh, Dilbag
    Singhal, Sunita
    FRONTIERS IN COMPUTER SCIENCE, 2025, 6