Heterogeneous Dual-Task Clustering with Visual-Textual Information

被引:3
|
作者
Yan, Xiaoqiang [1 ]
Mao, Yiqiao [1 ]
Hu, Shizhe [1 ]
Ye, Yangdong [1 ]
机构
[1] Zhengzhou Univ, Sch Informat Engn, Zhengzhou, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1137/1.9781611976236.74
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing visual-textual cross-modal clustering techniques focus on finding a clustering partition of different modalities by dealing with each modality dependently or integrating multiple modalities into a shared space, which may results in unsatisfactory performance due to the heterogeneous gap of different modalities. Aiming at this problem, we propose a novel heterogeneous dual-task clustering (HDC) method, which is capable of exploring high-level relatedness between visual and textual data to improve the performance of individual task. Our intuition is that although the visual and textual data are heterogenous to each other, they may share related high-level semantics and rich latent correlations, which can lead to improved performance if we treat the clustering of visual and textual data as different but related learning tasks. Specifically, the problem of heterogeneous dual-task clustering is formulated as an information theoretic function, in which the low-level information in each modality and high-level relatedness between multiple modalities are maximally preserved. Then, a progressive optimization method is proposed to ensure a local optimal solution. Extensive experiments show noticeable performance of the HDC approach in comparison with several state-of-the-art baselines.
引用
收藏
页码:658 / 666
页数:9
相关论文
共 50 条
  • [41] Joint Visual-Textual Sentiment Analysis with Deep Neural Networks
    You, Quanzeng
    Luo, Jiebo
    Jin, Hailin
    Yang, Jianchao
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1071 - 1074
  • [42] Visual-Textual Encounters with a German Grandfather: The Work of Angela Findlay
    Pettitt, Joanne
    JEWISH FILM & NEW MEDIA-AN INTERNATIONAL JOURNAL, 2023, 11 (01)
  • [43] Task shifting in dual-task settings
    Hsieh, S
    PERCEPTUAL AND MOTOR SKILLS, 2002, 94 (02) : 407 - 414
  • [44] Hybrid Representation and Decision Fusion towards Visual-textual Sentiment
    Yin, Chunyong
    Zhang, Sun
    Zeng, Qingkui
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (03)
  • [45] Visual-Textual Alignment for Generalizable Person Reidentification in Internet of Things
    Liu, Xiaosheng
    Zhou, Zhiheng
    Niu, Chang
    Wu, Qingru
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (15) : 13865 - 13875
  • [46] THE FACILITATION OF DUAL-TASK PERFORMANCE USING A PERIPHERAL VISUAL HORIZON DEVICE
    BELLENKES, AH
    AVIATION SPACE AND ENVIRONMENTAL MEDICINE, 1984, 55 (05): : 456 - 456
  • [47] Evaluating the Measurement Properties of the ScanCourse, a Dual-Task Assessment of Visual Scanning
    Lund, Paige
    Moir, Caitlyn
    Kristalovich, Lisa
    Ben Mortenson, W.
    AMERICAN JOURNAL OF OCCUPATIONAL THERAPY, 2020, 74 (01):
  • [48] Effect of task difficulty on dual-task cost during dual-task walking in people with multiple sclerosis
    Gulsen, Cagri
    Soke, Fatih
    Aydin, Fatma
    Gulsen, Elvan Ozcan
    Yilmaz, Oznur
    Kocer, Bilge
    Curuk, Etem
    Demirkaya, Seref
    Yucesan, Canan
    GAIT & POSTURE, 2024, 114 : 95 - 100
  • [49] Effect of Auditory or Visual Working Memory Training on Dual-Task Interference
    Kimura, Takehide
    Matsuura, Ryouta
    MOTOR CONTROL, 2020, 24 (02) : 304 - 317
  • [50] Dual-Task Rehabilitation: The Science of Prioritization, Compensation, and Rehabilitating Dual-Task Tolerance in Geriatrics
    Studer, Mike
    TOPICS IN GERIATRIC REHABILITATION, 2018, 34 (01) : 54 - 64