Multi-Modal Learning over User-Contributed Content from Cross-Domain Social Media

被引:0
|
作者
Lee, Wen-Yu [1 ]
机构
[1] Natl Taiwan Univ, Taipei, Taiwan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of the research is to discover and summarize data from the emerging social media into information of interests. Specifically, leveraging user-contributed data from cross-domain social media, the idea is to perform multi-modal learning for a given photo, aiming to present people's description or comments, geographical information, and events of interest, closely related to the photo. These information then can be used for various purposes, such as being a real-time guide for the tourists to improve the quality of tourism. As a result, this research investigates modern challenges of image annotation, image retrieval, and cross-media mining, followed by presenting promising ways to conquer the challenges.
引用
收藏
页码:4301 / 4302
页数:2
相关论文
共 50 条
  • [1] MmAP : Multi-Modal Alignment Prompt for Cross-Domain Multi-Task Learning
    Xin, Yi
    Du, Junlong
    Wang, Qiang
    Yan, Ke
    Ding, Shouhong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 16076 - 16084
  • [2] Multi-modal Instance Refinement for Cross-Domain Action Recognition
    Qing, Yuan
    Wu, Naixing
    Wan, Shaohua
    Duan, Lixin
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 284 - 296
  • [3] Improving Cross-domain, Cross-lingual and Multi-modal Deception Detection
    Panda, Subhadarshi
    Levitan, Sarah Ita
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP, 2022, : 383 - 390
  • [4] Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval
    Fang, Xiang
    Liu, Daizong
    Zhou, Pan
    Hu, Yuchong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7517 - 7532
  • [5] A Multi-Domain and Multi-Modal Representation Disentangler for Cross-Domain Image Manipulation and Classification
    Yang, Fu-En
    Chang, Jing-Cheng
    Tsai, Chung-Chi
    Wang, Yu-Chiang Frank
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 2795 - 2807
  • [6] A Cross-Domain Exploration of Audio and Textual Data for Multi-Modal Emotion Detection
    Haque, Mohd Ariful
    George, Roy
    Rifat, Rakib Hossain
    Uddin, Md Shihab
    Kamal, Marufa
    Gupta, Kishor Datta
    17TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2024, 2024, : 375 - 381
  • [7] A privacy-preserving framework with multi-modal data for cross-domain recommendation
    Wang, Li
    Sang, Lei
    Zhang, Quangui
    Wu, Qiang
    Xu, Min
    KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [8] Multi-Modal Self-Supervised Learning for Cross-Domain One-Shot Bearing Fault Diagnosis
    Chen, Xiaohan
    Xue, Yihao
    Huang, Mengjie
    Yang, Rui
    IFAC PAPERSONLINE, 2024, 58 (04): : 746 - 751
  • [9] COOPNET: MULTI-MODAL COOPERATIVE GENDER PREDICTION IN SOCIAL MEDIA USER PROFILING
    Li, Lin
    Hu, Kaixi
    Zheng, Yunpei
    Liu, Jianquan
    Lee, Kong Aik
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4310 - 4314
  • [10] Deep Transfer Learning for Social Media Cross-Domain Sentiment Classification
    Zhao, Chuanjun
    Wang, Suge
    Li, Deyu
    SOCIAL MEDIA PROCESSING, SMP 2017, 2017, 774 : 232 - 243