Multi-task Deep Learning for Image Understanding

被引:0
|
作者
Yu, Bo [1 ,2 ,3 ]
Lane, Ian [3 ]
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, State Key Lab Remote Sensing Sci, Beijing 100101, Peoples R China
[2] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
[3] Carnegie Mellon Univ, Moffett Field, CA 94043 USA
关键词
image segmentation; deep learning; multi-task learning; FACE DETECTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Deep learning models can obtain state-of-the-art performance across many speech and image processing tasks, often significantly outperforming earlier methods. In this paper, we attempt to further improve the performance of these models by introducing multi-task training, in which a combined deep learning model is trained for two inter-related tasks. We show that by introducing a secondary task (such as shape identification in the object classification task) we are able to significantly improve the performance of the main task for which the model is trained. Using public datasets we evaluated our approach on two image understanding tasks, image segmentation and object classification. On the image segmentation task, we observed that the multi-task model almost doubled the accuracy of segmentation at the pixel-level (from 18.7% to 35.6%) compared to the single task model, and improved the performance of face-detection by 10.2% (from 70.1% to 80.3%). For the object classification task, we observed a 2.1% improvement in classification accuracy (from 91.6% to 93.7%) compared to a single-task model. The proposed multi-task models obtained significantly higher accuracies than previously published results on these datasets, obtaining 22.0% and 6.2% higher accuracies on the face-detetction and object classification tasks respectively. These results demonstrate the effectiveness of multi-task training of deep learning models for image understanding tasks.
引用
收藏
页码:37 / 42
页数:6
相关论文
共 50 条
  • [41] Bacterial image analysis using multi-task deep learning approaches for clinical microscopy
    Chin, Shuang Yee
    Dong, Jian
    Hasikin, Khairunnisa
    Ngui, Romano
    Lai, Khin Wee
    Yeoh, Pauline Shan Qing
    Wu, Xiang
    PeerJ Computer Science, 2024, 10
  • [42] Multi-task Deep Learning for Colon Cancer Grading
    Thi Le Trinh Vuong
    Lee, Daigeun
    Kwak, Jin Tae
    Kim, Kyungeun
    2020 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2020,
  • [43] Saliency-Regularized Deep Multi-Task Learning
    Bai, Guangji
    Zhao, Liang
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 15 - 25
  • [44] Unsupervised learning of multi-task deep variational model
    Tan, Lu
    Li, Ling
    Liu, Wan-Quan
    An, Sen-Jian
    Munyard, Kylie
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [45] Learning multi-task local metrics for image annotation
    Xing Xu
    Atsushi Shimada
    Hajime Nagahara
    Rin-ichiro Taniguchi
    Multimedia Tools and Applications, 2016, 75 : 2203 - 2231
  • [46] In Defense of the Unitary Scalarization for Deep Multi-Task Learning
    Kurin, Vitaly
    De Palma, Alessandro
    Kostrikov, Ilya
    Whiteson, Shimon
    Kumar, M. Pawan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [47] Facial Landmark Detection by Deep Multi-task Learning
    Zhang, Zhanpeng
    Luo, Ping
    Loy, Chen Change
    Tang, Xiaoou
    COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 94 - 108
  • [48] Multi-Task Deep Neural Networks for Natural Language Understanding
    Liu, Xiaodong
    He, Pengcheng
    Chen, Weizhu
    Gao, Jianfeng
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4487 - 4496
  • [49] Multi-Task Learning for Screen Content Image Coding
    Heris, Rashid Zamanshoar
    Bajic, Ivan V.
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [50] MULTI-TASK RANK LEARNING FOR IMAGE QUALITY ASSESSMENT
    Xu, Long
    Li, Jia
    Lin, Weisi
    Zhang, Yongbing
    Ma, Lin
    Fang, Yuming
    Zhang, Yun
    Yan, Yihua
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1339 - 1343