AesCLIP: Multi-Attribute Contrastive Learning for Image Aesthetics Assessment

被引:6
|
作者
Sheng, Xiangfei [1 ,2 ]
Li, Leida [1 ]
Chen, Pengfei [1 ]
Wu, Jinjian [1 ]
Dong, Weisheng [1 ]
Yang, Yuzhe [2 ]
Xu, Liwu [2 ]
Li, Yaqian [2 ]
Shi, Guangming [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian, Peoples R China
[2] OPPO Res Inst, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Image aesthetics assessment; CLIP; Aesthetics attributes; Contrastive Learning;
D O I
10.1145/3581783.3611969
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image aesthetics assessment (IAA) aims at predicting the aesthetic quality of images. Recently, large pre-trained vision-language models, like CLIP, have shown impressive performances on various visual tasks. When it comes to IAA, a straightforward way is to finetune the CLIP image encoder using aesthetic images. However, this can only achieve limited success without considering the uniqueness of multimodal data in the aesthetics domain. People usually assess image aesthetics according to fine-grained visual attributes, e.g., color, light and composition. However, how to learn aesthetics-aware attributes from CLIP-based semantic space has not been addressed before. With this motivation, this paper presents a CLIP-based multi-attribute contrastive learning framework for IAA, dubbed AesCLIP. Specifically, AesCLIP consists of two major components, i.e., aesthetic attribute-based comment classification and attribute-aware learning. The former classifies the aesthetic comments into different attribute categories. Then the latter learns an aesthetic attribute-aware representation by contrastive learning, aiming to mitigate the domain shift from the general visual domain to the aesthetics domain. Extensive experiments have been done by using the pre-trained AesCLIP on four popular IAA databases, and the results demonstrate the advantage of AesCLIP over the state-of-the-arts. The source code will be public at https://github.com/OPPOMKLab/AesCLIP.
引用
收藏
页码:1117 / 1126
页数:10
相关论文
共 50 条
  • [21] Learning and strategy selection in multi-attribute decision making
    Arndt, Broeder
    Tilmann, Betsch
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 547 - 547
  • [22] Mixed Type Multi-attribute Pairwise Comparisons Learning
    Qomariyah, Nunung Nurul
    Kazakov, Dimitar
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 1094 - 1097
  • [23] Fuzzy multi-attribute security risk assessment model
    Gu Yonghao
    Yong, Liu
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON RISK ANALYSIS AND CRISIS RESPONSE, 2007, 2 : 369 - 374
  • [24] Learning Single/Multi-Attribute of Object With Symmetry and Group
    Li, Yong-Lu
    Xu, Yue
    Xu, Xinyu
    Mao, Xiaohan
    Lu, Cewu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 9043 - 9055
  • [25] GRAPH LEARNING FROM MULTI-ATTRIBUTE SMOOTH SIGNALS
    Tugnait, Jitendra K.
    PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [26] MRAM: Multi-scale Regional Attribute-weighting via Meta-learning for Personalized Image Aesthetics Assessment
    Nie, Xixi
    Huang, Shixin
    Gao, Xinbo
    Luo, Jiawei
    Zhang, Guo
    KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [27] USE OF A MULTI-ATTRIBUTE ATTITUDE MODEL IN A STORE IMAGE STUDY
    JAMES, DL
    DURAND, RM
    DREVES, RA
    JOURNAL OF RETAILING, 1976, 52 (02) : 23 - 32
  • [28] A Multi-attribute Controllable Generative Model for Histopathology Image Synthesis
    Ye, Jiarong
    Xue, Yuan
    Liu, Peter
    Zaino, Richard
    Cheng, Keith C.
    Huang, Xiaolei
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 613 - 623
  • [29] A Hierarchical Multi-Attribute Model for Bank Reputational Risk Assessment
    Bohanec, Marko
    Aprile, Giorgio
    Costante, Maria
    Foti, Morena
    Trdin, Nejc
    DSS 2.0 - SUPPORTING DECISION MAKING WITH NEW TECHNOLOGIES, 2014, 261 : 92 - +
  • [30] Attribute-assisted Multimodal Network for Image Aesthetics Assessment
    Zhu, Tong
    Li, Leida
    Chen, Pengfei
    Wu, Jinjian
    Yang, Yuzhe
    Li, Yaqian
    Guo, Yandong
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2477 - 2482