AesCLIP: Multi-Attribute Contrastive Learning for Image Aesthetics Assessment

被引:6
|
作者
Sheng, Xiangfei [1 ,2 ]
Li, Leida [1 ]
Chen, Pengfei [1 ]
Wu, Jinjian [1 ]
Dong, Weisheng [1 ]
Yang, Yuzhe [2 ]
Xu, Liwu [2 ]
Li, Yaqian [2 ]
Shi, Guangming [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian, Peoples R China
[2] OPPO Res Inst, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Image aesthetics assessment; CLIP; Aesthetics attributes; Contrastive Learning;
D O I
10.1145/3581783.3611969
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image aesthetics assessment (IAA) aims at predicting the aesthetic quality of images. Recently, large pre-trained vision-language models, like CLIP, have shown impressive performances on various visual tasks. When it comes to IAA, a straightforward way is to finetune the CLIP image encoder using aesthetic images. However, this can only achieve limited success without considering the uniqueness of multimodal data in the aesthetics domain. People usually assess image aesthetics according to fine-grained visual attributes, e.g., color, light and composition. However, how to learn aesthetics-aware attributes from CLIP-based semantic space has not been addressed before. With this motivation, this paper presents a CLIP-based multi-attribute contrastive learning framework for IAA, dubbed AesCLIP. Specifically, AesCLIP consists of two major components, i.e., aesthetic attribute-based comment classification and attribute-aware learning. The former classifies the aesthetic comments into different attribute categories. Then the latter learns an aesthetic attribute-aware representation by contrastive learning, aiming to mitigate the domain shift from the general visual domain to the aesthetics domain. Extensive experiments have been done by using the pre-trained AesCLIP on four popular IAA databases, and the results demonstrate the advantage of AesCLIP over the state-of-the-arts. The source code will be public at https://github.com/OPPOMKLab/AesCLIP.
引用
收藏
页码:1117 / 1126
页数:10
相关论文
共 50 条
  • [31] Multi-Attribute Auction-Based Grouped Federated Learning
    Lu, Renhao
    Yang, Hongwei
    Wang, Yan
    He, Hui
    Li, Qiong
    Zhong, Xiaoxiong
    Zhang, Weizhe
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (03) : 1056 - 1071
  • [32] A Multi-attribute Assessment Method for E-Commerce Risks
    Zhou, Caiying
    Huang, Longjun
    PROCEEDINGS OF INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY (ISCSCT 2009), 2009, : 285 - 288
  • [33] Multi-attribute value approach to business airplane product assessment
    Downen, T.D. (downen@alum.mit.edu), 1600, American Institute of Aeronautics and Astronautics Inc. (42):
  • [34] Multi-attribute value approach to business airplane product assessment
    Downen, TD
    Nightingale, DJ
    Magee, CL
    JOURNAL OF AIRCRAFT, 2005, 42 (06): : 1387 - 1395
  • [35] Risk assessment with multi-attribute utility theory for building projects
    Campos V.R.
    Moreira D.J.S.
    Journal of Building Pathology and Rehabilitation, 2022, 7 (1)
  • [36] Multi-attribute assessment of a river electromobility concept in the Amazon region
    Bonilla, Rosa Zuloeta
    Bhandari, Ramchandra
    Rodarte, Aldo Perez
    ENERGY FOR SUSTAINABLE DEVELOPMENT, 2021, 61 : 139 - 152
  • [37] Multi-attribute auctions with different types of attributes: Enacting properties in multi-attribute auctions
    Pla, Albert
    Lopez, Beatriz
    Murillo, Javier
    Maudet, Nicolas
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (10) : 4829 - 4843
  • [38] Multi-attribute assessment of acceptability of operations in the pulp and paper industries
    Mikkilä, M
    Kolehmainen, O
    Pukkala, T
    FOREST POLICY AND ECONOMICS, 2005, 7 (02) : 227 - 243
  • [39] Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries
    Hasan, Shohedul
    Thirumuruganathan, Saravanan
    Augustine, Jees
    Koudas, Nick
    Das, Gautam
    SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, : 1035 - 1050
  • [40] Deep Learning Method for Multi-Attribute Analysis of Fingerprint Images
    Maiti, Diptadip
    Basak, Madhuchhanda
    Das, Debashis
    COMPUTER SCIENCE JOURNAL OF MOLDOVA, 2024, 32 (02) : 199 - 222