Hybrid CNN Based Attention with Category Prior for User Image Behavior Modeling

被引:2
|
作者
Chen, Xin [1 ]
Tang, Qingtao [1 ]
Hu, Ke [1 ]
Xu, Yue [1 ]
Qiu, Shihang [1 ]
Cheng, Jia [1 ]
Lei, Jun [1 ]
机构
[1] Meituan, Beijing, Peoples R China
关键词
online advertising; user modeling; image behavior;
D O I
10.1145/3477495.3531854
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
User historical behaviors are proved useful for Click Through Rate (CTR) prediction in online advertising system. In Meituan, one of the largest e-commerce platform in China, an item is typically displayed with its image and whether a user clicks the item or not is usually influenced by its image, which implies that user's image behaviors are helpful for understanding user's visual preference and improving the accuracy of CTR prediction. Existing user image behavior models typically use a two-stage architecture, which extracts visual embeddings of images through off-the-shelf Convolutional Neural Networks (CNNs) in the first stage, and then jointly trains a CTR model with those visual embeddings and non-visual features. We find that the two-stage architecture is sub-optimal for CTR prediction. Meanwhile, precisely labeled categories in online ad systems contain abundant visual prior information, which can enhance the modeling of user image behaviors. However, off-the-shelf CNNs without category prior may extract category unrelated features, limiting CNN's expression ability. To address the two issues, we propose a hybrid CNN based attention module, unifying user's image behaviors and category prior, for CTR prediction. Our approach achieves significant improvements in both online and offline experiments on a billion scale real serving dataset.
引用
收藏
页码:2336 / 2340
页数:5
相关论文
共 50 条
  • [41] MODELING USER-PERCEIVED RELIABILITY BASED ON USER BEHAVIOR GRAPHS
    Wang, Dazhi
    Trivedi, Kishor S.
    INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING, 2009, 16 (04) : 303 - 329
  • [42] Sentiment Analysis of Review Text Based on BiGRU-Attention and Hybrid CNN
    Zhu, Qiannan
    Jiang, Xiaofan
    Ye, Renzhen
    IEEE ACCESS, 2021, 9 : 149077 - 149088
  • [43] An Attention-based Hybrid LSTM-CNN Model for Arrhythmias Classification
    Liu, Fan
    Zhou, Xingshe
    Wang, Tianben
    Cao, Jinli
    Wang, Zhu
    Wang, Hua
    Zhang, Yanchun
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [44] Deep CNN based Image Compression with Redundancy Minimization via Attention Guidance
    Mishra, Dipti
    Singh, Satish Kumar
    Singh, Rajat Kumar
    NEUROCOMPUTING, 2022, 507 : 397 - 411
  • [45] Image Aesthetics Assessment Based on User Social Behavior
    Liu, Huihui
    Cui, Chaoran
    Ma, Yuling
    Shi, Cheng
    Xu, Yongchao
    Yin, Yilong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 755 - 766
  • [46] CTNet: hybrid architecture based on CNN and transformer for image inpainting detection
    Xiao, Fengjun
    Zhang, Zhuxi
    Yao, Ye
    MULTIMEDIA SYSTEMS, 2023, 29 (06) : 3819 - 3832
  • [47] An Ultralightweight Hybrid CNN Based on Redundancy Removal for Hyperspectral Image Classification
    Ma, Xiaohu
    Wang, Wuli
    Li, Wei
    Wang, Jianbu
    Ren, Guangbo
    Ren, Peng
    Liu, Baodi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 12
  • [48] Medical Image Classification with a Hybrid SSM Model Based on CNN and Transformer
    Hu, Can
    Cao, Ning
    Zhou, Han
    Guo, Bin
    ELECTRONICS, 2024, 13 (15)
  • [49] CTNet: hybrid architecture based on CNN and transformer for image inpainting detection
    Fengjun Xiao
    Zhuxi Zhang
    Ye Yao
    Multimedia Systems, 2023, 29 (6) : 3819 - 3832
  • [50] Tensor-Based Emotional Category Classification via Visual Attention-Based Heterogeneous CNN Feature Fusion
    Moroto, Yuya
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    SENSORS, 2020, 20 (07)