Real-Time Bidding with Soft Actor-Critic Reinforcement Learning in Display Advertising

被引:0
|
作者
Yakovleva, Dania [1 ]
Popov, Artem [1 ]
Filchenkov, Andrey [1 ]
机构
[1] ITMO Univ, St Petersburg, Russia
来源
PROCEEDINGS OF THE 2019 25TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT) | 2019年
关键词
D O I
10.23919/fruct48121.2019.8981496
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The main task of advertising companies is to sell goods and services interesting to the user. Online auctions are the main mechanism for selecting ads to the user. Dynamic bidding allows advertiser to automatically calculate the bid that is profitable to set to maximize goals (for example, the number of clicks on an ad), depending on the user who sees the ad. In this case the advertiser must specify the budget of the ad and the optimization goal. During the advertising campaign the bid for each impression will be calculated by a special algorithm. In this paper, we propose a novel algorithm for calculating the dynamic bid for each impression of the ad in order to maximize the advertiser's goals, which takes into account settings of the advertising campaign, budget, the ad lifetime and other parameters. This task is formulated as reinforcement learning problem, where states are the status of auction and parameters of the advertising campaign, the actions are bidding for each ad based on the input state. Every ad has an agent who observes the states all the time and calculates the bid for the impression. We evaluated the proposed model on real advertising campaigns in a large social network. Our method achieved average 26% improvement in comparison with the state-of-the-art approach.
引用
收藏
页码:373 / 382
页数:10
相关论文
共 50 条
  • [21] Real-time bidding with multi-agent reinforcement learning in multi-channel display advertising
    Chen Chen
    Gao Wang
    Baoyu Liu
    Siyao Song
    Keming Mao
    Shiyu Yu
    Jingyu Liu
    Neural Computing and Applications, 2025, 37 (1) : 499 - 511
  • [22] Multi-actor mechanism for actor-critic reinforcement learning
    Li, Lin
    Li, Yuze
    Wei, Wei
    Zhang, Yujia
    Liang, Jiye
    INFORMATION SCIENCES, 2023, 647
  • [23] SOFT ACTOR-CRITIC REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATOR WITH HINDSIGHT EXPERIENCE REPLAY
    Yan, Tao
    Zhang, Wenan
    Yang, Simon X.
    Yu, Li
    INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2019, 34 (05): : 536 - 543
  • [24] SAC-FACT: Soft Actor-Critic Reinforcement Learning for Counterfactual Explanations
    Ezzeddine, Fatima
    Ayoub, Omran
    Andreoletti, Davide
    Giordano, Silvia
    EXPLAINABLE ARTIFICIAL INTELLIGENCE, XAI 2023, PT I, 2023, 1901 : 195 - 216
  • [25] CONTROLLED SENSING AND ANOMALY DETECTION VIA SOFT ACTOR-CRITIC REINFORCEMENT LEARNING
    Zhong, Chen
    Gursoy, M. Cenk
    Velipasalar, Senem
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4198 - 4202
  • [26] Reinforcement learning for automatic quadrilateral mesh generation: A soft actor-critic approach
    Pan, Jie
    Huang, Jingwei
    Cheng, Gengdong
    Zeng, Yong
    NEURAL NETWORKS, 2023, 157 : 288 - 304
  • [27] Real-Time Optimization of Urban Rail Transit Train Scheduling via Advantage Actor-Critic Deep Reinforcement Learning
    Wen, Longhui
    Zhou, Wei
    Liu, Jiajun
    Ren, Gang
    Zhang, Ning
    JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2024, 150 (09)
  • [28] Research Progress of Real-Time Bidding for Display Advertising
    Liu M.-J.
    Yue W.
    Qiu L.-Z.
    Li J.-X.
    Qin Z.-G.
    1810, Science Press (43): : 1810 - 1841
  • [29] Actor-Critic reinforcement learning based on prior knowledge
    Yang, Zhenyu, 1600, Transport and Telecommunication Institute, Lomonosova street 1, Riga, LV-1019, Latvia (18):
  • [30] Real-Time Bidding n Online Display Advertising
    Sayedi, Amin
    MARKETING SCIENCE, 2018, 37 (04) : 553 - 568