Interactive Interior Design Recommendation via Coarse-to-fine Multimodal Reinforcement Learning

被引:1
|
作者
Zhang, He [1 ]
Sun, Ying [1 ]
Guo, Weiyu [1 ]
Liu, Yafei [2 ]
Lu, Haonan [2 ]
Lin, Xiaodong [3 ]
Xiong, Hui [1 ]
机构
[1] Hong Kong Univ Sci & Technol Guangzhou, Thrust Artificial Intelligence, Guangzhou, Peoples R China
[2] OPPO Res Inst, Chengdu, Sichuan, Peoples R China
[3] Rutgers State Univ, New Brunswick, NJ USA
关键词
Multimodal Interaction; Interactive Recommendation; Reinforcement Learning; Interior Design;
D O I
10.1145/3581783.3612420
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Personalized interior decoration design often incurs high labor costs. Recent efforts in developing intelligent interior design systems have focused on generating textual requirement-based decoration designs while neglecting the problem of how to mine homeowner's hidden preferences and choose the proper initial design. To fill this gap, we propose an Interactive Interior Design Recommendation System (IIDRS) based on reinforcement learning (RL). IIDRS aims to find an ideal plan by interacting with the user, who provides feedback on the gap between the recommended plan and their ideal one. To improve decision-making efficiency and effectiveness in large decoration spaces, we propose a Decoration Recommendation Coarse-to-Fine Policy Network (DecorRCFN). Additionally, to enhance generalization in online scenarios, we propose an object-aware feedback generation method that augments model training with diversified and dynamic textual feedback. Extensive experiments on a real-world dataset demonstrate our method outperforms traditional methods by a large margin in terms of recommendation accuracy. Further user studies demonstrate that our method reaches higher real-world user satisfaction than baseline methods.
引用
收藏
页码:6472 / 6480
页数:9
相关论文
共 50 条
  • [41] Vision-Language Recommendation via Attribute Augmented Multimodal Reinforcement Learning
    Yu, Tong
    Shen, Yilin
    Zhang, Ruiyi
    Zeng, Xiangyu
    Jin, Hongxia
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 39 - 47
  • [42] A coarse-to-fine user preferences prediction method for point-of-interest recommendation
    Cai, Liangqi
    Wen, Wen
    Wu, Biao
    Yang, Xiaowei
    NEUROCOMPUTING, 2021, 422 : 1 - 11
  • [43] A coarse-to-fine user preferences prediction method for point-of-interest recommendation
    Cai, Liangqi
    Wen, Wen
    Wu, Biao
    Yang, Xiaowei
    Neurocomputing, 2021, 422 : 1 - 11
  • [44] CMDBENCH: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI Systems
    Feng, Yanlin
    Rahman, Sajjadur
    Feng, Aaron
    Chen, Vincent
    Kandogan, Eser
    FIRST WORKSHOP ON GOVERNANCE, UNDERSTANDING, AND INTEGRATION OF DATA FOR EFFECTIVE AND RESPONSIBLE AI, GUIDE-AI 2024, 2024, : 16 - 25
  • [45] Coarse-to-fine matching via cross fusion of satellite images
    Li, Liangzhi
    Han, Ling
    Gao, Kyle
    He, Hongjie
    Wang, Lanying
    Li, Jonathan
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 125
  • [46] Pedestrian re-identification via coarse-to-fine ranking
    Liu Xiaokai
    IET COMPUTER VISION, 2016, 10 (05) : 366 - 373
  • [47] Balanced coarse-to-fine federated learning for noisy heterogeneous clients
    Han, Longfei
    Zhai, Ying
    Jia, Yanan
    Cai, Qiang
    Li, Haisheng
    Huang, Xiankai
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (02)
  • [48] MATGAN: Unified GANs for Multimodal Attribute Transfer by Coarse-to-Fine Disentangling Representations
    Guo, Xi
    Rao, Qiang
    He, Kun
    Chen, Fang
    Yu, Bing
    Feng, Bailan
    Huang, Jian
    Yang, Qin
    2022 INTERNATIONAL CONFERENCE ON FRONTIERS OF ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, FAIML, 2022, : 109 - 119
  • [49] Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation
    James, Stephen
    Wada, Kentaro
    Laidlow, Tristan
    Davison, Andrew J.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13729 - 13738
  • [50] Stack-Captioning: Coarse-to-Fine Learning for Image Captioning
    Gu, Jiuxiang
    Cai, Jianfei
    Wang, Gang
    Chen, Tsuhan
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6837 - 6844