Conversational Image Search: A Sketch-based Approach

Cited by: 1
Authors
Braghis, Daniel D. [1]
Liu, Haiming [1]
Affiliations
[1] Univ Southampton, Sch Elect & Comp Sci, Southampton, Hants, England
Source
PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024 | 2024
Keywords
Conversational product search; natural language feedback; multi-modal interaction; sketch-based image retrieval; Stable Diffusion; ControlNet; GPT Assistant
DOI
10.1145/3652583.3657594
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Conversational image search has emerged as a progressive step beyond traditional keyword-based methodologies, which addresses challenges in human-computer interaction during the information retrieval process. This paper introduces a demonstration called DoodleShoper, a forward-thinking conversational image search assistant centered around sketching, specifically tailored for online product searches. It underscores the importance of visual diversity, often eluding verbal expression while highlighting the efficacy of a sketch-based approach in enhancing user interaction. The proposed modular architecture integrates a state-of-the-art Language Model with advanced Stable Diffusion technologies in the image generation field to offer users a more intuitive and precise conversational search experience. Unlike most conventional methods that directly align prompts or sketches with images, our approach leverages a generative model to produce an intermediate search outcome. This strategic shift streamlines the search process from a zero-shot query - where the query directly corresponds to an image - to a reverse image search task, facilitating the discovery of similar images through multimodal interaction. The implemented demonstration involves refining and expanding the application to diverse user information needs and preferences, including exploring the potential of utilising sketches as an alternative or complementary search environment, a novel concept rooted in current research.
Pages: 1265-1269
Page count: 5