Mobile Visual Search Using Image and Text Features

被引:0
|
作者
Tsai, Sam S. [1 ]
Chen, Huizhong [1 ]
Chen, David [1 ]
Vedantham, Ramakrishna [2 ]
Grzeszczuk, Radek [2 ]
Girod, Bernd [1 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[2] Nokia Res Ctr, Palo Alto, CA 94304 USA
关键词
mobile visual search; image retrieval; document retrieval; document analysis;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a mobile visual search system that utilizes both text and low bit-rate image features. Using a cameraphone, a user can snap a picture of a document image and search for the document in online databases. From the query image, the title text is detected and recognized and image features are extracted and compressed, as well. Both types of information are sent from the cameraphone client to a server. The server uses the recognized title to retrieve candidate documents from online databases. Then, image features are used to select the correct document(s). We show that by using a novel geometric verification method that incorporates both text and image feature information, we can reduce the missed positives up to 50%. The proposed method can also speed up the geometric process, enabling a larger set of verified titles, resulting in a superior performance compared to previous schemes.
引用
收藏
页码:845 / 849
页数:5
相关论文
共 50 条
  • [1] MOBILE VISUAL SEARCH ON PRINTED DOCUMENTS USING TEXT AND LOW BIT-RATE FEATURES
    Tsai, Sam S.
    Chen, Huizhong
    Chen, David
    Schroth, Georg
    Grzeszczuk, Radek
    Girod, Bernd
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [2] Visual Text Features for Image Matching
    Tsai, Sam S.
    Chen, Huizhong
    Chen, David
    Parameswaran, Vasu
    Grzeszczuk, Radek
    Girod, Bernd
    2012 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2012, : 408 - 412
  • [3] Bridging the Semantic Gap in Image Search via Visual Semantic Descriptors by Integrating Text and Visual Features
    Lekshmi, V. L.
    John, Ansamma
    COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS, ICC3 2015, 2016, 412 : 207 - 215
  • [4] iLike: Bridging the Semantic Gap in Vertical Image Search by Integrating Text and Visual Features
    Chen, Yuxin
    Sampathkumar, Hariprasad
    Luo, Bo
    Chen, Xue-wen
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (10) : 2257 - 2270
  • [5] Image Classification Based on the Combination of Text Features and Visual Features
    Tian, Lexiao
    Zheng, Dequan
    Zhu, Conghui
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2013, 28 (03) : 242 - 256
  • [6] A World Wide Web based image search engine using text and image content features
    Luo, B
    Wang, XG
    Tang, XO
    INTERNET IMAGING IV, 2003, 5018 : 123 - 130
  • [7] SEARCH BY MOBILE IMAGE BASED ON VISUAL AND SPATIAL CONSISTENCY
    Liu, Xianglong
    Lou, Yihua
    Yu, Adams Wei
    Lang, Bo
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [8] Mobile Visual Search from Dynamic Image Databases
    Chen, Xi
    Koskela, Markus
    IMAGE ANALYSIS: 17TH SCANDINAVIAN CONFERENCE, SCIA 2011, 2011, 6688 : 196 - 205
  • [9] MOBILE 3D VISUAL SEARCH USING THE HELMERT TRANSFORMATION OF STEREO FEATURES
    Li, Haopeng
    Flierl, Markus
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 3470 - 3474
  • [10] The effects of layout types, visual features and text labels on icon visual search performance
    Deng, Li
    Liu, Ruiying
    ERGONOMICS, 2024,