Mobile Visual Search Using Image and Text Features

被引:0
|
作者
Tsai, Sam S. [1 ]
Chen, Huizhong [1 ]
Chen, David [1 ]
Vedantham, Ramakrishna [2 ]
Grzeszczuk, Radek [2 ]
Girod, Bernd [1 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[2] Nokia Res Ctr, Palo Alto, CA 94304 USA
来源
2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR) | 2011年
关键词
mobile visual search; image retrieval; document retrieval; document analysis;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a mobile visual search system that utilizes both text and low bit-rate image features. Using a cameraphone, a user can snap a picture of a document image and search for the document in online databases. From the query image, the title text is detected and recognized and image features are extracted and compressed, as well. Both types of information are sent from the cameraphone client to a server. The server uses the recognized title to retrieve candidate documents from online databases. Then, image features are used to select the correct document(s). We show that by using a novel geometric verification method that incorporates both text and image feature information, we can reduce the missed positives up to 50%. The proposed method can also speed up the geometric process, enabling a larger set of verified titles, resulting in a superior performance compared to previous schemes.
引用
收藏
页码:845 / 849
页数:5
相关论文
共 50 条
  • [41] Spatial Aggregation of Visual Features for Image Data Search in a Large Geo-tagged Image Dataset
    Alfarrarjeh, Abdullah
    Kim, Seon Ho
    Bright, Arvind
    Hegde, Vinuta
    Akshansh
    Shahabi, Cyrus
    2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2019), 2019, : 48 - 57
  • [42] The Joint Effect of Image Blur and Illumination Distortions for Mobile Visual Search of Print Media
    Cao, Yi
    Ritz, Christian
    Raad, Raad
    2013 13TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT): COMMUNICATION AND INFORMATION TECHNOLOGY FOR NEW LIFE STYLE BEYOND THE CLOUD, 2013, : 507 - 512
  • [43] Image based rendering for a mobile robot using visual landmarks
    Otsuka, K
    Tanaka, K
    Okada, N
    Kondo, E
    2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2005, : 3783 - 3788
  • [44] Image-based visual servoing using position and angle of image features
    Cho, JS
    Kim, HW
    Kweon, IS
    ELECTRONICS LETTERS, 2001, 37 (13) : 862 - 864
  • [45] The Analysis on City Image in Visual Text
    Wu, Rong
    JOINT 2016 INTERNATIONAL CONFERENCE ON SOCIAL SCIENCE AND ENVIRONMENTAL SCIENCE (SSES 2016) AND INTERNATIONAL CONFERENCE ON FOOD SCIENCE AND ENGINEERING (ICFSE 2016), 2016, : 198 - 201
  • [46] Text image processing for visual prostheses
    Wang, Song
    Li, Yi
    Barnes, Nick
    2012 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2012, : 2977 - 2980
  • [47] Visual Coverage Using Autonomous Mobile Robots for Search and Rescue Applications
    Del Bue, A.
    Tamassia, M.
    Signorini, F.
    Murino, V.
    Farinelli, A.
    2013 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2013,
  • [48] Interaction Design for Mobile Visual Search
    Sang, Jitao
    Mei, Tao
    Xu, Ying-Qing
    Zhao, Chen
    Xu, Changsheng
    Li, Shipeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (07) : 1665 - 1676
  • [49] Visual Arts Search on Mobile Devices
    Mao, Hui
    She, James
    Cheung, Ming
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (02)
  • [50] Retrieving images using cross-language text and image features
    Adriani, Mirna
    Arnely, Framadhan
    ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 733 - 736