Large-Scale Visual Font Recognition

Cited by: 22
Authors
Chen, Guang [2 ]
Yang, Jianchao [1 ]
Jin, Hailin [1 ]
Brandt, Jonathan [1 ]
Shechtman, Eli [1 ]
Agarwala, Aseem [1 ]
Han, Tony X. [2 ]
Affiliations
[1] Adobe Res, San Jose, CA USA
[2] Univ Missouri, Columbia, MO 65211 USA
DOI
10.1109/CVPR.2014.460
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the large-scale visual font recognition (VFR) problem, which aims at automatic identification of the typeface, weight, and slope of the text in an image or photo without any knowledge of content. Although visual font recognition has many practical applications, it has largely been neglected by the vision community. To address the VFR problem, we construct a large-scale dataset containing 2,420 font classes, which easily exceeds the scale of most image categorization datasets in computer vision. As font recognition is inherently dynamic and open-ended, i.e., new classes and data for existing categories are constantly added to the database over time, we propose a scalable solution based on the nearest class mean classifier (NCM). The core algorithm is built on local feature embedding, local feature metric learning and max-margin template selection, which is naturally amenable to NCM and thus to such open-ended classification problems. The new algorithm can generalize to new classes and new data at little added cost. Extensive experiments demonstrate that our approach is very effective on our synthetic test images, and achieves promising results on real world test images.
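To make the open-ended property mentioned in the abstract concrete, here is a minimal sketch of a generic nearest class mean classifier. It is not the authors' implementation: it uses plain Euclidean distance on raw feature vectors, whereas the paper builds on local feature embedding, a learned metric and template selection. The class name `NearestClassMean`, its methods, and the toy 2-D features are all hypothetical choices made for illustration.

```python
import numpy as np


class NearestClassMean:
    """Generic NCM sketch: one running mean per class, Euclidean distance.

    Assumption: features are fixed-length vectors. No metric learning is
    performed here, unlike the approach described in the abstract.
    """

    def __init__(self, dim):
        self.dim = dim
        self.sums = {}    # label -> sum of all feature vectors seen so far
        self.counts = {}  # label -> number of feature vectors seen so far

    def add(self, features, label):
        """Fold new samples of shape (n, dim) into the class statistics.

        The same call handles a brand-new class and extra data for an
        existing class; no other class is touched.
        """
        features = np.atleast_2d(np.asarray(features, dtype=float))
        if features.shape[1] != self.dim:
            raise ValueError("feature dimension mismatch")
        previous = self.sums.get(label, np.zeros(self.dim))
        self.sums[label] = previous + features.sum(axis=0)
        self.counts[label] = self.counts.get(label, 0) + features.shape[0]

    def predict(self, x):
        """Return the label whose class mean is closest to x."""
        if not self.sums:
            raise ValueError("no classes have been added")
        x = np.asarray(x, dtype=float)
        labels = list(self.sums)
        means = np.stack([self.sums[k] / self.counts[k] for k in labels])
        distances = np.linalg.norm(means - x, axis=1)
        return labels[int(np.argmin(distances))]


# Toy usage with made-up 2-D features and made-up class names.
ncm = NearestClassMean(dim=2)
ncm.add([[0.0, 0.0], [0.2, 0.0]], "font_a")
ncm.add([[1.0, 1.0], [1.0, 0.8]], "font_b")
first_guess = ncm.predict([0.1, 0.1])

# A new class arrives later; only its own mean has to be computed.
ncm.add([[5.0, 5.0]], "font_c")
second_guess = ncm.predict([4.5, 4.8])
```

Because each class is summarized only by a running sum and a count, adding a class or more samples costs one pass over the new data, which is the sense in which an NCM-style classifier extends to new classes at little added cost.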
Pages: 3598-3605
Number of pages: 8