Capturing Handwritten Ink Strokes with a Fast Video Camera

Cited by: 2
Authors
Kim, Chelhwon [1]
Chiu, Patrick [1]
Oda, Hideto [1]
Affiliations
[1] FX Palo Alto Lab, Palo Alto, CA 94304 USA
DOI
10.1109/ICDAR.2017.209
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a system for capturing ink strokes written with ordinary pen and paper using a fast camera whose frame rate is comparable to that of a stylus digitizer. Ink strokes are extracted from the video frames and used as input to an online handwriting recognition engine. A key component of the system is a pen up/down detection model that detects contact between the pen tip and the paper in the video frames. The model combines feature representation by convolutional neural networks with classification by a recurrent neural network. We also use a high-speed tracker based on kernelized correlation filters to track the pen tip. For training and evaluation, we collected labeled video data of users writing English and Japanese phrases drawn from public datasets, and we report character accuracy scores at different frame rates for the two languages.
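As an illustration of how tracked pen-tip positions and pen up/down decisions could be combined into ink strokes, the following is a minimal sketch and not the authors' implementation. It assumes one tracked pen-tip position and one boolean pen-down decision per video frame; the function name `split_into_strokes` and the data layout are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def split_into_strokes(points: List[Point], pen_down: List[bool]) -> List[List[Point]]:
    """Group per-frame pen-tip positions into strokes.

    Assumption (not from the paper): points[i] is the tracked pen-tip
    position in frame i, and pen_down[i] is the detector's decision
    for that same frame.
    """
    if len(points) != len(pen_down):
        raise ValueError("points and pen_down must have the same length")

    strokes: List[List[Point]] = []
    current: List[Point] = []
    for point, down in zip(points, pen_down):
        if down:
            # Pen touches the paper: extend the current stroke.
            current.append(point)
        elif current:
            # Pen lifted: close the stroke that was in progress.
            strokes.append(current)
            current = []
    if current:
        # Close a stroke that runs to the final frame.
        strokes.append(current)
    return strokes
```

Each resulting stroke is an ordered list of points, which is the general form of input an online handwriting recognizer expects.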
Pages: 1269-1274
Page count: 6