Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects for Blind Persons

被引：32

作者：

Yi, Chucai ^{[1
]}

Tian, Yingli ^{[2
]}

Arditi, Aries ^{[3
,4
]}

机构：

[1] CUNY, Grad Ctr, New York, NY 10016 USA

[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA

[3] Lighthouse Int, New York, NY 10022 USA

[4] IBM TJ Watson Res Ctr, New York, NY 10504 USA

来源：

IEEE-ASME TRANSACTIONS ON MECHATRONICS | 2014年 / 19卷 / 03期

基金：

美国国家科学基金会; 美国国家卫生研究院;

关键词：

Assistive devices; blindness; distribution of edge pixels; hand-held objects; optical character recognition (OCR); stroke orientation; text reading; text region localization; ROBUST; RECOGNITION; IMAGES;

D O I：

10.1109/TMECH.2013.2261083

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose a camera-based assistive text reading framework to help blind persons read text labels and product packaging from hand-held objects in their daily lives. To isolate the object from cluttered backgrounds or other surrounding objects in the camera view, we first propose an efficient and effective motion-based method to define a region of interest (ROI) in the video by asking the user to shake the object. This method extracts moving object region by a mixture-of-Gaussians-based background subtraction method. In the extracted ROI, text localization and recognition are conducted to acquire text information. To automatically localize the text regions from the object ROI, we propose a novel text localization algorithm by learning gradient features of stroke orientations and distributions of edge pixels in an Adaboost model. Text characters in the localized text regions are then binarized and recognized by off-the-shelf optical character recognition software. The recognized text codes are output to blind users in speech. Performance of the proposed text localization algorithm is quantitatively evaluated on ICDAR-2003 and ICDAR-2011 Robust Reading Datasets. Experimental results demonstrate that our algorithm achieves the state of the arts. The proof-of-concept prototype is also evaluated on a dataset collected using ten blind persons to evaluate the effectiveness of the system's hardware. We explore user interface issues and assess robustness of the algorithm in extracting and reading text from different objects with complex backgrounds.

引用

页码：808 / 817

页数：10

共 2 条

[1] Camera-based scrolling interface for hand-held devices
Davanzo, Giorgio
Medvet, Eric
Bartoli, Alberto
PROCEEDINGS OF THE 12TH INTERNATIONAL INFORMATION VISUALISATION, 2008, : 527 - 532
[2] Improved text-detection methods for a camera-based text reading system for blind persons
Ezaki, N
Kiyota, K
Minh, BT
Bulacu, M
Schomaker, L
EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 257 - 261

← 1 →