Data Extraction and Question Answering on Chart Images Towards Accessibility and Data Interpretation

被引:0
|
作者
Shahira, K. C. [1 ]
Joshi, Pulkit [1 ]
Lijiya, A. [1 ]
机构
[1] Natl Inst Technol Calicut, Kozhikode 673601, India
关键词
Data mining; Bars; Visualization; Question answering (information retrieval); Data visualization; Pipelines; Feature extraction; Chart accessibility; data visualization; information retrieval; image processing; object detection; table question answering;
D O I
10.1109/OJCS.2023.3328767
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graphical representations such as chart images are integral to web pages and documents. Automating data extraction from charts is possible by reverse-engineering the visualization pipeline. This study proposes a framework that automates data extraction from bar charts and integrates it with question-answering. The framework employs an object detector to recognize visual cues in the image, followed by text recognition. Mask-RCNN for plot element detection achieves a mean average precision of 95.04% at a threshold of 0.5 which decreases as the Intersection over Union (IoU) threshold increases. A contour approximation-based approach is proposed for extracting the bar coordinates, even at a higher IoU of 0.9. The textual and visual cues are associated with the legend text and preview, and the chart data is finally extracted in tabular format. We introduce an extension to the TAPAS model, called TAPAS++, by incorporating new operations and table question answering is done using TAPAS++ model. The chart summary or description is also produced in an audio format. In the future, this approach could be expanded to enable interactive question answering on charts by accepting audio inquiries from individuals with visual impairments and do more complex reasoning using Large Language Models.
引用
收藏
页码:314 / 325
页数:12
相关论文
共 50 条
  • [1] Towards an Efficient Framework for Data Extraction from Chart Images
    Ma, Weihong
    Zhang, Hesuo
    Yan, Shuang
    Yao, Guangshun
    Huang, Yichao
    Li, Hui
    Wu, Yaqiang
    Jin, Lianwen
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 583 - 597
  • [2] Information Extraction over Structured Data: Question Answering with Freebase
    Yao, Xuchen
    Van Durme, Benjamin
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 956 - 966
  • [3] Automatic Question-Answering Based on Wikipedia Data Extraction
    Huang, Xiangzhou
    Wei, Baogang
    Zhang, Yin
    2015 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE), 2015, : 314 - 317
  • [4] ChartSense: Interactive Data Extraction from Chart Images
    Jung, Daekyoung
    Kim, Wonjae
    Song, Hyunjoo
    Hwang, Jeong-In
    Lee, Bongshin
    Kim, Bohyoung
    Seo, Jinwook
    PROCEEDINGS OF THE 2017 ACM SIGCHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'17), 2017, : 6706 - 6717
  • [5] Towards Visual Question Answering on Pathology Images
    He, Xuehai
    Cai, Zhuo
    Wei, Wenlan
    Zhang, Yichen
    Mou, Luntian
    Xing, Eric
    Xie, Pengtao
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 708 - 718
  • [6] SINA: Semantic interpretation of user queries for question answering on interlinked data
    Shekarpour, Saeedeh
    Marx, Edgard
    Ngomo, Axel-Cyrille Ngonga
    Auer, Soeren
    JOURNAL OF WEB SEMANTICS, 2015, 30 : 39 - 51
  • [7] Data Augmentation Method for Question Answering
    Ding J.
    Xiao K.
    Ye H.
    Zhou X.
    Zhang M.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2022, 58 (01): : 54 - 60
  • [8] Towards a Passages Extraction Method for Arabic Question Answering Systems
    Lahbari, Imane
    Alami, Hamza
    Zidani, Khalid Alaoui
    Ouatik, Said El Alaoui
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2019): VOL 1 - ADVANCED INTELLIGENT SYSTEMS FOR EDUCATION AND INTELLIGENT LEARNING SYSTEM, 2020, 1102 : 230 - 237
  • [9] Improving Event Duration Question Answering by Leveraging Existing Temporal Information Extraction Data
    Virgo, Felix Giovanni
    Cheng, Fei
    Kurohashi, Sadao
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 4451 - 4457
  • [10] GQA: Grammatical Question Answering for RDF Data
    Zimina, Elizaveta
    Nummenmaa, Jyrki
    Jarvelin, Kalervo
    Peltonen, Jaakko
    Stefanidis, Kostas
    Hyyro, Heikki
    SEMANTIC WEB CHALLENGES, SEMWEBEVAL 2018, 2018, 927 : 82 - 97