Workshop on Document Intelligence Understanding

被引:0
|
作者
Han, Soyeon Caren [1 ,2 ]
Ding, Yihao [2 ]
Luo, Siwen [1 ,2 ]
Poon, Josiah [2 ]
Yoon, Hee-Guen [3 ]
Huang, Zhe [4 ]
Duuring, Paul [5 ]
Holden, Eun-Jung [1 ]
机构
[1] Univ Western Australia, Perth, WA, Australia
[2] Univ Sydney, Sydney, NSW, Australia
[3] Natl Informat Soc Agcy, Daegu, South Korea
[4] Alibaba Grp, Ant Grp, Shanghai, Peoples R China
[5] Ind Regulat & Safety, Dept Mines, Perth, WA, Australia
关键词
Document Intelligence; Document Structure Understanding; Document Content Understanding;
D O I
10.1145/3583780.3615312
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document understanding and information extraction include different tasks to understand a document and extract valuable information automatically. Recently, there has been a rising demand for developing document understanding among different domains, including business, law, and medicine, to boost the efficiency of work that is associated with a large number of documents. This workshop aims to bring together researchers and industry developers in the field of document intelligence and understanding diverse document types to boost automatic document processing and understanding techniques. We also release a data challenge on the recently introduced document-level VQA dataset, PDFVQA. The PDFVQA challenge(1) examines the model's structural and contextual understandings on the natural full document level of multiple consecutive document pages by including questions with a sequence of answers extracted from multi-pages of the full document. This task helps to boost the document understanding step from the single-page level to the full document level understanding.
引用
收藏
页码:5273 / 5276
页数:4
相关论文
共 50 条