Workshop on Document Intelligence Understanding

被引:0
|
作者
Han, Soyeon Caren [1 ,2 ]
Ding, Yihao [2 ]
Luo, Siwen [1 ,2 ]
Poon, Josiah [2 ]
Yoon, Hee-Guen [3 ]
Huang, Zhe [4 ]
Duuring, Paul [5 ]
Holden, Eun-Jung [1 ]
机构
[1] Univ Western Australia, Perth, WA, Australia
[2] Univ Sydney, Sydney, NSW, Australia
[3] Natl Informat Soc Agcy, Daegu, South Korea
[4] Alibaba Grp, Ant Grp, Shanghai, Peoples R China
[5] Ind Regulat & Safety, Dept Mines, Perth, WA, Australia
关键词
Document Intelligence; Document Structure Understanding; Document Content Understanding;
D O I
10.1145/3583780.3615312
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document understanding and information extraction include different tasks to understand a document and extract valuable information automatically. Recently, there has been a rising demand for developing document understanding among different domains, including business, law, and medicine, to boost the efficiency of work that is associated with a large number of documents. This workshop aims to bring together researchers and industry developers in the field of document intelligence and understanding diverse document types to boost automatic document processing and understanding techniques. We also release a data challenge on the recently introduced document-level VQA dataset, PDFVQA. The PDFVQA challenge(1) examines the model's structural and contextual understandings on the natural full document level of multiple consecutive document pages by including questions with a sequence of answers extracted from multi-pages of the full document. This task helps to boost the document understanding step from the single-page level to the full document level understanding.
引用
收藏
页码:5273 / 5276
页数:4
相关论文
共 50 条
  • [1] DI-2022: The Third Document Intelligence Workshop
    Nenkova, Ani
    Burdick, Douglas
    Han, Benjamin
    Lewis, Dave
    Tata, Sandeep
    Tecuci, Dan
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4890 - 4891
  • [2] DI-2021: The Second Document Intelligence Workshop
    Han, Benjamin
    Burdick, Douglas
    Lewis, Dave
    Lu, Yijuan
    Motahari, Hamid
    Tata, Sandeep
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 4127 - 4128
  • [3] International Workshop on Computational Intelligence for Multimedia Understanding
    Moroni, Davide
    Toreyin, Behcet Ugur
    Cetin, A. Enis
    ERCIM NEWS, 2018, (113): : 57 - 57
  • [4] International Workshop on Computational Intelligence for Multimedia Understanding
    Moroni, Davide
    Trocan, Maria
    Prochazka, Ales
    ERCIM NEWS, 2016, (104): : 4 - 4
  • [5] International Workshop on Computational Intelligence for Multimedia Understanding
    Salerno, Emanuele
    ERCIM NEWS, 2012, (89): : 7 - 7
  • [6] IWCIM: International Workshop on Computational Intelligence for Multimedia Understanding
    Moroni, Davide
    Trocan, Maria
    Töreyin, Behçet Ugur
    Proceedings - 15th International Conference on Signal Image Technology and Internet Based Systems, SISITS 2019, 2019,
  • [7] 11th International Workshop on Computational Intelligence for Multimedia understanding
    Toreyin, Behcet Ugur
    Trocan, Maria
    Moroni, Davide
    ERCIM NEWS, 2023, (134):
  • [8] MUSCLE Working Group International Workshop on Computational Intelligence for Multimedia Understanding
    Trocan, Maria
    Salerno, Emanuele
    Cetin, Enis
    ERCIM NEWS, 2015, (100): : 52 - 53
  • [9] Workshop: Document imaging and document management pre-conference workshop
    Gilheany, S
    IAMSLIC 2000: TIDES OF TECHNOLOGY, 2001, : 3 - 9
  • [10] Workshop on web intelligence and interaction
    Takama, Yasufumi
    Kawai, Yukiko
    Kitayama, Daisuke
    Sugihara, Taro
    Yoshida, Mitsuo
    CEUR Workshop Proceedings, 2018, 2068