A Hybrid Probabilistic Approach for Table Understanding

被引:0
|
作者
Sun, Kexuan [1 ]
Rayudu, Harsha [1 ]
Pujara, Jay [1 ]
机构
[1] Univ Southern Calif, Informat Sci Inst, Los Angeles, CA 90089 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tables of data are used to record vast amounts of socioeconomic, scientific, and governmental information. Although humans create tables using underlying organizational principles, unfortunately AI systems struggle to understand the contents of these tables. This paper introduces an end-to-end system for table understanding, the process of capturing the relational structure of data in tables. We introduce models that identify cell types, group these cells into blocks of data that serve a similar functional role, and predict the relationships between these blocks. We introduce a hybrid, neuro-symbolic approach, combining embedded representations learned from thousands of tables with probabilistic constraints that capture regularities in how humans organize tables. Our neurosymbolic model is better able to capture positional invariants of headers and enforce homogeneity of data types. One limitation in this research area is the lack of rich datasets for evaluating end-to-end table understanding, so we introduce a new benchmark dataset comprised of 431 diverse tables from data.gov. The evaluation results show that our system achieves the state-of-the-art performance on cell type classification, block identification, and relationship prediction, improving over prior efforts by up to 7% of macro F1 score.
引用
收藏
页码:4366 / 4374
页数:9
相关论文
共 50 条
  • [31] A Hybrid Approach to Header Size and Forwarding Table Optimization in Segment Routing
    Roy, Anushree
    Sarkar, Tania
    Singh, Pranav Kumar
    Maity, Ranjan
    IEEE Networking Letters, 2023, 5 (04): : 275 - 278
  • [32] Understanding Probabilistic Programs
    Katoen, Joost-Pieter
    Gretz, Friedrich
    Jansen, Nils
    Kaminski, Benjamin Lucien
    Olmedo, Federico
    CORRECT SYSTEM DESIGN: SYMPOSIUM IN HONOR OF ERNST-RUDIGER OLDEROG ON THE OCCASION OF HIS 60TH BIRTHDAY, 2015, 9360 : 15 - 32
  • [33] Moving vehicle tracking and scene understanding: A hybrid approach
    Liu, Xiaoxu
    Yan, Wei Qi
    Kasabov, Nikola
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 51541 - 51558
  • [34] Moving vehicle tracking and scene understanding: A hybrid approach
    Xiaoxu Liu
    Wei Qi Yan
    Nikola Kasabov
    Multimedia Tools and Applications, 2024, 83 : 51541 - 51558
  • [35] A hybrid approach to record linkage using a combination of deterministic and probabilistic methodology
    Ong, Toan C.
    Duca, Lindsey M.
    Kahn, Michael G.
    Crume, Tessa L.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (04) : 505 - 513
  • [36] A hybrid approach for learning parameters of probabilistic networks from incomplete databases
    Haider, S
    DESIGN AND APPLICATION OF HYBRID INTELLIGENT SYSTEMS, 2003, 104 : 321 - 330
  • [37] Hybrid Fuzzy-Probabilistic Approach to Supply Chain Resilience Assessment
    Pavlov, Alexander
    Ivanov, Dmitry
    Dolgui, Alexandre
    Sokolov, Boris
    IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT, 2018, 65 (02) : 303 - 315
  • [38] Robust Hybrid Interval-Probabilistic Approach for the Kidnapped Robot Problem
    Neuland, Renata
    Mantelli, Mathias
    Hummes, Bernardo
    Jaulin, Luc
    Maffei, Renan
    Prestes, Edson
    Kolberg, Mariana
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2021, 29 (02) : 313 - 331
  • [39] An approximate dynamic programming approach to probabilistic reachability for stochastic hybrid systems
    Abate, Alessandro
    Prandini, Maria
    Lygeros, John
    Sastry, Shankar
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 4018 - 4023
  • [40] Medium-Term Probabilistic Forecasting of Electricity Prices: A Hybrid Approach
    Bello, Antonio
    Bunn, Derek W.
    Reneses, Javier
    Munoz, Antonio
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2017, 32 (01) : 334 - 343