Grounding Ontologies with Pre-Trained Large Language Models for Activity Based Intelligence

Cited by: 0
Authors
Azim, Anee [1 ]
Clark, Leon [1 ]
Lau, Caleb [1 ]
Cobb, Miles [2 ]
Jenner, Kendall [1 ]
Affiliations
[1] Lockheed Martin Australia, STELaRLab, Melbourne, Vic, Australia
[2] Lockheed Martin Space, Sunnyvale, CA, USA
Keywords
Activity Based Intelligence; Ontology; Large Language Model; Track Association
DOI
10.1117/12.3013332
CLC Classification
TP18 [Artificial Intelligence Theory]
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
The development of Activity Based Intelligence (ABI) requires an understanding of individual actors' intents, their interactions with other entities in the environment, and how these interactions help accomplish their goals. Statistical modelling alone is insufficient for such analyses, mandating higher-level representations such as ontologies to capture the important relationships. However, constructing ontologies for ABI, ensuring they remain grounded in real-world entities, and maintaining their applicability to downstream tasks requires substantial hand-tooling by domain experts. In this paper, we propose using a Large Language Model (LLM) to bootstrap a grounding for such an ontology. We then demonstrate that the experience encoded within the weights of a pre-trained LLM can be used in a zero-shot manner to provide a model of normalcy, enabling ABI analysis at the semantic level, agnostic to the precise coordinate data. This is accomplished through a sequence of two transformations applied to a kinematic track, yielding natural-language narratives suitable for LLM input. The first transformation abstracts the low-level kinematic track, embedding it within a knowledge graph using a domain-specific ABI ontology. The second employs a template-driven narrative generation process to form natural-language descriptions of behaviour. Computing the LLM perplexity score over these narratives grounds the ontology, without relying on any prompt engineering. In characterizing the perplexity score for a given track, we observe significant variability depending on parameters such as sentence verbosity, attribute count, and clause ordering. Consequently, we propose an approach that considers multiple generated narratives for an individual track and uses the distribution of their perplexity scores in downstream applications. We demonstrate the successful application of this methodology to a semantic track association task, and our subsequent analysis establishes how such an approach can augment existing kinematics-based association algorithms.
Pages: 11
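
The pipeline described in the abstract (knowledge-graph abstraction of a kinematic track, template-driven narrative generation, and zero-shot perplexity scoring over multiple narrative variants) can be illustrated with a minimal sketch. This is not the authors' implementation: the model choice (GPT-2 via Hugging Face transformers), the entity class, the attribute strings, and the sentence template are illustrative assumptions.

```python
# Minimal sketch: scoring template-generated behaviour narratives with a
# pre-trained LM's perplexity. All domain content below is hypothetical.
import math
from itertools import permutations

import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

def perplexity(text: str) -> float:
    """Perplexity of `text` under the pre-trained LM; no prompt engineering."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        loss = model(ids, labels=ids).loss  # mean next-token cross-entropy
    return math.exp(loss.item())

# Toy knowledge-graph abstraction of one kinematic track: an entity class plus
# a few behavioural attributes inferred from the low-level track (hypothetical).
track_facts = {
    "entity": "cargo vessel",
    "attributes": [
        "departed a commercial port",
        "followed a recognised shipping lane",
        "maintained a steady cruising speed",
    ],
}

def narratives(facts: dict) -> list[str]:
    """Template-driven narrative variants; clause order is varied deliberately,
    since perplexity is sensitive to verbosity and clause ordering."""
    return [
        f"A {facts['entity']} " + ", ".join(order) + "."
        for order in permutations(facts["attributes"])
    ]

# Work with the distribution of perplexities over the variants rather than a
# single rendering of the track.
scores = [perplexity(n) for n in narratives(track_facts)]
print(f"n={len(scores)} min={min(scores):.1f} "
      f"mean={sum(scores) / len(scores):.1f} max={max(scores):.1f}")
```

Under these assumptions, lower and more tightly clustered perplexities indicate behaviour the pre-trained model treats as "normal"; comparing such distributions across candidate tracks is one way a semantic signal of this kind could augment kinematics-based association, as described in the abstract.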