An Information Theoretic Approach to Symbolic Learning in Synthetic Languages

Cited by: 2

Authors:

Back, Andrew D. [1]

Wiles, Janet [1]

Affiliations:

[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia

Source:

ENTROPY | 2022 / Vol. 24 / Issue 02

Keywords:

information theoretic models; synthetic language; entropy; Zipf-Mandelbrot-Li law; language models; behavior prediction; NUMERICAL COMPUTATION; MAXIMUM-LIKELIHOOD; ENTROPY; SPEECH; LAW; RECOGNITION; INTELLIGIBILITY; QUANTIZATION; DIVERSITY; INDEX;

DOI:

10.3390/e24020259

Chinese Library Classification number:

O4 [Physics];

Discipline classification code:

0702 ;

Abstract:

An important aspect of using entropy-based models and proposed "synthetic languages" is the seemingly simple task of knowing how to identify the probabilistic symbols. If the system has discrete features, then this task may be trivial; however, for observed analog behaviors described by continuous values, this raises the question of how we should determine such symbols. This task of symbolization extends the concept of scalar and vector quantization to consider explicit linguistic properties. Unlike previous quantization algorithms, where the aim is primarily data compression and fidelity, the goal in this case is to produce a symbolic output sequence which incorporates some linguistic properties and hence is useful in forming language-based models. In this paper, we therefore present methods for symbolization which take into account such properties in the form of probabilistic constraints. In particular, we propose new symbolization algorithms which constrain the symbols to have a Zipf-Mandelbrot-Li distribution, which approximates the behavior of language elements. We introduce a novel constrained EM algorithm which is shown to effectively learn to produce symbols which approximate a Zipfian distribution. We demonstrate the efficacy of the proposed approaches on some examples using real-world data in different tasks, including the translation of animal behavior into a possible equivalent that is understandable in human language.
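As a rough illustration of what symbolization under a probabilistic constraint can look like, the sketch below quantizes continuous values into symbols whose frequencies follow a Zipf-Mandelbrot-type rank distribution. It is not the constrained EM algorithm of the paper, which this record does not describe in detail. It is a simple quantile-based scalar quantizer, and the function names, the generic form p(r) proportional to 1/(r + beta)^alpha, and the parameter values alpha = 1 and beta = 1 are all assumptions made for the example.

```python
import numpy as np


def zipf_mandelbrot_probs(num_symbols, alpha=1.0, beta=1.0):
    """Target rank probabilities p(r) proportional to 1 / (r + beta)**alpha.

    alpha and beta are illustrative values, not the paper's parameterization.
    """
    ranks = np.arange(1, num_symbols + 1)
    weights = 1.0 / (ranks + beta) ** alpha
    return weights / weights.sum()


def symbolize(values, num_symbols, alpha=1.0, beta=1.0):
    """Map continuous values to integer symbols 0..num_symbols-1.

    Bin edges are placed at the empirical quantiles given by the cumulative
    target probabilities, so symbol k occurs with frequency close to p(k + 1).
    Assigning the lowest values to the most frequent symbol is an arbitrary
    choice made for this sketch.
    """
    probs = zipf_mandelbrot_probs(num_symbols, alpha, beta)
    interior_cuts = np.cumsum(probs)[:-1]        # drop the final value 1.0
    edges = np.quantile(values, interior_cuts)   # num_symbols - 1 bin edges
    symbols = np.searchsorted(edges, values, side="right")
    return symbols, probs


# Synthetic stand-in for an observed analog behavior (assumed data).
rng = np.random.default_rng(0)
values = rng.normal(size=10_000)

symbols, probs = symbolize(values, num_symbols=8)
empirical = np.bincount(symbols, minlength=8) / symbols.size
```

Here `empirical` holds the observed symbol frequencies and `probs` the target distribution, so the two can be compared directly. A quantile-based quantizer matches the target frequencies by construction but ignores fidelity and sequence structure, which is one reason a learned approach such as the constrained EM algorithm mentioned in the abstract may be preferable.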

Pages: 25
