Complex-valued Neural Network-based Quantum Language Models

被引:8
|
作者
Zhang, Peng [1 ]
Hui, Wenjie [1 ]
Wang, Benyou [2 ]
Zhao, Donghao [1 ]
Song, Dawei [3 ]
Lioma, Christina [4 ]
Simonsen, Jakob Grue [4 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Univ Padua, Padua, Italy
[3] Beijing Inst Technol, Beijing, Peoples R China
[4] Univ Copenhagen, Copenhagen, Denmark
关键词
Quantum theory; language model; question answering; neural network; PROBABILITY RANKING PRINCIPLE; PHASE;
D O I
10.1145/3505138
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Language modeling is essential in Natural Language Processing and Information Retrieval related tasks. After the statistical language models, Quantum Language Model (QLM) has been proposed to unify both single words and compound terms in the same probability space without extending term space exponentially. Although QLM achieved good performance in ad hoc retrieval, it still has two major limitations: (1) QLM cannot make use of supervised information, mainly due to the iterative and non-differentiable estimation of the density matrix, which represents both queries and documents in QLM. (2) QLM assumes the exchangeability of words or word dependencies, neglecting the order or position information of words. This article aims to generalize QLM and make it applicable to more complicated matching tasks (e.g., Question Answering) beyond ad hoc retrieval. We propose a complex-valued neural network-based QLM solution called C-NNQLM to employ an end-to-end approach to build and train density matrices in a lightweight and differentiable manner, and it can therefore make use of external well-trained word vectors and supervised labels. Furthermore, C-NNQLM adopts complex-valued word vectors whose phase vectors can directly encode the order (or position) information of words. Note that complex numbers are also essential in the quantum theory. We show that the real-valued NNQLM (R-NNQLM) is a special case of C-NNQLM. The experimental results on the QA task show that both R-NNQLM and C-NNQLM achieve much better performance than the vanilla QLM, and C-NNQLM's performance is on par with state-of-the-art neural network models. We also evaluate the proposed C-NNQLM on text classification and document retrieval tasks. The results on most datasets show that the C-NNQLM can outperform R-NNQLM, which demonstrates the usefulness of the complex representation for words and sentences in C-NNQLM.
引用
收藏
页数:31
相关论文
共 50 条
  • [41] Multicore Photonic Complex-Valued Neural Network with Transformation Layer
    Wang, Ruiting
    Wang, Pengfei
    Lyu, Chen
    Luo, Guangzhen
    Yu, Hongyan
    Zhou, Xuliang
    Zhang, Yejin
    Pan, Jiaoqing
    PHOTONICS, 2022, 9 (06)
  • [42] Synchronization of complex-valued neural network with sliding mode control
    Zhang Hao
    Wang Xing-yuan
    Lin Xiao-hui
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2016, 353 (02): : 345 - 358
  • [43] Bifurcation Analysis of Delayed Complex-Valued Neural Network with Diffusions
    Dong, Tao
    Bai, Jiaqi
    Yang, Lei
    NEURAL PROCESSING LETTERS, 2019, 50 (02) : 1019 - 1033
  • [44] Asymptotical Stability for a Class of Complex-Valued Projective Neural Network
    Li, Jin-dong
    Huang, Nan-jing
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2018, 177 (01) : 261 - 270
  • [45] Asymptotical Stability for a Class of Complex-Valued Projective Neural Network
    Jin-dong Li
    Nan-jing Huang
    Journal of Optimization Theory and Applications, 2018, 177 : 261 - 270
  • [46] Small-time Scale Network Traffic Prediction Based on Complex-valued Neural Network
    Yang, Bin
    4TH INTERNATIONAL CONFERENCE ON MECHANICS AND MECHATRONICS RESEARCH (ICMMR 2017), 2017, 224
  • [47] Atmospheric turbulence removal with complex-valued convolutional neural network
    Anantrasirichai, Nantheera
    PATTERN RECOGNITION LETTERS, 2023, 171 : 69 - 75
  • [48] Surface classification using ANN and complex-valued neural network
    Prashanth, A
    Kalra, PK
    Vyas, NS
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 1094 - 1098
  • [49] Neneta: Heterogeneous Computing Complex-Valued Neural Network Framework
    Lekic, Vladimir
    Babic, Zdenka
    2017 40TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2017, : 192 - 196
  • [50] A blind equalization algorithm of complex-valued feedforward neural network
    Zhang, XQ
    Lu, R
    Wang, HK
    Zhang, LY
    ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 1114 - 1116