Complex-valued Neural Network-based Quantum Language Models

被引:8
|
作者
Zhang, Peng [1 ]
Hui, Wenjie [1 ]
Wang, Benyou [2 ]
Zhao, Donghao [1 ]
Song, Dawei [3 ]
Lioma, Christina [4 ]
Simonsen, Jakob Grue [4 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Univ Padua, Padua, Italy
[3] Beijing Inst Technol, Beijing, Peoples R China
[4] Univ Copenhagen, Copenhagen, Denmark
关键词
Quantum theory; language model; question answering; neural network; PROBABILITY RANKING PRINCIPLE; PHASE;
D O I
10.1145/3505138
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Language modeling is essential in Natural Language Processing and Information Retrieval related tasks. After the statistical language models, Quantum Language Model (QLM) has been proposed to unify both single words and compound terms in the same probability space without extending term space exponentially. Although QLM achieved good performance in ad hoc retrieval, it still has two major limitations: (1) QLM cannot make use of supervised information, mainly due to the iterative and non-differentiable estimation of the density matrix, which represents both queries and documents in QLM. (2) QLM assumes the exchangeability of words or word dependencies, neglecting the order or position information of words. This article aims to generalize QLM and make it applicable to more complicated matching tasks (e.g., Question Answering) beyond ad hoc retrieval. We propose a complex-valued neural network-based QLM solution called C-NNQLM to employ an end-to-end approach to build and train density matrices in a lightweight and differentiable manner, and it can therefore make use of external well-trained word vectors and supervised labels. Furthermore, C-NNQLM adopts complex-valued word vectors whose phase vectors can directly encode the order (or position) information of words. Note that complex numbers are also essential in the quantum theory. We show that the real-valued NNQLM (R-NNQLM) is a special case of C-NNQLM. The experimental results on the QA task show that both R-NNQLM and C-NNQLM achieve much better performance than the vanilla QLM, and C-NNQLM's performance is on par with state-of-the-art neural network models. We also evaluate the proposed C-NNQLM on text classification and document retrieval tasks. The results on most datasets show that the C-NNQLM can outperform R-NNQLM, which demonstrates the usefulness of the complex representation for words and sentences in C-NNQLM.
引用
收藏
页数:31
相关论文
共 50 条
  • [31] A synthesis method based on stability analysis for complex-valued Hopfield neural network
    Liu, Xiaoyu
    Fang, Kangling
    Liu, Bin
    ASCC: 2009 7TH ASIAN CONTROL CONFERENCE, VOLS 1-3, 2009, : 1245 - 1250
  • [32] Secret Key Generation for FDD Systems Based on Complex-Valued Neural Network
    Zhang, Xinwei
    Li, Guyue
    Hou, Zongyue
    Hu, Aiqun
    2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
  • [33] Complex-Valued Neural Network Based Detector for MIMO-OFDM Systems
    Ma, Kai
    Hu, Fengye
    Zhang, Peng
    ADVANCES IN INTELLIGENT SYSTEMS, 2012, 138 : 95 - 101
  • [34] An Improved Complex-Valued Recurrent Neural Network Model for Time-Varying Complex-Valued Sylvester Equation
    Ding, Lei
    Xiao, Lin
    Zhou, Kaiqing
    Lan, Yonghong
    Zhang, Yongsheng
    Li, Jichun
    IEEE ACCESS, 2019, 7 : 19291 - 19302
  • [35] Modeling Walking Behavior of Powered Exoskeleton based on Complex-Valued Neural Network
    Ishizuka, Yudai
    Murai, Shota
    Takahashi, Yasutake
    Kawai, Masayuki
    Taniai, Yoshiaki
    Naniwa, Tomohide
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 1927 - 1932
  • [36] Complex projective synchronization of complex-valued neural network with structure identification
    Zhang, Hao
    Wang, Xing-yuan
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2017, 354 (12): : 5011 - 5025
  • [37] Complex-valued neural networks
    Department of Electrical Engineering and Information Systems, University of Tokyo, 7-3-1, Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
    IEEJ Trans. Electron. Inf. Syst., 1 (2-8):
  • [38] Quantum state classification via complex-valued neural networks
    Dong, Yu-Chao
    Li, Xi-Kun
    Yang, Ming
    Lu, Yan
    Liao, Yan-Lin
    Ullah, Arif
    Lin, Zhi
    LASER PHYSICS LETTERS, 2024, 21 (10)
  • [39] Equivariant bifurcation in a coupled complex-valued neural network rings
    Zhang, Chunrui
    Sui, Zhenzhang
    Li, Hongpeng
    CHAOS SOLITONS & FRACTALS, 2017, 98 : 22 - 30
  • [40] Bifurcation Analysis of Delayed Complex-Valued Neural Network with Diffusions
    Tao Dong
    Jiaqi Bai
    Lei Yang
    Neural Processing Letters, 2019, 50 : 1019 - 1033