Complex-valued Neural Network-based Quantum Language Models

被引:8
|
作者
Zhang, Peng [1 ]
Hui, Wenjie [1 ]
Wang, Benyou [2 ]
Zhao, Donghao [1 ]
Song, Dawei [3 ]
Lioma, Christina [4 ]
Simonsen, Jakob Grue [4 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Univ Padua, Padua, Italy
[3] Beijing Inst Technol, Beijing, Peoples R China
[4] Univ Copenhagen, Copenhagen, Denmark
关键词
Quantum theory; language model; question answering; neural network; PROBABILITY RANKING PRINCIPLE; PHASE;
D O I
10.1145/3505138
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Language modeling is essential in Natural Language Processing and Information Retrieval related tasks. After the statistical language models, Quantum Language Model (QLM) has been proposed to unify both single words and compound terms in the same probability space without extending term space exponentially. Although QLM achieved good performance in ad hoc retrieval, it still has two major limitations: (1) QLM cannot make use of supervised information, mainly due to the iterative and non-differentiable estimation of the density matrix, which represents both queries and documents in QLM. (2) QLM assumes the exchangeability of words or word dependencies, neglecting the order or position information of words. This article aims to generalize QLM and make it applicable to more complicated matching tasks (e.g., Question Answering) beyond ad hoc retrieval. We propose a complex-valued neural network-based QLM solution called C-NNQLM to employ an end-to-end approach to build and train density matrices in a lightweight and differentiable manner, and it can therefore make use of external well-trained word vectors and supervised labels. Furthermore, C-NNQLM adopts complex-valued word vectors whose phase vectors can directly encode the order (or position) information of words. Note that complex numbers are also essential in the quantum theory. We show that the real-valued NNQLM (R-NNQLM) is a special case of C-NNQLM. The experimental results on the QA task show that both R-NNQLM and C-NNQLM achieve much better performance than the vanilla QLM, and C-NNQLM's performance is on par with state-of-the-art neural network models. We also evaluate the proposed C-NNQLM on text classification and document retrieval tasks. The results on most datasets show that the C-NNQLM can outperform R-NNQLM, which demonstrates the usefulness of the complex representation for words and sentences in C-NNQLM.
引用
收藏
页数:31
相关论文
共 50 条
  • [1] Neural Cryptography Based on Complex-Valued Neural Network
    Dong, Tao
    Huang, Tingwen
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) : 4999 - 5004
  • [2] Complex-Valued Neural Network and Complex-Valued Backpropagation Learning Algorithm
    Nitta, Tohru
    ADVANCES IN IMAGING AND ELECTRON PHYSICS, VOL 152, 2008, 152 : 153 - 220
  • [3] Some novel real/complex-valued neural network models
    Garimella, Ramamurthy
    Computational Intelligence, Theory and Application, 2006, : 473 - 483
  • [4] Novel complex-valued neural network for dynamic complex-valued matrix inversion
    Liao B.
    Xiao L.
    Jin J.
    Ding L.
    Liu M.
    2016, Fuji Technology Press (20) : 132 - 138
  • [5] Novel Complex-Valued Neural Network for Dynamic Complex-Valued Matrix Inversion
    Liao, Bolin
    Xiao, Lin
    Jin, Jie
    Ding, Lei
    Liu, Mei
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2016, 20 (01) : 132 - 138
  • [6] Uniqueness of feedforward complex-valued neural network with a given complex-valued function
    Nitta, T
    KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 550 - 554
  • [7] Differential Complex-Valued Convolutional Neural Network-Based Individual Recognition of Communication Radiation Sources
    Ying, Long
    Li, Jingchao
    Zhang, Bin
    IEEE ACCESS, 2021, 9 : 132533 - 132540
  • [8] Quantum-Inspired Complex-Valued Language Models for Aspect-Based Sentiment Classification
    Zhao, Qin
    Hou, Chenguang
    Xu, Ruifeng
    ENTROPY, 2022, 24 (05)
  • [9] Comparison of complex-valued neural network and fuzzy clustering complex-valued neural network for load-flow analysis
    Ceylan, Murat
    Cetinkaya, Nurettin
    Ceylan, Rahime
    Ozbay, Yuksel
    ARTIFICIAL INTELLIGENCE AND NEURAL NETWORKS, 2006, 3949 : 92 - 99
  • [10] A Complex-Valued Neural Network Based Robust Image Compression
    Luo, Can
    Bao, Youneng
    Tan, Wen
    Li, Chao
    Meng, Fanyang
    Liang, Yongsheng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 53 - 64