Experiments in Character-level Neural Network Models for Punctuation

Cited by: 7
Authors
Gale, William [1]
Parthasarathy, Sarangarajan [2]
Affiliations
[1] Univ Adelaide, Adelaide, SA, Australia
[2] Microsoft, Redmond, WA USA
Source
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017
Keywords
speech recognition; punctuation prediction; neural networks;
DOI
10.21437/Interspeech.2017-1710
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We explore character-level neural network models for inferring punctuation from text-only input. Punctuation inference is treated as a sequence tagging problem where the input is a sequence of unpunctuated characters and the output is a corresponding sequence of punctuation tags. We experiment with six architectures, all of which use a long short-term memory (LSTM) network for sequence modeling. They differ in the way the context and lookahead for a given character are derived: from simple character embedding and delayed output to enable lookahead, to complex convolutional neural networks (CNN) to capture context. We demonstrate that the accuracy of the proposed character-level models is competitive with the accuracy of a state-of-the-art word-level Conditional Random Field (CRF) baseline with carefully crafted features.
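The sequence tagging formulation in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the tag names (`O`, `COMMA`, `PERIOD`, `QUESTION`), the helper names `to_tagging_pair` and `restore`, and the convention of attaching each tag to the character immediately before the punctuation mark are all assumptions made for illustration. The abstract does not specify the tag set or the alignment.

```python
# Hypothetical tag set; the record does not list the paper's actual tags.
PUNCT = {",": "COMMA", ".": "PERIOD", "?": "QUESTION"}


def to_tagging_pair(text):
    """Split punctuated text into unpunctuated characters and per-character tags.

    Assumed convention: a punctuation mark becomes the tag of the character
    that precedes it; every other character is tagged "O" (no punctuation).
    """
    chars, tags = [], []
    for ch in text:
        if ch in PUNCT:
            if tags:  # ignore punctuation that has no preceding character
                tags[-1] = PUNCT[ch]
        else:
            chars.append(ch)
            tags.append("O")
    return chars, tags


def restore(chars, tags):
    """Rebuild punctuated text from characters and predicted tags."""
    inverse = {name: mark for mark, name in PUNCT.items()}
    out = []
    for ch, tag in zip(chars, tags):
        out.append(ch)
        if tag != "O":
            out.append(inverse[tag])
    return "".join(out)


chars, tags = to_tagging_pair("hi, you.")
print(list(zip(chars, tags)))
print(restore(chars, tags))
```

In this framing a model such as an LSTM would take `chars` as input and predict `tags`, one tag per character; `restore` then maps the predictions back to punctuated text.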
Pages: 2794-2798
Page count: 5
Related papers
50 in total
  • [1] Character-Level Convolutional Neural Network for Paraphrase Detection and Other Experiments
    Maraev, Vladislav
    Saedi, Chakaveh
    Rodrigues, Joao
    Branco, Antonio
    Silva, Joao
    ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE, 2018, 789 : 293 - 304
  • [2] A Character-Level Convolutional Neural Network for Predicting Exploitability of Vulnerability
    Lyu, Jinghui
    Bai, Yude
    Xing, Zhenchang
    Li, Xiaohong
    Ge, Weimin
    2021 INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF SOFTWARE ENGINEERING (TASE 2021), 2021, : 119 - 126
  • [3] Improving Bug Localization with Character-level Convolutional Neural Network and Recurrent Neural Network
    Xiao, Yan
    Keung, Jacky
    2018 25TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2018), 2018, : 703 - 704
  • [4] Character-level neural network for biomedical named entity recognition
    Gridach, Mourad
    JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 70 : 85 - 91
  • [5] Joint Word- and Character-level Embedding CNN-RNN Models for Punctuation Restoration
    Tundik, Mate Akos
    Szaszak, Gyorgy
    2018 9TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2018, : 135 - 140
  • [6] Web Application Firewall using Character-level Convolutional Neural Network
    Ito, Michiaki
    Iyatomi, Hitoshi
    2018 IEEE 14TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2018), 2018, : 103 - 106
  • [7] Neural Character-Level Syntactic Parsing for Chinese
    Li, Zuchao
    Zhou, Junru
    Zhao, Hai
    Zhang, Zhisong
    Li, Haonan
    Ju, Yuqi
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 73 : 461 - 509
  • [8] Neural Character-Level Syntactic Parsing for Chinese
    Li Z.
    Zhou J.
    Zhao H.
    Zhang Z.
    Li H.
    Ju Y.
    Journal of Artificial Intelligence Research, 2022, 73 : 461 - 509
  • [9] Neural Character-Level Dependency Parsing for Chinese
    Li, Haonan
    Zhang, Zhisong
    Ju, Yuqi
    Zhao, Hai
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5205 - 5212
  • [10] Disambiguation of biomedical acronyms based on a bidirectional recurrent neural network of character-level features
    Kai R.
    Na L.
    Wei X.
    Shi-Wen W.
    Journal of Engineering Science and Technology Review, 2019, 12 (06) : 105 - 112