Keyphrase Generation With CopyNet and Semantic Web

被引:2
|
作者
Zhu, Xun [1 ,2 ]
Lyu, Chen [3 ]
Ji, Donghong [1 ]
机构
[1] Wuhan Univ, Sch Cyber Sci & Engn, Minist Educ, Key Lab Aerosp Informat Secur & Trusted Comp, Wuhan 430072, Peoples R China
[2] Jianghan Univ, Sch Math & Comp Sci, Wuhan 430056, Peoples R China
[3] Guangdong Univ Foreign Studies, Collaborat Innovat Ctr Language Res & Serv, Guangzhou 510420, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
基金
中国国家自然科学基金; 国家教育部科学基金资助;
关键词
Task analysis; Semantic Web; Decoding; Semantics; Vocabulary; Transforms; Neural networks; Keyphrase generation; encoder-decoder model; copying mechanism; semantic web;
D O I
10.1109/ACCESS.2020.2977508
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Keyphrases provide core information for users to understand the document. Most previous works utilize machine learning based methods for keyphrases extraction and achieve promising performance. However, these methods focus on identify keyphrases from the input text, and can not extract keyphrases that do not appear in the text. In this paper, we present an encoder-decoder framework, which incorporating copying mechanism, to generate keyphrases for the given text. This framework (CopyNet) integrates the generation part and copying part. The generation part generates the keyphrase from the predefined vocabulary, and the copy part gets the keyphrases from the source text. Furthermore, we improve the CopyNet by using different probability of the two parts. To incorporate more related information for keyphrase generation, the automatically built keyphrase semantic web is merged into the dataset to participate in the training process of the neural network. Semantic similarity based and word co-occurrence based methods are used for keyphrase semantic web construction. We build a large-scale biomedical keyphrase dataset to evaluate the system performance. Experiments show that our improved CopyNet can achieve better performance with different portions of the generation and copying part, and the incorporation of the semantic web also effectively improves the keyphrase generation.
引用
收藏
页码:44202 / 44210
页数:9
相关论文
共 50 条
  • [1] Keyphrase extraction from Chinese news web pages based on semantic relations
    Xie, Fei
    Wu, Xindong
    Hu, Xue-Gang
    Wang, Fei-Yue
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2008, 5075 : 490 - 495
  • [2] Keyphrase extraction from Chinese news web pages based on semantic relations
    Xie, Fei
    Wu, Xindong
    Hu, Xue-Gang
    Wang, Fei-Yue
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2008, 5075 : 490 - +
  • [3] Deep Keyphrase Generation
    Meng, Rui
    Zhao, Sanqiang
    Han, Shuguang
    He, Daqing
    Brusilovsky, Peter
    Chi, Yu
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 582 - 592
  • [4] Enhancing Story Generation with the Semantic Web
    LaBouve, Eric
    Miller, Erik
    Khosmood, Foaad
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF DIGITAL GAMES (FDG'19), 2019,
  • [5] On the synthetic generation of semantic web schemas
    Theoharis, Yannis
    Georgakopoulos, George
    Christophides, Vassilis
    SEMANTIC WEB, ONTOLOGIES AND DATABASES, 2008, 5005 : 98 - +
  • [6] Next generation Semantic Web applications
    Motta, Enrico
    Sabou, Marta
    SEMANTIC WEB - ASWC 2006, PROCEEDINGS, 2006, 4185 : 24 - 29
  • [7] Automatic generation of semantic fields for resource discovery in the semantic web
    Navas, I
    Sanz, I
    Aldana, JF
    Berlanga, R
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, 3588 : 706 - 715
  • [8] Semantic Web approach to smart link generation for Web navigations
    Kao, Shang-Juh
    Hsu, I-Ching
    SOFTWARE-PRACTICE & EXPERIENCE, 2007, 37 (08): : 857 - 879
  • [9] Unsupervised Deep Keyphrase Generation
    Shen, Xianjie
    Wang, Yinghan
    Meng, Rui
    Shang, Jingbo
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11303 - 11311
  • [10] Unsupervised Keyphrase Extraction for Web Pages
    Haarman, Tim
    Zijlema, Bastiaan
    Wiering, Marco
    MULTIMODAL TECHNOLOGIES AND INTERACTION, 2019, 3 (03)