Mining Innovative Topics Based on Deep Learning

被引:0
|
作者
Fu C. [1 ]
Qian L. [1 ,2 ]
Zhang H. [3 ]
Zhao H. [1 ]
Xie J. [1 ,2 ]
机构
[1] National Science Library, Chinese Academy of Sciences, Beijing
[2] Department of Library, Information and Archives Management, University of Chinese Academy of Sciences, Beijing
[3] School of Computer Science & Technology, Beijing Institute of Technology, Beijing
关键词
Deep Learning; Innovative Topic; Intelligent Mining; Seq2Seq;
D O I
10.11925/infotech.2096-3467.2018.1365
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
[Objective] This paper aims to identify innovative topics from massive volumes of texts. [Methods] First, we extracted knowledge points with heavier weights from the data of scholarly knowledge graph. Then, these knowledge points were labeled as innovative seeds from the perspectives of “popularity”, “novelty” and “authority”. Third, we computed the knowledge correlation of the innovative seeds. Finally, the results were input to a deep learning model trained by large amounts of sci-tech papers to generate innovative topics. Note: the model is sequence to sequence with Bi-LSTM. [Results] We used Chinese research papers on artificial intelligence as the experimental data and found the average innovation score of the retrieved topics was 6.52, which were evaluated by experts manually. [Limitations] At present, contents of the knowledge graph and the training datasets need to be improved. [Conclusions] The proposed model, which identifies innovative topics from scholarly papers, could be optimized in the future. © 2023 Chin J Gen Pract. All rights reserved.
引用
收藏
页码:46 / 54
页数:8
相关论文
共 29 条
  • [1] Wang Shan, Wang Huiju, Qin Xiongpai, Et al., Architecting Big Data: Challenges, Studies and Forecasts, Chinese Journal of Computers, 34, 10, pp. 1741-1752, (2011)
  • [2] Li Jiaqing, Approches and Strategies of Knowledge Organizations, Library and Information Service, 49, 5, pp. 41-44, (2005)
  • [3] Su Xinning, Research on Knowledge Service-oriented Knowledge Organizations, Information and Documentation Services, 36, 1, (2015)
  • [4] Wen Youkui, Wen Hao, Xu Duanyi, Et al., Knowledge Element Mining in Knowledge Management, Journal of the China Society for Scientific and Technical Information, 24, 6, pp. 663-668, (2005)
  • [5] Klavans J L, Muresan S., DEFINDER: Rule-based Methods for the Extraction of Medical Terminology and Their Associated Definitions from On-line Text, AMIA Annual Symposium Proceedings, (1999)
  • [6] Liu B, Chin C W, Ng H T., Mining Topic-specific Concepts and Definitions on the Web, Proceedings of International Conference on World Wide Web, pp. 251-260, (2003)
  • [7] Leng Fuhai, Bai Rujiang, Zhu Qingsong, A Hybrid Semantic Information Extraction Method for Scientific Research Papers, Library and Information Service, 57, 11, pp. 112-119, (2013)
  • [8] Zhang Fan, Le Xiaoqiu, Research on Innovation Points Extraction from Scientific Research Paper Based on Field Thesaurus, New Technology of Library and Information Service, 9, pp. 15-21, (2014)
  • [9] Mikolov T, Chen K, Corrado G, Et al., Efficient Estimation of Word Representations in Vector Space
  • [10] Mikolov T, Sutskever I, Chen K, Et al., Distributed Representations of Words and Phrases and Their Compositionality[C], Proceedings of International Conference on Neural Information Processing Systems, pp. 3111-3119, (2013)