Venue Classification of Research Papers in Scholarly Digital Libraries

被引:1
|
作者
Caragea, Cornelia [1 ]
Florescu, Corina [2 ]
机构
[1] Kansas State Univ, Comp Sci, Manhattan, KS 66502 USA
[2] Univ North Texas, Comp Sci & Engn, Denton, TX 76207 USA
来源
DIGITAL LIBRARIES FOR OPEN KNOWLEDGE, TPDL 2018 | 2018年 / 11057卷
基金
美国国家科学基金会;
关键词
Text classification; Digital libraries; Venue classification;
D O I
10.1007/978-3-030-00066-0_11
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Open-access scholarly digital libraries crawl periodically a list of URLs in order to obtain appropriate collections of freely-available research papers. The metadata of the crawled papers, e.g., title, authors, and references, are automatically extracted before the papers are indexed in a digital library. The venue of publication is another important aspect about a scientific paper, which reflects its authoritativeness. However, the venue is not always readily available for a paper. Instead, it needs to be extracted from the references lists of other papers that cite the target paper. We explore a supervised learning approach to automatically classifying the venue of a research paper using information solely available from the content of the paper and show experimentally on a dataset of approximately 44,000 papers that this approach outperforms several baselines and prior work.
引用
收藏
页码:129 / 136
页数:8
相关论文
共 50 条