Big Scholarly Data in CiteSeerX: Information Extraction from the Web

被引:5
|
作者
Ororbia, Alexander G., II [1 ]
Wu, Jian [1 ]
Khabsa, Madian [1 ]
Williams, Kyle [1 ]
Giles, C. Lee [1 ]
机构
[1] Penn State Univ, University Pk, PA 16802 USA
关键词
scholarly big data; citeseerx; information acquisition and extraction; digital library search engine; intelligent systems; METADATA EXTRACTION; TABLE;
D O I
10.1145/2740908.2741736
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We examine CiteSeerX, an intelligent system designed with the goal of automatically acquiring and organizing largescale collections of scholarly documents from the world wide web. From the perspective of automatic information extraction and modes of alternative search, we examine various functional aspects of this complex system in order to investigate and explore ongoing and future research developments(1).
引用
收藏
页码:597 / 602
页数:6
相关论文
共 50 条
  • [21] Extraction of structural information from the web
    Murata, T
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 1204 - 1207
  • [22] Perceptions of credibility of scholarly information on the web
    Liu, ZM
    INFORMATION PROCESSING & MANAGEMENT, 2004, 40 (06) : 1027 - 1038
  • [23] Guest Editorial: Scholarly Big Data
    Xia, Feng
    Giles, C. Lee
    Liu, Huan
    Wang, Kuansan
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (01) : 200 - 203
  • [24] Scholarly Big Data Knowledge and Semantics
    Giles, C. Lee
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 371 - 371
  • [25] Information Extraction for Scholarly Digital Libraries
    Williams, Kyle
    Wu, Jian
    Wu, Zhaohui
    Giles, C. Lee
    2016 IEEE/ACM JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL), 2016, : 287 - 288
  • [26] Weaving Scholarly Legacy Data into Web of Data
    Latif, Atif
    Afzal, Muhammad Tanvir
    Maurer, Hermann
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2012, 18 (16) : 2301 - 2318
  • [27] Data extraction from Web data sources
    Robinson, J
    15TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, : 282 - 288
  • [28] Data cross-locating in web information extraction
    School of Software Engineering, South China University of Technology, Guangzhou 510006, China
    Huanan Ligong Daxue Xuebao, 2008, 5 (43-47+52): : 43 - 47
  • [29] Information Visualization Design of Web under the Background of Big Data
    Deng, Ran
    Ni, Taile
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [30] Associative Feature Information Extraction Using Text Mining from Health Big Data
    Kim, Joo-Chang
    Chung, Kyungyong
    WIRELESS PERSONAL COMMUNICATIONS, 2019, 105 (02) : 691 - 707