Big Scholarly Data in CiteSeerX: Information Extraction from the Web

Cited by: 5
Authors
Ororbia, Alexander G., II [1 ]
Wu, Jian [1 ]
Khabsa, Madian [1 ]
Williams, Kyle [1 ]
Giles, C. Lee [1 ]
Affiliations
[1] Penn State Univ, University Pk, PA 16802 USA
Keywords
scholarly big data; CiteSeerX; information acquisition and extraction; digital library search engine; intelligent systems; metadata extraction; table
DOI
10.1145/2740908.2741736
Chinese Library Classification (CLC) number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
We examine CiteSeerX, an intelligent system designed to automatically acquire and organize large-scale collections of scholarly documents from the World Wide Web. From the perspective of automatic information extraction and alternative modes of search, we examine various functional aspects of this complex system in order to explore ongoing and future research developments.
Pages: 597-602
Number of pages: 6
Related papers
50 in total
  • [1] A Web Service for Scholarly Big Data Information Extraction
    Williams, Kyle
    Li, Lichi
    Khabsa, Madian
    Wu, Jian
    Shih, Patrick C.
    Giles, C. Lee
    2014 IEEE 21ST INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2014), 2014: 105-112
  • [2] Scholarly Big Data: Information Extraction and Data Mining
    Giles, C. Lee
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013: 1-1
  • [3] CiteSeerX-2018: A Cleansed Multidisciplinary Scholarly Big Dataset
    Wu, Jian
    Kandimalla, Bharath
    Rohatgi, Shaurya
    Sefid, Athar
    Mao, Jianyu
    Giles, C. Lee
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018: 5465-5467
  • [4] Scholarly Big Data Information Extraction and Integration in the CiteSeerχ Digital Library
    Williams, Kyle
    Wu, Jian
    Choudhury, Sagnik Ray
    Khabsa, Madian
    Giles, C. Lee
    2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2014: 68-73
  • [5] Scholarly information web
    Ricart, Glenn
    Computers in Physics, 1995, 9 (04):
  • [6] A survey on scholarly data: From big data perspective
    Khan, Samiya
    Liu, Xiufeng
    Shakil, Kashish A.
    Alam, Mansaf
    INFORMATION PROCESSING & MANAGEMENT, 2017, 53(4): 923-944
  • [7] Big Data and the Web: Discovering Meaningful Information from Web Data using Data Mining Techniques
    Abd Wahab, Mohd Helmy
    2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015,
  • [8] Dynamic Information Extraction for the Big Data
    Jin, Xue-bo
    Dou, Chao
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016: 1027-1030
  • [9] VizioMetrix: A Platform for Analyzing the Visual Information in Big Scholarly Data
    Lee, Po-Shen
    West, Jevin D.
    Howe, Bill
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016: 413-418
  • [10] Joint Information Extraction from the Web Using Linked Data
    Augenstein, Isabelle
    SEMANTIC WEB - ISWC 2014, PT II, 2014, 8797: 505-512