Big Scholarly Data in CiteSeerX: Information Extraction from the Web

被引:5
|
作者
Ororbia, Alexander G., II [1 ]
Wu, Jian [1 ]
Khabsa, Madian [1 ]
Williams, Kyle [1 ]
Giles, C. Lee [1 ]
机构
[1] Penn State Univ, University Pk, PA 16802 USA
关键词
scholarly big data; citeseerx; information acquisition and extraction; digital library search engine; intelligent systems; METADATA EXTRACTION; TABLE;
D O I
10.1145/2740908.2741736
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We examine CiteSeerX, an intelligent system designed with the goal of automatically acquiring and organizing largescale collections of scholarly documents from the world wide web. From the perspective of automatic information extraction and modes of alternative search, we examine various functional aspects of this complex system in order to investigate and explore ongoing and future research developments(1).
引用
收藏
页码:597 / 602
页数:6
相关论文
共 50 条
  • [31] The Necessity of Information Extraction from Big Data Systems for the Purpose of Business Process Optimization
    Shoilekova, Kamelia
    Ivanova, Boyana
    SOFTWARE ENGINEERING PERSPECTIVES IN SYSTEMS, VOL. 1, 2022, 501 : 48 - 54
  • [32] Associative Feature Information Extraction Using Text Mining from Health Big Data
    Joo-Chang Kim
    Kyungyong Chung
    Wireless Personal Communications, 2019, 105 : 691 - 707
  • [33] Web Page Recommendation from Sparse Big Web Data
    Leung, Carson K.
    Jiang, Fan
    Souza, Joglas
    2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 592 - 597
  • [34] FNG-IE: an improved graph-based method for keyword extraction from scholarly big-data
    Tahir, Noman
    Asif, Muhammad
    Ahmad, Shahbaz
    Malik, Muhammad Sheraz Arshad
    Aljuaid, Hanan
    Butt, Muhammad Arif
    Rehman, Mobashar
    PEERJ COMPUTER SCIENCE, 2021, PeerJ Inc. (07) : 1 - 24
  • [35] From big data to important information
    Bar-Yam, Yaneer
    COMPLEXITY, 2016, 21 (S2) : 73 - 98
  • [36] From Big Data to Big Information and Big Knowledge: the Case of Earth Observation Data
    Bereta, Konstantina
    Koubarakis, Manolis
    Manegold, Stefan
    Stamoulis, George
    Demir, Beguem
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 2293 - 2294
  • [37] Remotely Sensed Big Data Era and Intelligent Information Extraction
    Zhang B.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2018, 43 (12): : 1861 - 1871
  • [38] From Big Scholarly Data to Solution-Oriented Knowledge Repository
    Zhang, Yu
    Wang, Min
    Saberi, Morteza
    Chang, Elizabeth
    FRONTIERS IN BIG DATA, 2019, 2
  • [39] context generalization for information extraction from the web
    Habegger, B
    Quafafou, M
    IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, : 720 - 723
  • [40] Information extraction from a whole Web site
    Gao, Xiaoying
    Zhang, Mengjie
    ADVANCES IN INTELLIGENT IT: ACTIVE MEDIA TECHNOLOGY 2006, 2006, 138 : 52 - +