A novel machine-learning approach to measuring scientific knowledge flows using citation context analysis

被引:53
|
作者
Saeed-Ul Hassan [1 ]
Safder, Iqra [1 ]
Akram, Anam [1 ]
Kamiran, Faisal [1 ]
机构
[1] Informat Technol Univ, 346-B,Ferozepur Rd, Lahore 54700, Pakistan
关键词
Knowledge flows; Machine learning; Citation context classification; Influential citations; Citation analysis; INFORMATION-SCIENCE; PATENT CITATIONS; INSTITUTIONS; SPECIALTY; DIFFUSION; SPACE; US;
D O I
10.1007/s11192-018-2767-x
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We measure the knowledge flows between countries by analysing publication and citation data, arguing that not all citations are equally important. Therefore, in contrast to existing techniques that utilize absolute citation counts to quantify knowledge flows between different entities, our model employs a citation context analysis technique, using a machine-learning approach to distinguish between important and non-important citations. We use 14 novel features (including context-based, cue words-based and text-based) to train a Support Vector Machine (SVM) and Random Forest classifier on an annotated dataset of 20,527 publications downloaded from the Association for Computational Linguistics anthology (http://allenai.org/data.html). Our machine-learning models outperform existing state-of-the-art citation context approaches, with the SVM model reaching up to 61% and the Random Forest model up to a very encouraging 90% Precision-Recall Area Under the Curve, with 10-fold cross-validation. Finally, we present a case study to explain our deployed method for datasets of PLoS ONE full-text publications in the field of Computer and Information Sciences. Our results show that a significant volume of knowledge flows from the United States, based on important citations, are consumed by the international scientific community. Of the total knowledge flow from China, we find a relatively smaller proportion (only 4.11%) falling into the category of knowledge flow based on important citations, while The Netherlands and Germany show the highest proportions of knowledge flows based on important citations, at 9.06 and 7.35% respectively. Among the institutions, interestingly, the findings show that at the University of Malaya more than 10% of the knowledge produced falls into the category of important. We believe that such analyses are helpful to understand the dynamics of the relevant knowledge flows across nations and institutions.
引用
收藏
页码:973 / 996
页数:24
相关论文
共 50 条
  • [41] Predicting Vehicles' Positions using Roadside Units: a Machine-Learning Approach
    Sangare, Mamoudou
    Banerjee, Soumya
    Muhlethaler, Paul
    Bouzefrane, Samia
    2018 IEEE CONFERENCE ON STANDARDS FOR COMMUNICATIONS AND NETWORKING (IEEE CSCN), 2018,
  • [42] An Efficient Approach to Recognize Hand Gestures Using Machine-Learning Algorithms
    Wahid, Md Ferdous
    Tafreshi, Reza
    Al-Sowaidi, Mubarak
    Langari, Reza
    2018 IEEE 4TH MIDDLE EAST CONFERENCE ON BIOMEDICAL ENGINEERING (MECBME), 2018, : 171 - 176
  • [43] NOVEL MACHINE-LEARNING ANALYSIS TO PREDICT OUTCOMES DURING INPATIENT REHABILITATION
    Wu, B.
    Upadhyaya, P.
    Savitz, S.
    Jiang, X.
    Shams, S.
    INTERNATIONAL JOURNAL OF STROKE, 2021, 16 (2_SUPPL) : 38 - 38
  • [44] Detection of Colchicum autumnale in drone images, using a machine-learning approach
    Petrich, Lukas
    Lohrmann, Georg
    Neumann, Matthias
    Martin, Fabio
    Frey, Andreas
    Stoll, Albert
    Schmidt, Volker
    PRECISION AGRICULTURE, 2020, 21 (06) : 1291 - 1303
  • [45] Machine-learning Approach for the Development of a Novel Predictive Model for the Diagnosis of Hepatocellular Carcinoma
    Sato, Masaya
    Morimoto, Kentaro
    Kajihara, Shigeki
    Tateishi, Ryosuke
    Shiina, Shuichiro
    Koike, Kazuhiko
    Yatomi, Yutaka
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [46] A novel infrasound and audible machine-learning approach to the diagnosis of COVID-19
    Dori, Guy
    Bachner-Hinenzon, Noa
    Kasim, Nour
    Zaidani, Haitem
    Perl, Sivan Haia
    Maayan, Shlomo
    Shneifi, Amin
    Kian, Yousef
    Tiosano, Tuvia
    Adler, Doron
    Adir, Yochai
    ERJ OPEN RESEARCH, 2022, 8 (04)
  • [47] Machine-learning Approach for the Development of a Novel Predictive Model for the Diagnosis of Hepatocellular Carcinoma
    Masaya Sato
    Kentaro Morimoto
    Shigeki Kajihara
    Ryosuke Tateishi
    Shuichiro Shiina
    Kazuhiko Koike
    Yutaka Yatomi
    Scientific Reports, 9
  • [48] A Novel Machine-Learning Approach to Predict Stress-Responsive Genes in Arabidopsis
    Nazari, Leyla
    Ghotbi, Vida
    Nadimi, Mohammad
    Paliwal, Jitendra
    ALGORITHMS, 2023, 16 (09)
  • [49] Measuring executive personality using machine-learning algorithms: A new approach and audit fee-based validation tests
    Hrazdil, Karel
    Novak, Jiri
    Rogo, Rafael
    Wiedman, Christine
    Zhang, Ray
    JOURNAL OF BUSINESS FINANCE & ACCOUNTING, 2020, 47 (3-4) : 519 - 544
  • [50] Scientific and Technological Knowledge Flow and Technological Innovation: Quantitative Approach Using Patent Citation
    Park, Hyun-Woo
    Suh, Sang-Hyuk
    Lee, Jong-Taik
    2011 PROCEEDINGS OF PICMET 11: TECHNOLOGY MANAGEMENT IN THE ENERGY-SMART WORLD (PICMET), 2011,