Generic features selection for structure classification of diverse styled scholarly articles

被引:1
|
作者
Waqas, Muhammad [1 ]
Anjum, Nadeem [1 ]
机构
[1] Capital Univ Sci & Technol, Dept Comp Sci, ICT, Expressway,Kahuta Rd,Zone 5, Islamabad, Pakistan
关键词
Features Engineering; Machine Learning; Research Article; Metadata Extraction; Text mining; KNOWLEDGE; SYSTEM;
D O I
10.1007/s11042-023-16128-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The enormous growth in online research publications in diversified domains has attracted the research community to extract these valuable scientific resources by searching online digital libraries and publishers' websites. A precise search is desired to enlist most related articles by applying semantic queries to the document's metadata and the structural elements. The online search engines and digital libraries offer only keyword-based search on full-body text, which creates excessive results. Therefore, the research article's structural and metadata information has to be stored in machine comprehendible form by the online research publishers. The research community in recent years has adopted different approaches to extract structural information from research documents like rule-based heuristics and machine-learning-based approaches. Studies suggest that machine-learning-based techniques have produced optimum results for document structure extraction from publishers having diversified publication layouts. In this paper, we have proposed thirteen different logical layout structural (LLS) components. We have identified a two-staged innovative set of generic features that are associated with the LLS. This approach has given our technique an advantage against the state-of-the-art for structural classification of digital scientific articles with diversified publication styles. We have applied chi-square (chi(2)) for feature selection, and the final result has revealed that SVM (Kernal function) has produced an optimum result with an overall F-measure of 0.95.
引用
收藏
页码:16623 / 16655
页数:33
相关论文
共 50 条
  • [31] Selection of Features for Multimodal Vocalic Segments Classification
    Zaporowski, Szymon
    Czyzewski, Andrzej
    MULTIMEDIA AND NETWORK INFORMATION SYSTEMS, 2019, 833 : 490 - 500
  • [32] Online Feature Selection with Streaming Features for Classification
    You D.-L.
    Guo S.
    Zhao C.-H.
    Yuan F.-Y.
    Shen L.-M.
    Chen Z.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (02): : 321 - 332
  • [33] Features extraction and selection for emotional speech classification
    Xiao, ZZ
    Dellandrea, E
    Dou, WB
    Chen, LM
    AVSS 2005: ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, PROCEEDINGS, 2005, : 411 - 416
  • [34] Features selection and classification to estimate elbow movements
    Rubiano, A.
    Ramirez, J. L.
    El Korso, M. N.
    Jouandeau, N.
    Gallimard, L.
    Polit, O.
    5TH INTERNATIONAL WORKSHOP ON NEW COMPUTATIONAL METHODS FOR INVERSE PROBLEMS (NCMIP2015), 2015, 657
  • [35] Entropic Selection of Histogram Features for Efficient Classification
    Utasi, Akos
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 264 - 272
  • [36] News articles classification using random forests and weighted multimodal features
    Liparas, Dimitris
    HaCohen-Kerner, Yaakov
    Moumtzidou, Anastasia
    Vrochidis, Stefanos
    Kompatsiaris, Ioannis
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8849 : 63 - 75
  • [38] Sensor Faults Detection and Classification using SVM with Diverse Features
    Jan, Sana Ullah
    Koo, In Soo
    2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 576 - 578
  • [39] Structure features for SAT instances classification
    Ansotegui, Carlos
    Luisa Bonet, Maria
    Giraldez-Cru, Jesus
    Levy, Jordi
    JOURNAL OF APPLIED LOGIC, 2017, 23 : 27 - 39
  • [40] Quality of Wikipedia Articles: Analyzing Features and Building a Ground Truth for Supervised Classification
    Bassani, Elias
    Viviani, Marco
    KDIR: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL 1: KDIR, 2019, : 338 - 346