Classification of Keyphrases using Random Forest

被引:0
|
作者
Tovar Vidal, Mireya [1 ]
Flores Petlacalco, Gerardo [1 ]
Montes Rendon, Azucena [2 ]
Contreras Gonzalez, Meliza [1 ]
Cervantes Marquez, Ana Patricia [1 ]
机构
[1] Benemerita Univ Autonoma Puebla, Fac Comp Sci, Puebla, Mexico
[2] Inst Tecnol Tlalpan, TecNM, Mexico City, DF, Mexico
关键词
Keyphrases; Natural Language Processing; Machine Learning; Latent Semantic Analysis; LATENT SEMANTIC ANALYSIS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keyphrases are words or phrases from a document that can describe its meaning. A keyphrase integrates the general idea of a document and implicitly contains the resources that the author used during the development of its research to achieve his goal. Therefore, there is a need to create classification models that allow the clustering of keyphrases according to their content for simplify reading. In this paper, keyphrases classification from scientific publications based on LSA and some classifying techniques is proposed and implemented. The aim is to create a classification model based on the extraction of features from the input corpus, without enriching it using external resources such as Wikipedia or online resources. Process, task, and material are the classes considered from Computer Science, Material Sciences, and Physics publications domains. Results show that Random Forest was found to be the best classification technique of keyphrases with 60% of measure-F-1.
引用
收藏
页码:506 / 511
页数:6
相关论文
共 50 条
  • [21] CLASSIFICATION OF URBAN ENVIRONMENTS USING FEATURE EXTRACTION AND RANDOM FOREST
    dos Anjos, Camila Souza
    Lacerda, Marielcio Goncalves
    Andrade, Leidiane do Livramento
    Salles, Roberto Neves
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 1205 - 1208
  • [22] Adrenal Gland Abnormality Detection Using Random Forest Classification
    Saiprasad, Ganesh
    Chang, Chein-I
    Safdar, Nabile
    Saenz, Naomi
    Siegel, Eliot
    JOURNAL OF DIGITAL IMAGING, 2013, 26 (05) : 891 - 897
  • [23] An Acceleration of a Random Forest Classification using Altera SDK for OpenCL
    Nakahara, Hiroki
    Jinguji, Akira
    Fujii, Tomonori
    Sato, Simpei
    2016 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2016, : 289 - 292
  • [24] Automatic structure classification of small proteins using random forest
    Jain, Pooja
    Hirst, Jonathan D.
    BMC BIOINFORMATICS, 2010, 11
  • [25] Classification of Continuous Sky Brightness Data Using Random Forest
    Priyatikanto, Rhorom
    Mayangsari, Lidia
    Prihandoko, Rudi A.
    Admiranto, Agustinus G.
    ADVANCES IN ASTRONOMY, 2020, 2020
  • [26] Defocus blur radius classification using Random Forest Classifier
    Gajjar, Ruchi
    Zaveri, Tanish
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN ELECTRONICS, SIGNAL PROCESSING AND COMMUNICATION (IESC), 2017, : 219 - 223
  • [27] Malignant Brain Tumor Classification Using the Random Forest Method
    Zhang, Lichi
    Zhang, Han
    Rekik, Islem
    Gao, Yaozong
    Wang, Qian
    Shen, Dinggang
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2018, 2018, 11004 : 14 - 21
  • [28] Automatic structure classification of small proteins using random forest
    School of Chemistry, The University of Nottingham, University Park, Nottingham, NG7 2RD, United Kingdom
    BMC Bioinform.,
  • [29] Secondary triage classification using an ensemble random forest technique
    Azeez, Dhifaf
    Gan, K. B.
    Ali, M. A. Mohd
    Ismail, M. S.
    TECHNOLOGY AND HEALTH CARE, 2015, 23 (04) : 419 - 428
  • [30] Automatic habitat classification using image analysis and random forest
    Torres, Mercedes
    Qiu, Guoping
    ECOLOGICAL INFORMATICS, 2014, 23 : 126 - 136