Classification of Keyphrases using Random Forest

Cited by: 0
Authors
Tovar Vidal, Mireya [1]
Flores Petlacalco, Gerardo [1]
Montes Rendon, Azucena [2]
Contreras Gonzalez, Meliza [1]
Cervantes Marquez, Ana Patricia [1]
Affiliations
[1] Benemerita Univ Autonoma Puebla, Fac Comp Sci, Puebla, Mexico
[2] Inst Tecnol Tlalpan, TecNM, Mexico City, DF, Mexico
Keywords
Keyphrases; Natural Language Processing; Machine Learning; Latent Semantic Analysis
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Keyphrases are words or phrases from a document that describe its meaning. A keyphrase captures the general idea of a document and implicitly reflects the resources the authors used in the course of their research to achieve their goal. There is therefore a need for classification models that group keyphrases according to their content in order to simplify reading. This paper proposes and implements a classification of keyphrases from scientific publications based on Latent Semantic Analysis (LSA) and several classification techniques. The aim is to build a classification model from features extracted from the input corpus alone, without enriching it with external resources such as Wikipedia or other online resources. The classes considered are Process, Task, and Material, drawn from publications in the Computer Science, Materials Science, and Physics domains. The results show that Random Forest was the best technique for classifying keyphrases, with an F1-measure of 60%.
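The abstract reports an F1-measure but does not say how it is averaged over the three classes. As a minimal sketch, assuming per-class F1 combined by an unweighted (macro) average, the measure could be computed as follows. The function names `per_class_f1` and `macro_f1` and the toy label lists are hypothetical illustrations, not the authors' code or data.

```python
# Sketch only: the averaging scheme (macro) is an assumption, since the
# abstract does not state how the reported F1-measure was aggregated.
CLASSES = ("Process", "Task", "Material")


def per_class_f1(gold, predicted, label):
    """Return the F1-measure of one class from two parallel label lists."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == label and p == label)
    fp = sum(1 for g, p in pairs if g != label and p == label)
    fn = sum(1 for g, p in pairs if g == label and p != label)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    # F1 is the harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


def macro_f1(gold, predicted, classes=CLASSES):
    """Return the unweighted mean of the per-class F1 values."""
    return sum(per_class_f1(gold, predicted, c) for c in classes) / len(classes)


# Hypothetical toy labels, invented purely for illustration.
gold = ["Process", "Task", "Material", "Material", "Process", "Task"]
predicted = ["Process", "Material", "Material", "Material", "Task", "Task"]

if __name__ == "__main__":
    for name in CLASSES:
        print(name, round(per_class_f1(gold, predicted, name), 3))
    print("macro F1", round(macro_f1(gold, predicted), 3))
```

A macro average weights the three classes equally regardless of how many keyphrases each contains; a micro or support-weighted average would give a different figure on an imbalanced corpus.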
Pages: 506-511
Page count: 6