The VINITI RAS Automatic Text Classification System for Processing the Flow of Scientific Publications

被引:1
|
作者
Egorov, V. S. [1 ]
Kozlova, E. S. [1 ]
Lomotin, K. E. [1 ]
Fedorets, O. V. [1 ]
Filimonov, A. V. [1 ]
Shapkin, A. V. [1 ]
机构
[1] Russian Acad Sci, All Russian Inst Sci & Tech Informat VINITI, Moscow 125315, Russia
关键词
automatic text classification; Word2Vec; machine learning; perceptron; logistic regression; natural language processing; production technology of the information center;
D O I
10.3103/S0005105520030048
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the results of the development and testing of an automatic classification system for scientific texts that provides the functionality to determine the topic of texts by three classification schemes in batch and dialog modes. The structural and functional components, the methods used to assess the quality of classification, the teaching methodology, the selection of the optimal classification model, and the main areas for the introduction of an automatic classifier in the processing of electronic document flow at the VINITI RAS are described.
引用
收藏
页码:113 / 123
页数:11
相关论文
共 41 条
  • [21] A multiprocessor based automatic hail flow image processing system
    Li, XQ
    Wee, WG
    Niu, AQ
    HYBRID IMAGE AND SIGNAL PROCESSING VI, 1998, 3389 : 124 - 132
  • [22] Automatic Image Processing Filter Generation for Visual Defects Classification System
    Hata, Seiji
    Hayashi, Junichiro
    2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS, VOLS 1 AND 2, 2009, : 486 - 491
  • [23] AUTOMATIC SYSTEM FOR BLOOD TYPE CLASSIFICATION USING IMAGE PROCESSING TECHNIQUES
    Ferraz, Ana
    Moreira, Vania
    Silva, Diana
    Carvalho, Vitor
    Soares, Filomena O.
    BIODEVICES 2011, 2011, : 368 - 373
  • [24] Automatic Lung-RADS™ classification with a natural language processing system
    Beyer, Sebastian E.
    McKee, Brady J.
    Regis, Shawn M.
    McKee, Andrea B.
    Flacke, Sebastian
    El Saadawi, Gilan
    Wald, Christoph
    JOURNAL OF THORACIC DISEASE, 2017, 9 (09) : 3114 - +
  • [25] The scientometric analysis of publications based on the materials of the peer-reviewed journal The Physics of Nanoobjects and Nanotechnologies of the All-Union Institute for Scientific and Technical Information of the Russian Academy of Sciences (VINITI RAS): Graphene
    Builova N.M.
    Osipov A.I.
    Scientific and Technical Information Processing, 2011, 38 (4) : 285 - 289
  • [26] Natural Language Processing System for Text Classification Corpus Based on Machine Learning
    Su, Yawen
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (08)
  • [27] AUTOMATIC EXTRACTION AND CLASSIFICATION OF PATIENTS' SMOKING STATUS FROM FREE TEXT USING NATURAL LANGUAGE PROCESSING
    Caccamisi, A.
    JorgensenJorgensen, L.
    Dalianis, H.
    Rosenlund, M.
    VALUE IN HEALTH, 2016, 19 (07) : A373 - A373
  • [28] Improving the classification accuracy of automatic text processing systems using context vectors and back-propagation algorithms
    Farkas, J
    1996 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING - CONFERENCE PROCEEDINGS, VOLS I AND II: THEME - GLIMPSE INTO THE 21ST CENTURY, 1996, : 696 - 699
  • [29] Automatic detecting system for traffic flow of bicycles based on pattern recognition and classification
    Wang, GZ
    Liang, B
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 4360 - 4363
  • [30] BIOPHYSICAL SIGNAL PROCESSING FOR AUTOMATIC ANXIETY CLASSIFICATION IN A VIRTUAL REALITY EXPOSURE THERAPY SYSTEM
    Jinga, Nicolae
    Petrescu, Catalin-Dumitru
    Mitrut, Oana
    Moldoveanu, Alin
    Moldoveanu, Florica
    Petrescu, Livia
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2024, 86 (02): : 73 - 84