Data and Knowledge Organization for Natural Language Processing: Searching and Identifying Better Arrangements of Texts Based on Multimodal Information Architecture

被引:0
|
作者
Kuroki Junior, George Hideyuki [1 ,3 ]
Gottschalg-Duque, Claudio [2 ]
机构
[1] Univ Brasilia, Brasilia, DF, Brazil
[2] Univ Fed Goias, Artificial Intelligence Excellence Ctr, CEIA, Goiania, Brazil
[3] Univ Brasilia, Informat Sci Coll, Campus Darcy Ribeiro,Biblioteca Cent, BR-70910900 Brasilia, DF, Brazil
来源
SAGE OPEN | 2024年 / 14卷 / 01期
关键词
data arrangement; Information Science; Information Architecture; Information Treatment; artificial intelligence; natural language processing;
D O I
10.1177/21582440231177042
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Processing texts of multiple knowledge areas is a hard task. This article presents an Information Science contribution to natural language processing based on artificial neural networks through data arrangement. An extended concept of Information architecture was used, aggregating a multimodal view of organizing data. The Multimodal Information Architecture definition served as foundations for a five-step procedure to design, analyze and transform data used for artificial neural networks training and learning methods, complementing gaps identified by authors focused on Computer Science implementations. The proposal was validated with three datasets formed by texts coming from 16 knowledge areas. Results obtained through the usage of pre-processed data and raw data where compared. In each of the three datasets, the method identified arrangements which led to better and worst results, separating which corpus samples are more susceptible for knowledge extraction.
引用
收藏
页数:22
相关论文
共 14 条
  • [1] Knowledge-based Data Processing for Multilingual Natural Language Analysis
    Jain, Deepak Kumar
    Eyre, Yamila Garcia-Martinez
    Kumar, Akshi
    Gupta, Brij B.
    Kotecha, Ketan
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (05)
  • [2] Exploring Multimodal Data Approach in Natural Language Processing Based on Speech Recognition Algorithms
    Oleh, Basystiuk
    Ihor, Farmaha
    Zoriana, Rybchak
    2023 17TH INTERNATIONAL CONFERENCE ON THE EXPERIENCE OF DESIGNING AND APPLICATION OF CAD SYSTEMS, CADSM, 2023,
  • [3] Comparing general and medical texts for information retrieval based on natural language processing:: An inquiry into lexical disambiguation
    Ruch, P
    Baud, R
    Geissbühler, A
    Rassinoux, AM
    MEDINFO 2001: PROCEEDINGS OF THE 10TH WORLD CONGRESS ON MEDICAL INFORMATICS, PTS 1 AND 2, 2001, 84 : 261 - 265
  • [4] Identifying Patient Populations in Texts Describing Drug Approvals Through Deep Learning-Based Information Extraction: Development of a Natural Language Processing Algorithm
    Gendrin, Aline
    Souliotis, Leonidas
    Loudon-Griffiths, James
    Aggarwal, Ravisha
    Amoako, Daniel
    Desouza, Gregory
    Dimitrievska, Sashka
    Metcalfe, Paul
    Louvet, Emilie
    Sahni, Harpreet
    JMIR FORMATIVE RESEARCH, 2023, 7
  • [5] Research on the Technology of Data Mining and Knowledge Discovery based on Natural Language Processing Algorithm
    Sun, JiangYan
    2015 2ND INTERNATIONAL SYMPOSIUM ON ENGINEERING TECHNOLOGY, EDUCATION AND MANAGEMENT (ISETEM 2015), 2015, : 142 - 147
  • [6] INFORMATION-RETRIEVAL AND NATURAL-LANGUAGE - INTEGRATED PROCESSING OF DATA AND TEXTS IN THE CONDOR MODEL - GERMAN - FISCHER,HG
    ENGELBERT, H
    ZENTRALBLATT FUR BIBLIOTHEKSWESEN, 1985, 99 (03): : 127 - 129
  • [7] Information architecture applied on natural language processing: a proposal Information Science contributions on data pre- processing for training and learning of artificial neural networks
    Kuroki Junior, George Hideyuki
    Gottschalg-Duque, Claudio
    RDBCI-REVISTA DIGITAL DE BIBLIOTECONOMIA E CIENCIA DA INFORMACAO, 2023, 21
  • [8] Knowledge Graph Completion Algorithm Based on Probabilistic Fuzzy Information Aggregation and Natural Language Processing Technology
    Zhang, Canlin
    Lu, Kai
    MATHEMATICS, 2022, 10 (23)
  • [9] Information architecture applied on natural language processing: a proposal Information Science contributions on data preprocessing for training and learning of artificial neural networks
    Kuroki Junior, George Hideyuki
    Gottschalg-Duque, Claudio
    RDBCI-REVISTA DIGITAL DE BIBLIOTECONOMIA E CIENCIA DA INFORMACAO, 2023, 21
  • [10] Natural Language Processing to Extract Meaningful Information from a Corpus of Written Knowledge in Breast Cancer: Transforming Books into Data
    Catanuto, Giuseppe
    Rocco, Nicola
    Balafa, Konstantina
    Masannat, Yazan
    Karakatsanis, Andreas
    Maglia, Anna
    Barry, Peter
    Pappalardo, Francesco
    Nava, Maurizio Bruno
    Caruso, Francesco
    BREAST CARE, 2023, 18 (03) : 209 - 212