Metadata Generation for Multi-Text Classification in Structured Data

Authors
Trejo, Karla [1]
Garcia, Pere [1]
Puyol-Gruart, Josep [1]
Affiliations
[1] IIIA CSIC, UAB Campus, E-08193 Bellaterra, Catalonia, Spain
Keywords
text analysis; text mining; data formatting; multi-text classification; topology; metadata; structured data
DOI
10.3233/FAIA190154
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In today's information-saturated world, text analysis has become an indispensable resource for extracting useful data from massive amounts of text. A large portion of this information is unstructured, which has created a need for methodologies, such as Named Entity Recognition (NER), Part-of-Speech (PoS) tagging, N-grams, and Term Frequency-Inverse Document Frequency (TF-IDF), that can read and understand information based on its meaning, context and linguistic cohesion. However, these approaches fall short on their own when applied to data that is already structured. This paper proposes generating metadata that can simultaneously provide situational information from structured text data. Abstracting text as a "group of concepts" can boost the relevance of a word in a collection of documents, which allows a more refined separation of classes and better performance in multi-text classification tasks.
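The abstract does not describe how the metadata is built, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes one simple form of metadata: prefixing each token with the name of the field it came from, then comparing plain TF-IDF weights with weights over the tagged tokens. The records, field names, and helper functions (`plain_tokens`, `tagged_tokens`, `tf_idf`) are all hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (a deliberate simplification)."""
    return text.lower().split()


def plain_tokens(record):
    """Flatten a structured record into tokens, discarding field names."""
    tokens = []
    for value in record.values():
        tokens.extend(tokenize(value))
    return tokens


def tagged_tokens(record):
    """Assumed metadata scheme: prefix every token with its field name."""
    tokens = []
    for field, value in record.items():
        for tok in tokenize(value):
            tokens.append(f"{field}:{tok}")
    return tokens


def tf_idf(docs):
    """Textbook TF-IDF: (count / doc length) * log(N / document frequency)."""
    n_docs = len(docs)
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append(
            {
                term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
                for term, count in counts.items()
            }
        )
    return weights


# Hypothetical structured records; "apple" occurs in every record,
# but in different fields.
records = [
    {"title": "apple pie", "category": "dessert"},
    {"title": "apple laptop", "category": "hardware"},
    {"title": "cherry pie", "category": "apple"},
]

plain = tf_idf([plain_tokens(r) for r in records])
tagged = tf_idf([tagged_tokens(r) for r in records])

print("plain  'apple' in record 0:", plain[0]["apple"])
print("tagged 'title:apple' in record 0:", tagged[0]["title:apple"])
print("tagged 'category:apple' in record 2:", tagged[2]["category:apple"])
```

In this toy collection the untagged word "apple" appears in every record, so its inverse document frequency is zero and it carries no weight. Once the field name is attached, "title:apple" and "category:apple" become distinct terms with non-zero weights, which is one way structural context could raise the relevance of a word for classification.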
Pages: 417-421
Number of pages: 5