Learning from syntax generalizations for automatic semantic annotation

被引:0
|
作者
Guido Boella
Luigi Di Caro
Alice Ruggeri
Livio Robaldo
机构
[1] University of Turin,
关键词
Ontology learning; Automatic annotation; Information extraction;
D O I
暂无
中图分类号
学科分类号
摘要
Nowadays, there is a huge amount of textual data coming from on-line social communities like Twitter or encyclopedic data provided by Wikipedia and similar platforms. This Big Data Era created novel challenges to be faced in order to make sense of large data storages as well as to efficiently find specific information within them. In a more domain-specific scenario like the management of legal documents, the extraction of semantic knowledge can support domain engineers to find relevant information in more rapid ways, and to provide assistance within the process of constructing application-based legal ontologies. In this work, we face the problem of automatically extracting structured knowledge to improve semantic search and ontology creation on textual databases. To achieve this goal, we propose an approach that first relies on well-known Natural Language Processing techniques like Part-Of-Speech tagging and Syntactic Parsing. Then, we transform these information into generalized features that aim at capturing the surrounding linguistic variability of the target semantic units. These new featured data are finally fed into a Support Vector Machine classifier that computes a model to automate the semantic annotation. We first tested our technique on the problem of automatically extracting semantic entities and involved objects within legal texts. Then, we focus on the identification of hypernym relations and definitional sentences, demonstrating the validity of the approach on different tasks and domains.
引用
收藏
页码:231 / 246
页数:15
相关论文
共 50 条
  • [1] Learning from syntax generalizations for automatic semantic annotation
    Boella, Guido
    Di Caro, Luigi
    Ruggeri, Alice
    Robaldo, Livio
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2014, 43 (02) : 231 - 246
  • [2] An instance learning approach for automatic semantic annotation
    Shu, W
    Enhong, C
    COMPUTATIONAL AND INFORMATION SCIENCE, PROCEEDINGS, 2004, 3314 : 962 - 968
  • [3] Learning Semantic Concepts from Noisy Media Collection for Automatic Image Annotation
    TIAN Feng
    SHEN Xukun
    Chinese Journal of Electronics, 2015, 24 (04) : 790 - 794
  • [4] Learning Semantic Concepts from Noisy Media Collection for Automatic Image Annotation
    Tian Feng
    Shen Xukun
    CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (04) : 790 - 794
  • [5] Semantic disambiguation in Automatic Semantic Annotation
    Qi, Xin
    Xiao, Min
    2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL IV, 2010, : 64 - 67
  • [6] Semantic Disambiguation in Automatic Semantic Annotation
    Qi, Xin
    Xiao, Min
    APPLIED INFORMATICS AND COMMUNICATION, PT 4, 2011, 227 : 135 - 142
  • [7] Learning Objects Automatic Semantic Annotation by Learner Relevance Feedback
    Zhang, Tong-Zhen
    Shen, Rui-Ming
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 2279 - 2282
  • [8] Review of the application of machine learning to the automatic semantic annotation of images
    Olaode, Abass
    Naghdy, Golshah
    IET IMAGE PROCESSING, 2019, 13 (08) : 1232 - 1245
  • [9] Automatic Image Annotation by Sequentially Learning From Multi-Level Semantic Neighborhoods
    Li, Houjie
    Li, Wei
    Zhang, Hongda
    He, Xin
    Zheng, Mingxiao
    Song, Haiyu
    IEEE ACCESS, 2021, 9 : 135742 - 135754
  • [10] Image Semantic Description and Automatic Semantic Annotation
    Liang Meiyu
    Du Junping
    Jia Yingmin
    Sun Zengqi
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 1192 - 1195