Knowledge Discovery with CRF-Based Clustering of Named Entities without a Priori Classes

被引:0
|
作者
Claveau, Vincent [1 ,2 ]
Ncibi, Abir [1 ,2 ]
机构
[1] IRISA CNRS, Campus Beaulieu, F-35042 Rennes, France
[2] INRIA IRISA, F-35042 Rennes, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge discovery aims at bringing out coherent groups of entities. It is usually based on clustering which necessitates defining a notion of similarity between the relevant entities. In this paper, we propose to divert a supervised machine learning technique (namely Conditional Random Fields, widely used for supervised labeling tasks) in order to calculate, indirectly and without supervision, similarities among text sequences. Our approach consists in generating artificial labeling problems on the data to reveal regularities between entities through their labeling. We describe how this framework can be implemented and experiment it on two information extraction/discovery tasks. The results demonstrate the usefulness of this unsupervised approach, and open many avenues for defining similarities for complex representations of textual data.
引用
收藏
页码:415 / 428
页数:14
相关论文
共 50 条
  • [31] ICRC-DSEDL: A Film Named Entity Discovery and Linking System Based on Knowledge Bases
    Zhao, YaHui
    Li, Haodi
    Chen, Qingcai
    Hu, Jianglu
    Zhang, Guangpeng
    Huang, Dong
    Tang, Buzhou
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: SEMANTIC, KNOWLEDGE, AND LINKED BIG DATA, 2016, 650 : 205 - 213
  • [32] Improved Density Based Spatial Clustering of Applications of Noise Clustering Algorithm for Knowledge Discovery in Spatial Data
    Sharma, Arvind
    Gupta, R. K.
    Tiwari, Akhilesh
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2016, 2016
  • [33] DKDD_C: A Clustering- Based Approach for Distributed Knowledge Discovery
    Bouraoui, Marwa
    Bezzezi, Houssem
    Touzi, Amel Grissa
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2016, PT II, 2016, 9713 : 187 - 197
  • [34] A clustering based on Self-Organizing Map and knowledge discovery by neural network
    Nakagawa, K
    Kamiura, N
    Hata, Y
    NEW PARADIGM OF KNOWLEDGE ENGINEERING BY SOFT COMPUTING, 2001, 5 : 273 - 296
  • [35] Knowledge Discovery on Functional Disabilities: Clustering Based on Rules versus other Approaches
    Gibert, K.
    Annicchiarico, R.
    Cortes, U.
    Caltagirone, C.
    CONNECTING MEDICAL INFORMATICS AND BIO-INFORMATICS, 2005, 116 : 163 - 168
  • [36] A clustering-based knowledge discovery process for data centre infrastructure management
    Diego García-Saiz
    Marta Zorrilla
    José Luis Bosque
    The Journal of Supercomputing, 2017, 73 : 215 - 226
  • [37] A clustering-based knowledge discovery process for data centre infrastructure management
    Garcia-Saiz, Diego
    Zorrilla, Marta
    Bosque, Jose Luis
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (01): : 215 - 226
  • [38] Clustering and rough set-based knowledge discovery for product family planning
    Zhou, C. J.
    Lin, Z. H.
    E-ENGINEERING & DIGITAL ENTERPRISE TECHNOLOGY, 2008, 10-12 : 45 - 50
  • [39] Explorative Hyperbolic-Tree-Based Clustering Tool for Unsupervised Knowledge Discovery
    Riegler, Michael
    Pogorelov, Konstantin
    Lux, Mathias
    Halvorsen, Pal
    Griwodz, Carsten
    de lange, Thomas
    Eskeland, Sigrun Losada
    2016 14TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2016,
  • [40] An image mining approach for clustering traffic behaviors based on knowledge discovery of image databases
    Fashandi, H
    Eftekhari-Moghadam, AM
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MEASUREMENT SYSTEMS AND APPLICATIONS, 2005, : 203 - 207