Knowledge Discovery with CRF-Based Clustering of Named Entities without a Priori Classes

Authors
Claveau, Vincent [1 ,2 ]
Ncibi, Abir [1 ,2 ]
Affiliations
[1] IRISA CNRS, Campus Beaulieu, F-35042 Rennes, France
[2] INRIA IRISA, F-35042 Rennes, France
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Knowledge discovery aims at bringing out coherent groups of entities. It is usually based on clustering, which requires defining a notion of similarity between the relevant entities. In this paper, we propose to repurpose a supervised machine learning technique (namely Conditional Random Fields, widely used for supervised labeling tasks) in order to calculate, indirectly and without supervision, similarities among text sequences. Our approach consists of generating artificial labeling problems on the data to reveal regularities between entities through their labeling. We describe how this framework can be implemented and evaluate it on two information extraction/discovery tasks. The results demonstrate the usefulness of this unsupervised approach and open many avenues for defining similarities for complex representations of textual data.
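The idea of deriving similarities from artificial labeling problems can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a plausible reading of the abstract in which random labels are assigned to part of the data, a model is trained on them, and entities that repeatedly receive the same predicted label are counted as similar. A simple feature-vote classifier is swapped in for the Conditional Random Field so that the example needs only the standard library. All names (`train_voter`, `predict`, `colabel_similarity`, the toy entities and their context features) are hypothetical.

```python
import random
from collections import Counter, defaultdict
from itertools import combinations

def train_voter(examples):
    """Stand-in for CRF training: each feature remembers the labels it co-occurred with."""
    votes = defaultdict(Counter)
    for feats, label in examples:
        for f in feats:
            votes[f][label] += 1
    return votes

def predict(votes, feats):
    """Return the label with the most feature votes (ties broken by smallest label)."""
    tally = Counter()
    for f in feats:
        tally.update(votes.get(f, {}))
    if not tally:
        return None
    return max(sorted(tally), key=lambda lab: tally[lab])

def colabel_similarity(items, rounds=200, n_labels=3, seed=0):
    """Estimate pairwise similarity as the rate at which two items get the same label."""
    rng = random.Random(seed)
    names = sorted(items)
    together, seen = Counter(), Counter()
    for _ in range(rounds):
        shuffled = names[:]
        rng.shuffle(shuffled)
        half = len(shuffled) // 2
        train, apply_set = shuffled[:half], shuffled[half:]
        # Artificial labeling problem: the training labels are purely random.
        model = train_voter([(items[n], rng.randrange(n_labels)) for n in train])
        predicted = {n: predict(model, items[n]) for n in apply_set}
        for a, b in combinations(sorted(apply_set), 2):
            seen[(a, b)] += 1
            if predicted[a] is not None and predicted[a] == predicted[b]:
                together[(a, b)] += 1
    return {pair: together[pair] / seen[pair] for pair in seen}

# Toy entities described by hypothetical context features.
toy = {
    "Alice": {"cap", "prev:Mr", "next:said"},
    "Bob":   {"cap", "prev:Mr", "next:said"},
    "Carol": {"cap", "prev:Mrs", "next:said"},
    "Paris": {"cap", "prev:in", "next:,"},
    "Lyon":  {"cap", "prev:in", "next:,"},
    "Rome":  {"cap", "prev:to", "next:,"},
}
sim = colabel_similarity(toy)
```

The resulting `sim` dictionary maps sorted name pairs to a co-labeling rate in [0, 1]; such a matrix could then be passed to any standard clustering algorithm. Entities with identical contexts always receive the same label here, whereas entities with different contexts agree only part of the time.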
Pages: 415-428
Page count: 14