Knowledge Discovery with CRF-Based Clustering of Named Entities without a Priori Classes

Cited by: 0
Authors
Claveau, Vincent [1, 2]
Ncibi, Abir [1, 2]
Affiliations
[1] IRISA CNRS, Campus Beaulieu, F-35042 Rennes, France
[2] INRIA IRISA, F-35042 Rennes, France
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Knowledge discovery aims at bringing out coherent groups of entities. It is usually based on clustering, which requires defining a notion of similarity between the relevant entities. In this paper, we propose to repurpose a supervised machine learning technique (namely Conditional Random Fields, widely used for supervised labeling tasks) to compute, indirectly and without supervision, similarities among text sequences. Our approach consists in generating artificial labeling problems on the data to reveal regularities between entities through their labeling. We describe how this framework can be implemented and evaluate it on two information extraction/discovery tasks. The results demonstrate the usefulness of this unsupervised approach and open many avenues for defining similarities for complex representations of textual data.
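The abstract does not spell out the procedure, so the following Python sketch is only one possible reading of "generating artificial labeling problems": in each round, random labels are assigned to a random subset of entities, a supervised model is trained on them, and the remaining entities are labeled; pairs that often receive the same label are treated as similar. To stay dependency-free, a small naive Bayes classifier stands in for the Conditional Random Field, and all names and parameters (`colabeling_similarity`, `n_rounds`, `n_labels`, `train_fraction`) are hypothetical rather than taken from the paper.

```python
import math
import random
from collections import Counter, defaultdict
from itertools import combinations


def train_naive_bayes(examples):
    """Train on (features, label) pairs and return a classify(features) function.

    Stand-in for the CRF named in the abstract (assumption, not the authors' model).
    """
    label_counts = Counter(label for _, label in examples)
    feat_counts = defaultdict(Counter)
    vocab = set()
    for feats, label in examples:
        feat_counts[label].update(feats)
        vocab.update(feats)
    total = len(examples)

    def classify(feats):
        best_label, best_score = None, -math.inf
        for label in sorted(label_counts):
            n = sum(feat_counts[label].values())
            score = math.log(label_counts[label] / total)
            for f in feats:
                # Laplace smoothing so unseen features do not zero out a label.
                score += math.log((feat_counts[label][f] + 1) / (n + len(vocab) + 1))
            if score > best_score:
                best_label, best_score = label, score
        return best_label

    return classify


def colabeling_similarity(entities, n_rounds=200, n_labels=3,
                          train_fraction=0.5, seed=0):
    """Estimate pairwise similarity as the co-labeling rate over random problems.

    entities: dict mapping an entity name to a list of context features.
    Returns a dict mapping sorted name pairs (a, b) to a value in [0, 1].
    """
    rng = random.Random(seed)
    names = sorted(entities)
    together = Counter()  # rounds in which both entities got the same label
    seen = Counter()      # rounds in which both entities were labeled by the model
    for _ in range(n_rounds):
        n_train = max(n_labels, int(len(names) * train_fraction))
        train = set(rng.sample(names, n_train))
        # Artificial labeling problem: the labels are random, not meaningful classes.
        examples = [(entities[n], rng.randrange(n_labels)) for n in sorted(train)]
        classify = train_naive_bayes(examples)
        held_out = [n for n in names if n not in train]
        predicted = {n: classify(entities[n]) for n in held_out}
        for a, b in combinations(held_out, 2):
            seen[(a, b)] += 1
            if predicted[a] == predicted[b]:
                together[(a, b)] += 1
    return {pair: together[pair] / seen[pair] for pair in seen}


if __name__ == "__main__":
    # Toy contexts for illustration only; not data from the paper.
    toy = {
        "Messi": ["scored", "goal", "striker"],
        "Ronaldo": ["scored", "goal", "header"],
        "Barcelona": ["club", "won", "league"],
        "Juventus": ["club", "lost", "league"],
        "Benzema": ["scored", "penalty", "striker"],
        "Arsenal": ["club", "won", "cup"],
    }
    for pair, value in sorted(colabeling_similarity(toy, n_labels=2).items()):
        print(pair, round(value, 2))
```

The resulting pairwise scores could then be passed to any standard clustering algorithm that accepts a similarity matrix; the choice of algorithm is left open here because the abstract does not name one.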
Pages: 415-428
Page count: 14
Related papers
50 in total
  • [41] An Ant-based Clustering System for Knowledge Discovery in DNA Chip Analysis Data
    Lee, Minsoo
    Kim, Yun-mi
    Kim, Yearn Jeong
    Lee, Yoon-kyung
    Yoon, Hyejung
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 23, 2007, 23 : 261 - 266
  • [42] A clustering method based on rough sets and its application to knowledge discovery in the medical database
    Hirano, S
    Tsumoto, S
    Okuzaki, T
    Hata, Y
    MEDINFO 2001: PROCEEDINGS OF THE 10TH WORLD CONGRESS ON MEDICAL INFORMATICS, PTS 1 AND 2, 2001, 84 : 206 - 210
  • [43] Cluster_KDD: A Visual Clustering and Knowledge Discovery Platform Based on Concept Lattice
    Touzi, Amel Grissa
    Aloui, Amira
    Mahouachi, Rim
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2012, PT II, 2012, 7332 : 127 - 136
  • [44] Computer vision based pose detection of agricultural implements without a priori knowledge of their geometry and visual appearance
    Erz, Michael
    2018 15TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2018, : 489 - 494
  • [45] Application of a priori knowledge-enhanced fuzzy clustering to acoustic emission-based damage identification of composite laminates
    Ma, Weijie
    Dong, Fan
    Li, Yazhi
    Li, Biao
    Zhou, Chunping
    APPLIED ACOUSTICS, 2025, 229
  • [46] Image-Based Target Tracking Using Least-Squares Trajectory Estimation without A Priori Knowledge
    Matchen, Timothy D.
    Nadler, Brett R.
    2014 IEEE AEROSPACE CONFERENCE, 2014,
  • [47] Model based optimal multipolar stimulation without a priori knowledge of nerve structure: application to vagus nerve stimulation
    Dali, Melissa
    Rossel, Olivier
    Andreu, David
    Laporte, Laure
    Hernandez, Alfredo
    Laforet, Jeremy
    Marijon, Eloi
    Hagege, Albert
    Clerc, Maureen
    Henry, Christine
    Guiraud, David
    JOURNAL OF NEURAL ENGINEERING, 2018, 15 (04)
  • [48] Deep Learning-Based Near-Field Source Localization Without a Priori Knowledge of the Number of Sources
    Lee, Hojun
    Kim, Yongcheol
    Seol, Seunghwan
    Chung, Jaehak
    IEEE Access, 2022, 10 : 55360 - 55368
  • [50] Knowledge Discovery and Modeling based on Conditional Fuzzy Clustering with Interval Type-2 Fuzzy
    Byeon, Yeong-Hyeon
    Kwak, Keun-Chang
    2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 440 - 444