Domain-agnostic discovery of similarities and concepts at scale

被引:1
|
作者
Gornerup, Olof [1 ]
Gillblad, Daniel [1 ]
Vasiloudis, Theodore [1 ]
机构
[1] Swedish Inst Comp Sci SICS, S-16429 Kista, Sweden
关键词
Similarity discovery; Concept mining; Distributional semantics; Graph processing; NETWORKS; DATABASE;
D O I
10.1007/s10115-016-0984-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Appropriately defining and efficiently calculating similarities from large data sets are often essential in data mining, both for gaining understanding of data and generating processes and for building tractable representations. Given a set of objects and their correlations, we here rely on the premise that each object is characterized by its context, i.e., its correlations to the other objects. The similarity between two objects can then be expressed in terms of the similarity between their contexts. In this way, similarity pertains to the general notion that objects are similar if they are exchangeable in the data. We propose a scalable approach for calculating all relevant similarities among objects by relating them in a correlation graph that is transformed to a similarity graph. These graphs can express rich structural properties among objects. Specifically, we show that concepts-abstractions of objects-are constituted by groups of similar objects that can be discovered by clustering the objects in the similarity graph. These principles and methods are applicable in a wide range of fields and will be demonstrated here in three domains: computational linguistics, music, and molecular biology, where the numbers of objects and correlations range from small to very large.
引用
收藏
页码:531 / 560
页数:30
相关论文
共 50 条
  • [31] Unsupervised domain-agnostic identification of product names in social media posts
    Pogrebnyakov, Nicolai
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 3711 - 3716
  • [32] Defining Operational Design Domain for Autonomous Systems: A Domain-Agnostic and Risk-Based Approach
    Adedjouma, Morayo
    Botella, Bernard
    Ibanez-Guzman, Javier
    Mantissa, Kevin
    Proum, Chauk-Mean
    Smaoui, Asma
    2024 19TH ANNUAL SYSTEM OF SYSTEMS ENGINEERING CONFERENCE, SOSE 2024, 2024, : 166 - 171
  • [33] Meta-Prototypical Learning for Domain-Agnostic Few-Shot Recognition
    Wang, Rui-Qi
    Zhang, Xu-Yao
    Liu, Cheng-Lin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6990 - 6996
  • [34] ONTOCONNECT: Domain-Agnostic Ontology Alignment using Graph Embedding with Negative Sampling
    Chakraborty, Jaydeep
    Zahera, Hamada M.
    Sherif, Mohamed Ahmed
    Bansal, Srividya K.
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 942 - 945
  • [35] T3: Domain-Agnostic Neural Time-series Narration
    Sharma, Mandar
    Brownstein, John S.
    Ramakrishnan, Naren
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 1324 - 1329
  • [36] DOMAIN-AGNOSTIC META-LEARNING FOR CROSS-DOMAIN FEW-SHOT CLASSIFICATION
    Lee, Wei-Yu
    Wang, Jheng-Yu
    Wang, Yu-Chiang Frank
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1715 - 1719
  • [37] Learning deep domain-agnostic features from synthetic renders for industrial visual inspection
    Abubakr, Abdelrahman G.
    Jovancevic, Igor
    Mokhtari, Nour Islam
    Ben Abdallah, Hamdi
    Orteu, Jean-Jose
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (05)
  • [38] Domain-Agnostic Segmentation of Thalamic Nuclei from Joint Structural and Diffusion MRI
    Tregidgo, Henry F. J.
    Soskic, Sonja
    Olchanyi, Mark D.
    Althonayan, Juri
    Billot, Benjamin
    Maffei, Chiara
    Golland, Polina
    Yendiki, Anastasia
    Alexander, Daniel C.
    Bocchetta, Martina
    Rohrer, Jonathan D.
    Iglesias, Juan Eugenio
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VIII, 2023, 14227 : 247 - 257
  • [39] DREAM: Domain-Agnostic Reverse Engineering Attributes of Black-Box Model
    Li, Rongqing
    Yu, Jiaqi
    Li, Changsheng
    Luo, Wenhan
    Yuan, Ye
    Wang, Guoren
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 8009 - 8022
  • [40] Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models
    Arar, Moab
    Gal, Rinon
    Atzmon, Yuval
    Chechik, Gal
    Cohen-Or, Daniel
    Shamir, Ariel
    Bermano, Amit H.
    PROCEEDINGS OF THE SIGGRAPH ASIA 2023 CONFERENCE PAPERS, 2023,