Domain-agnostic discovery of similarities and concepts at scale

Cited by: 1
Authors
Gornerup, Olof [1]
Gillblad, Daniel [1]
Vasiloudis, Theodore [1]
Affiliations
[1] Swedish Inst Comp Sci SICS, S-16429 Kista, Sweden
Keywords
Similarity discovery; Concept mining; Distributional semantics; Graph processing; NETWORKS; DATABASE;
DOI
10.1007/s10115-016-0984-2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Appropriately defining and efficiently calculating similarities from large data sets are often essential in data mining, both for gaining understanding of data and generating processes and for building tractable representations. Given a set of objects and their correlations, we here rely on the premise that each object is characterized by its context, i.e., its correlations to the other objects. The similarity between two objects can then be expressed in terms of the similarity between their contexts. In this way, similarity pertains to the general notion that objects are similar if they are exchangeable in the data. We propose a scalable approach for calculating all relevant similarities among objects by relating them in a correlation graph that is transformed to a similarity graph. These graphs can express rich structural properties among objects. Specifically, we show that concepts, i.e., abstractions of objects, are constituted by groups of similar objects that can be discovered by clustering the objects in the similarity graph. These principles and methods are applicable in a wide range of fields and will be demonstrated here in three domains: computational linguistics, music, and molecular biology, where the numbers of objects and correlations range from small to very large.
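The pipeline outlined in the abstract (correlation graph, then similarity graph, then clustering) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not name a similarity measure or a clustering algorithm, so the sketch assumes cosine similarity between context vectors, a fixed similarity cutoff, and connected components as the clustering step. The names `similarity_graph`, `concepts`, `min_similarity`, and the toy data are hypothetical.

```python
from collections import defaultdict
from itertools import combinations
from math import sqrt


def similarity_graph(correlations, min_similarity=0.5):
    """Turn a correlation graph into a similarity graph.

    correlations maps each object to its context: {neighbor: weight}.
    Assumption: cosine similarity between contexts (not from the source).
    """
    # Index objects by the context items they correlate with, so that only
    # pairs sharing at least one context item are ever compared.
    by_context = defaultdict(set)
    for obj, context in correlations.items():
        for neighbor in context:
            by_context[neighbor].add(obj)

    candidates = set()
    for objs in by_context.values():
        candidates.update(combinations(sorted(objs), 2))

    norms = {o: sqrt(sum(w * w for w in c.values()))
             for o, c in correlations.items()}

    sims = {}
    for a, b in candidates:
        ca, cb = correlations[a], correlations[b]
        dot = sum(w * cb[n] for n, w in ca.items() if n in cb)
        if norms[a] == 0 or norms[b] == 0:
            continue
        s = dot / (norms[a] * norms[b])
        if s >= min_similarity:  # assumed cutoff for "relevant" similarities
            sims[(a, b)] = s
    return sims


def concepts(sims):
    """Group objects into concepts.

    Assumption: connected components of the similarity graph stand in for
    the clustering step; computed here with a small union-find.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in sims:
        parent[find(a)] = find(b)

    groups = defaultdict(set)
    for x in list(parent):
        groups[find(x)].add(x)
    return sorted(sorted(g) for g in groups.values())


# Toy correlation graph (hypothetical data): objects and weighted contexts.
correlations = {
    "cat": {"purrs": 2.0, "pet": 1.0},
    "kitten": {"purrs": 1.0, "pet": 1.0},
    "car": {"drives": 3.0, "road": 1.0},
    "truck": {"drives": 2.0, "road": 1.0},
}
sims = similarity_graph(correlations)
groups = concepts(sims)
print(groups)
```

In this toy input, "cat" and "kitten" share contexts, as do "car" and "truck", so each pair is exchangeable in the data and ends up in the same group. Restricting comparisons to pairs that share a context item is one simple way to avoid comparing all pairs of objects.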
Pages: 531-560
Page count: 30
Related papers
50 in total
  • [41] Domain-Agnostic Context-Aware Assistant Framework for Task-Based Environment
    Tiwari, Sarthak
    Bansal, Ajay
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2021), 2021, : 86 - 87
  • [42] Continual-GEN: Continual Group Ensembling for Domain-agnostic Skin Lesion Classification
    Bayasi, Nourhan
    Du, Siyi
    Hamarneh, Ghassan
    Garbi, Rafeef
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023 WORKSHOPS, 2023, 14393 : 3 - 13
  • [43] Suppressing Spoof-Irrelevant Factors for Domain-Agnostic Face Anti-Spoofing
    Kim, Taewook
    Kim, Yonghyun
    IEEE ACCESS, 2021, 9 : 86966 - 86974
  • [44] Towards Efficient and Domain-Agnostic Evasion Attack with High-Dimensional Categorical Inputs
    Bao, Hongyan
    Han, Yufei
    Zhou, Yujun
    Gao, Xin
    Zhang, Xiangliang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 6753 - 6761
  • [45] Mining Similarities and Concepts at Scale
    Gornerup, Olof
    Vasiloudis, Theodore
    ERCIM NEWS, 2016, (107): : 26 - 27
  • [46] Low-Resource Dialogue Summarization with Domain-Agnostic Multi-Source Pretraining
    Zou, Yicheng
    Zhu, Bolin
    Hu, Xingwu
    Gui, Tao
    Zhang, Qi
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 80 - 91
  • [47] Interpretable domain-informed and domain-agnostic features for supervised and unsupervised learning on building energy demand data
    Canaydin, Ada
    Fu, Chun
    Balint, Attila
    Khalil, Mohamad
    Miller, Clayton
    Kazmi, Hussain
    APPLIED ENERGY, 2024, 360
  • [48] Powering Finetuning in Few-Shot Learning: Domain-Agnostic Bias Reduction with Selected Sampling
    Tao, Ran
    Zhang, Han
    Zheng, Yutong
    Savvides, Marios
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8467 - 8475
  • [49] SAZED: parameter-free domain-agnostic season length estimation in time series data
    Toller, Maximilian
    Santos, Tiago
    Kern, Roman
    DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (06) : 1775 - 1798
  • [50] A Needle in a Haystack: Distinguishable Deep Neural Network Features for Domain-Agnostic Device Fingerprinting
    Elmaghbub, Abdurrahman
    Hamdaoui, Bechir
    2023 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY, CNS, 2023,