Domain-agnostic discovery of similarities and concepts at scale

Cited by: 1
Authors
Gornerup, Olof [1]
Gillblad, Daniel [1]
Vasiloudis, Theodore [1]
Affiliations
[1] Swedish Inst Comp Sci SICS, S-16429 Kista, Sweden
Keywords
Similarity discovery; Concept mining; Distributional semantics; Graph processing; NETWORKS; DATABASE
DOI
10.1007/s10115-016-0984-2
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Appropriately defining and efficiently calculating similarities from large data sets are often essential in data mining, both for gaining understanding of data and generating processes and for building tractable representations. Given a set of objects and their correlations, we here rely on the premise that each object is characterized by its context, i.e., its correlations to the other objects. The similarity between two objects can then be expressed in terms of the similarity between their contexts. In this way, similarity pertains to the general notion that objects are similar if they are exchangeable in the data. We propose a scalable approach for calculating all relevant similarities among objects by relating them in a correlation graph that is transformed to a similarity graph. These graphs can express rich structural properties among objects. Specifically, we show that concepts (abstractions of objects) are constituted by groups of similar objects that can be discovered by clustering the objects in the similarity graph. These principles and methods are applicable in a wide range of fields and will be demonstrated here in three domains: computational linguistics, music, and molecular biology, where the numbers of objects and correlations range from small to very large.
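The pipeline outlined in the abstract (correlation graph, then similarity graph, then clustering into concepts) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the similarity measure, the pruning rule, or the clustering algorithm, so cosine similarity between contexts, a fixed threshold of 0.8, and connected-component clustering are all assumptions made here for illustration. The toy data and every function name are hypothetical, and the naive all-pairs loop does not reflect the scalability the paper targets.

```python
from itertools import combinations
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two sparse context vectors stored as dicts."""
    dot = sum(w * b[k] for k, w in a.items() if k in b)
    norm_a = sqrt(sum(w * w for w in a.values()))
    norm_b = sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity_graph(correlations, threshold=0.8):
    """Map a correlation graph {object: {neighbor: weight}} to similarity edges.

    ASSUMPTION: cosine similarity and a fixed threshold stand in for whatever
    measure and pruning the paper actually uses.
    """
    edges = {}
    for u, v in combinations(sorted(correlations), 2):
        # Compare the two objects through their correlations to third objects.
        ctx_u = {k: w for k, w in correlations[u].items() if k not in (u, v)}
        ctx_v = {k: w for k, w in correlations[v].items() if k not in (u, v)}
        s = cosine(ctx_u, ctx_v)
        if s >= threshold:
            edges[(u, v)] = s
    return edges


def concepts(objects, edges):
    """Group objects into concepts as connected components (union-find).

    ASSUMPTION: connected components are the simplest possible clustering,
    used here only as a placeholder.
    """
    parent = {o: o for o in objects}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for u, v in edges:
        parent[find(u)] = find(v)

    groups = {}
    for o in objects:
        groups.setdefault(find(o), set()).add(o)
    return [g for g in groups.values() if len(g) > 1]


# Hypothetical toy correlation graph (symmetric weights).
correlations = {
    "cat": {"pet": 1.0, "fur": 0.9},
    "dog": {"pet": 0.9, "fur": 1.0},
    "car": {"road": 1.0, "wheel": 0.8},
    "bus": {"road": 0.9, "wheel": 1.0},
    "pet": {"cat": 1.0, "dog": 0.9},
    "fur": {"cat": 0.9, "dog": 1.0},
    "road": {"car": 1.0, "bus": 0.9},
    "wheel": {"car": 0.8, "bus": 1.0},
}

edges = similarity_graph(correlations)
result = concepts(correlations, edges)
```

In this toy graph, "cat" and "dog" end up in one group because they correlate with the same neighbours, which mirrors the abstract's notion that objects are similar if they are exchangeable in the data.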
Pages: 531-560
Page count: 30