Distance metric learning for graph structured data

被引:0
|
作者
Tomoki Yoshida
Ichiro Takeuchi
Masayuki Karasuyama
机构
[1] Nagoya Institute of Technology,
[2] National Institute for Material Science,undefined
[3] RIKEN Center for Advanced Intelligence Project,undefined
[4] Japan Science and Technology Agency,undefined
来源
Machine Learning | 2021年 / 110卷
关键词
Metric learning; Structured data; Graph mining; Convex optimization; Interpretability;
D O I
暂无
中图分类号
学科分类号
摘要
Graphs are versatile tools for representing structured data. As a result, a variety of machine learning methods have been studied for graph data analysis. Although many such learning methods depend on the measurement of differences between input graphs, defining an appropriate distance metric for graphs remains a controversial issue. Hence, we propose a supervised distance metric learning method for the graph classification problem. Our method, named interpretable graph metric learning (IGML), learns discriminative metrics in a subgraph-based feature space, which has a strong graph representation capability. By introducing a sparsity-inducing penalty on the weight of each subgraph, IGML can identify a small number of important subgraphs that can provide insight into the given classification task. Because our formulation has a large number of optimization variables, an efficient algorithm that uses pruning techniques based on safe screening and working set selection methods is also proposed. An important property of IGML is that solution optimality is guaranteed because the problem is formulated as a convex problem and our pruning strategies only discard unnecessary subgraphs. Furthermore, we show that IGML is also applicable to other structured data such as itemset and sequence data, and that it can incorporate vertex-label similarity by using a transportation-based subgraph feature. We empirically evaluate the computational efficiency and classification performance of IGML on several benchmark datasets and provide some illustrative examples of how IGML identifies important subgraphs from a given graph dataset.
引用
收藏
页码:1765 / 1811
页数:46
相关论文
共 50 条
  • [31] Kernel-based distance metric learning for microarray data classification
    Xiong, Huilin
    Chen, Xue-wen
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [32] Tree-Based Metric Learning for Distance Computation in Data Mining
    Yan, Ming
    Zhang, Yan
    Wang, Hongzhi
    WEB TECHNOLOGIES AND APPLICATIONS (APWEB 2015), 2015, 9313 : 377 - 388
  • [33] Kernel-based distance metric learning for microarray data classification
    Huilin Xiong
    Xue-wen Chen
    BMC Bioinformatics, 7
  • [34] LEARNING OF STRUCTURED GRAPH DICTIONARIES
    Zhang, Xuan
    Dong, Xiaowen
    Frossard, Pascal
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 3373 - 3376
  • [35] Reinforcement learning based metric filtering for evolutionary distance metric learning
    Ali, Bassel
    Moriyama, Koichi
    Kalintha, Wasin
    Numao, Masayuki
    Fukui, Ken-Ichi
    INTELLIGENT DATA ANALYSIS, 2020, 24 (06) : 1345 - 1364
  • [36] Learning Distance for Sequences by Learning a Ground Metric
    Su, Bing
    Wu, Ying
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [37] Object Tracking With Structured Metric Learning
    Zhao, Xiaolin
    Xu, Zhuofan
    Zhao, Boxin
    Chen, Xiaolong
    Li, Zongzhe
    IEEE ACCESS, 2019, 7 : 161764 - 161775
  • [38] A graph distance metric based on the maximal common subgraph
    Bunke, H
    Shearer, K
    PATTERN RECOGNITION LETTERS, 1998, 19 (3-4) : 255 - 259
  • [39] The complementary distance matrix, a new molecular graph metric
    Ivanciuc, O
    Ivanciuc, T
    Balaban, AT
    ACH-MODELS IN CHEMISTRY, 2000, 137 (01): : 57 - 82
  • [40] Superpixel Graph Label Transfer with Learned Distance Metric
    Gould, Stephen
    Zhao, Jiecheng
    He, Xuming
    Zhang, Yuhang
    COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 : 632 - 647