Unravelling the structure of the CSD cocrystal network using a fast near-optimal bipartisation algorithm for large networks

Cited by: 2
Authors
de Vries, Tom E. [1]
Vlieg, Elias [1]
de Gelder, Rene [1]
Affiliations
[1] Radboud Univ Nijmegen, Inst Mol & Mat, Solid State Chem, Heyendaalseweg 135, NL-6525 AJ Nijmegen, Netherlands
Keywords
DATABASE; GRAPH
DOI
10.1039/d3ce00978e
Chinese Library Classification number
O6 [Chemistry]
Subject classification code
0703
Abstract
Networks, consisting of vertices connected by edges, are an important mathematical concept used to describe relationships between people, roads between cities, reactions between chemicals, and many other interactions. Such a network can be created by extracting cocrystals from the Cambridge Structural Database (CSD). This network describes which compounds can form cocrystals together and can, for example, be used to predict new cocrystals using link-prediction techniques. Bipartiteness is an important property of some networks wherein the vertices can be separated into two groups such that edges only point from one group to the other. Knowing whether a network is bipartite can make studying its structure considerably easier. If a network is nearly bipartite except for a number of outlying edges, one might want to identify and remove those edges, thereby bipartising the network. The CSD cocrystal network was previously found to be close to bipartiteness. Truly bipartising it could improve the accuracy of link-prediction and give insight into the hidden structure of the network. Many algorithms exist for exactly finding the optimal bipartisation for a nearly-bipartite network, but the time it takes to complete such algorithms increases exponentially with the size of the problem. In some cases, an exact solution is unnecessary and a 'good enough' bipartisation is sufficient. We have developed an algorithm that can find a near-optimal bipartisation within reasonable time, even for very large networks, and used it to unravel the structure of the CSD cocrystal network. We obtained a bipartisation that leaves 96% of the network intact, and we were able to identify 'universal' coformers that do not conform to the bipartite nature of the network. By applying a clustering algorithm to the bipartised network, we were also able to identify anticommunities of coformers. 
Analysing the CSD cocrystal network using a fast near-optimal bipartisation algorithm reveals its hidden structures.
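The notions of bipartiteness and bipartisation described in the abstract can be illustrated with a small Python sketch. This is not the authors' algorithm, which the record does not specify. It is a generic heuristic: a breadth-first 2-colouring followed by a simple local search that flips a vertex whenever most of its neighbours share its colour. The function names and the toy graph are hypothetical, and the code assumes an undirected simple graph (symmetric adjacency lists, no self-loops).

```python
from collections import deque


def bfs_two_colour(adj):
    """Assign colour 0/1 to every vertex by breadth-first search.

    Assumption: `adj` maps each vertex to a list of its neighbours
    and is symmetric (undirected graph, no self-loops).
    """
    colour = {}
    for start in adj:
        if start in colour:
            continue
        colour[start] = 0
        queue = deque([start])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in colour:
                    colour[v] = 1 - colour[u]
                    queue.append(v)
    return colour


def conflicting_edges(adj, colour):
    """Edges whose two endpoints ended up in the same group."""
    return {
        frozenset((u, v))
        for u in adj
        for v in adj[u]
        if colour[u] == colour[v]
    }


def local_search(adj, colour):
    """Flip a vertex while more of its neighbours share its colour than not.

    Each flip strictly lowers the number of conflicting edges,
    so the loop terminates (given no self-loops).
    """
    changed = True
    while changed:
        changed = False
        for u in adj:
            same = sum(1 for v in adj[u] if colour[v] == colour[u])
            if same > len(adj[u]) - same:
                colour[u] = 1 - colour[u]
                changed = True
    return colour


def bipartise(adj):
    """Heuristic bipartisation: return (colouring, edges to remove).

    The graph is bipartite exactly when the returned edge set is empty.
    The result is a valid bipartisation but is not guaranteed optimal.
    """
    colour = local_search(adj, bfs_two_colour(adj))
    return colour, conflicting_edges(adj, colour)


if __name__ == "__main__":
    # Toy network: a 4-cycle a-b-c-d with one extra edge a-c.
    toy = {
        "a": ["b", "d", "c"],
        "b": ["a", "c"],
        "c": ["b", "d", "a"],
        "d": ["c", "a"],
    }
    groups, removed = bipartise(toy)
    print(sorted(tuple(sorted(e)) for e in removed))
```

On this toy network the plain 2-colouring leaves two conflicting edges, and the local search reduces this to the single edge a-c. A local search of this kind can stall in a local optimum on larger graphs, so it should be read as an illustration of the 'good enough' idea, not as a substitute for the published method.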
Pages: 192-202
Number of pages: 11