Mining semantic networks of bioinformatics e-resources from the literature

被引:1
|
作者
Afzal H. [2 ,3 ]
Eales J. [1 ]
Stevens R. [1 ]
Nenadic G. [1 ]
机构
[1] University of Manchester, School of Computer Science, Oxford Road, Manchester
[2] National University of Sciences and Technology, College of Telecommunication Engineering, Islamabad
[3] National University of Ireland, Digital Enterprise Research Institute, Galway
基金
爱尔兰科学基金会; 英国生物技术与生命科学研究理事会;
关键词
Semantic Relatedness; Semantic Network; Pairwise Alignment; Resource Discovery; Semantic Descriptor;
D O I
10.1186/2041-1480-2-S1-S4
中图分类号
学科分类号
摘要
Background: There have been a number of recent efforts (e.g. BioCatalogue, BioMoby) to systematically catalogue bioinformatics tools, services and datasets. These efforts rely on manual curation, making it difficult to cope with the huge influx of various electronic resources that have been provided by the bioinformatics community. We present a text mining approach that utilises the literature to automatically extract descriptions and semantically profile bioinformatics resources to make them available for resource discovery and exploration through semantic networks that contain related resources. Results: The method identifies the mentions of resources in the literature and assigns a set of co-occurring terminological entities (descriptors) to represent them. We have processed 2,691 full-text bioinformatics articles and extracted profiles of 12,452 resources containing associated descriptors with binary and tf*idf weights. Since such representations are typically sparse (on average 13.77 features per resource), we used lexical kernel metrics to identify semantically related resources via descriptor smoothing. Resources are then clustered or linked into semantic networks, providing the users (bioinformaticians, curators and service/tool crawlers) with a possibility to explore algorithms, tools, services and datasets based on their relatedness. Manual exploration of links between a set of 18 well-known bioinformatics resources suggests that the method was able to identify and group semantically related entities. Conclusions: The results have shown that the method can reconstruct interesting functional links between resources (e.g. linking data types and algorithms), in particular when tf * idf-like weights are used for profiling. This demonstrates the potential of combining literature mining and simple lexical kernel methods to model relatedness between resource descriptors in particular when there are few features, thus potentially improving the resource description, discovery and exploration process. The resource profiles are available at http://gnode1.mib.man.ac.uk/bioinf/semnets.html © 2011 Afzal et al; licensee BioMed Central Ltd.
引用
收藏
相关论文
共 50 条
  • [1] Mining Semantic Descriptions of Bioinformatics Web Resources from the Literature
    Afzal, Hammad
    Stevens, Robert
    Nenadic, Goran
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, 2009, 5554 : 535 - 549
  • [2] E-resources for technical communication
    Smith, EO
    STC'S 49TH ANNUAL CONFERENCE, PROCEEDINGS: LEADING THE TECHNICAL COMMUNICATION REVOLUTION, 2002, : 111 - 113
  • [3] A New Vision for E-Resources
    McCracken, Peter
    LIBRARY JOURNAL, 2011, 136 (09) : 106 - 106
  • [4] E-resources measurement in libraries
    Fava, Ilaria
    JLIS.IT, 2011, 2 (02):
  • [5] E-Resources in tough times
    Tenopir, C
    LIBRARY JOURNAL, 2004, 129 (10) : 42 - 42
  • [6] From communication to collaboration: blogging to troubleshoot e-resources
    Pan, Denise
    Bradbeer, Gayle
    Jurries, Elaine
    ELECTRONIC LIBRARY, 2011, 29 (03): : 344 - 353
  • [7] MEDIA LITERACY E-RESOURCES
    Price, Gary
    Dar, Mahnaz
    LIBRARY JOURNAL, 2017, 142 (18) : 14 - 15
  • [8] ASHS e-resources for teaching
    Cole, Janet
    HORTSCIENCE, 2006, 41 (04) : 919 - 919
  • [9] E-resources - Digging deeper
    Condon, Kathleen
    VOICES-THE JOURNAL OF NEW YORK FOLKLORE, 2006, 32 (3-4): : 15 - 15
  • [10] Students' keenness on use of e-resources
    Swain, Dillip K.
    ELECTRONIC LIBRARY, 2010, 28 (04): : 580 - 591