Encoding of primary structures of biological macromolecules within a data mining perspective

被引:9
|
作者
Maddouri, M
Elloumi, M
机构
[1] Natl Inst Appl Sci & Technol, Dept Comp Sci, Tunis, Tunisia
[2] Fac Econ Sci & Management Tunis, Dept Comp Sci, Tunis, Tunisia
关键词
encoding methods; biological macromolecules; data mining; strings;
D O I
10.1007/BF02944786
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An encoding method has a direct effect on the quality and the representation of the discovered knowledge in data mining systems. Biological macromolecules are encoded by strings of characters, called primary structures. Knowing that data mining systems usually use relational tables to encode data, we have then to re-encode these strings and transform them into relational tables. In this paper, we do a comparative study of the existing static encoding methods, that are based on the Biologist know-how, and our new dynamic encoding one, that is based on the construction of Discriminant and Minimal Substrings (DMS). Different classification methods are used to do this study. The experimental results show that our dynamic encoding method is more efficient than the static ones, to encode biological macromolecules within a data mining perspective.
引用
收藏
页码:78 / 88
页数:11
相关论文
共 50 条
  • [21] Biomagresbank: A repository of NMR spectroscopic data on biological macromolecules
    Ulrich, EL
    Argentar, DR
    Manabat, NC
    Ioannidis, YE
    Livny, M
    Markley, JL
    FASEB JOURNAL, 1997, 11 (09): : A1122 - A1122
  • [22] A perspective of marine mining within de Beers
    De Beers Marine Ltd., Cape Town, South Africa
    J S Afr Inst Min Metall, 2007, 6 (393-402):
  • [23] A perspective of marine mining within De Beers
    Richardson, K.
    JOURNAL OF THE SOUTH AFRICAN INSTITUTE OF MINING AND METALLURGY, 2007, 107 (06): : 393 - 402
  • [25] Spatial Data Mining: A Perspective of Big Data
    Wang, Shuliang
    Yuan, Hanning
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2014, 10 (04) : 50 - 70
  • [26] The seasons within: a theoretical perspective on photoperiodic entrainment and encoding
    Schmal, Christoph
    JOURNAL OF COMPARATIVE PHYSIOLOGY A-NEUROETHOLOGY SENSORY NEURAL AND BEHAVIORAL PHYSIOLOGY, 2024, 210 (04): : 549 - 564
  • [27] Statistical encoding of succinct data structures
    Gonzalez, Rodrigo
    Navarro, Gonzalo
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2006, 4009 : 294 - 305
  • [28] ENCODING DATA-STRUCTURES IN TREES
    ROSENBERG, AL
    JOURNAL OF THE ACM, 1979, 26 (04) : 668 - 689
  • [29] Comparison of four indirect (data mining) approaches to derive within-subject biological variation
    Tan, Rui Zhen
    Markus, Corey
    Vasikaran, Samuel
    Loh, Tze Ping
    CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2022, 60 (04) : 636 - 644
  • [30] Data mining within DBMS functionality
    Zakrzewicz, M
    DATABASES AND INFORMATION SYSTEMS, 2001, : 85 - 96