Encoding of primary structures of biological macromolecules within a data mining perspective

Cited by: 9
Authors
Maddouri, M
Elloumi, M
Affiliations
[1] Natl Inst Appl Sci & Technol, Dept Comp Sci, Tunis, Tunisia
[2] Fac Econ Sci & Management Tunis, Dept Comp Sci, Tunis, Tunisia
Keywords
encoding methods; biological macromolecules; data mining; strings
DOI
10.1007/BF02944786
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The encoding method directly affects the quality and representation of the knowledge discovered by data mining systems. Biological macromolecules are encoded as strings of characters, called primary structures. Because data mining systems usually represent data as relational tables, these strings must be re-encoded and transformed into relational tables. In this paper, we compare the existing static encoding methods, which are based on biologists' know-how, with our new dynamic encoding method, which is based on the construction of Discriminant and Minimal Substrings (DMS). Several classification methods are used in this comparison. The experimental results show that our dynamic encoding method is more efficient than the static ones for encoding biological macromolecules within a data mining perspective.
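To illustrate the idea of re-encoding strings as a relational table, here is a minimal Python sketch. It is not the authors' DMS construction, which the abstract does not specify. It assumes a simplified reading: a substring is "discriminant" if it occurs in sequences of exactly one family, and "minimal" if none of its proper substrings is itself discriminant. The function names (`substrings`, `find_dms`, `encode`) and the `max_len` cap are hypothetical choices made for this example.

```python
def substrings(seq, max_len):
    """Return all distinct substrings of seq with length 1..max_len."""
    return {
        seq[i:i + k]
        for k in range(1, max_len + 1)
        for i in range(len(seq) - k + 1)
    }


def find_dms(families, max_len=3):
    """Find substrings that are discriminant and minimal (simplified reading).

    families: dict mapping a family label to a list of sequences.
    Assumption: 'discriminant' means the substring occurs in exactly one family;
    'minimal' means no proper substring is also discriminant.
    """
    # Record, for every substring, the set of families it occurs in.
    seen_in = {}
    for label, seqs in families.items():
        for seq in seqs:
            for sub in substrings(seq, max_len):
                seen_in.setdefault(sub, set()).add(label)

    discriminant = {s for s, labels in seen_in.items() if len(labels) == 1}

    def is_minimal(s):
        # Proper substrings are those strictly shorter than s.
        return not (substrings(s, len(s) - 1) & discriminant)

    return sorted(s for s in discriminant if is_minimal(s))


def encode(families, dms):
    """Build relational rows: (sequence, 0/1 flag per DMS column, family label)."""
    rows = []
    for label, seqs in families.items():
        for seq in seqs:
            rows.append((seq, [int(s in seq) for s in dms], label))
    return rows
```

Each row of the resulting table has one binary attribute per selected substring plus the family label, which is the kind of input a standard classifier expects. The table is "dynamic" in the sense that its columns are derived from the data rather than fixed in advance.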
Pages: 78-88
Page count: 11