MiRTif: a support vector machine-based microRNA target interaction filter

Cited by: 45
Authors
Yang, Yuchen [2]
Wang, Yu-Ping [1]
Li, Kuo-Bin [1]
Affiliations
[1] Natl Yang Ming Univ, Ctr Syst & Synthet Biol, Taipei 11221, Taiwan
[2] Inst Mol & Cell Biol, Singapore 138673, Singapore
DOI
10.1186/1471-2105-9-S12-S4
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: MicroRNAs (miRNAs) are a set of small non-coding RNAs that serve as important negative gene regulators. In animals, miRNAs down-regulate protein translation by binding to the 3' UTR regions of target genes with imperfect complementary pairing. The identification of miRNA targets has become one of the major challenges of miRNA research. Bioinformatics investigations of miRNA targets have resulted in a number of target prediction tools. Although these tools are capable of predicting hundreds of targets for a given miRNA, many of them suffer from high false positive rates, indicating the need for a post-processing filter for the predicted targets. Once trained with experimentally validated true and false targets, machine learning methods appear to be ideal approaches to distinguish the true targets from the false ones.

Results: We present a miRNA target filtering system named MiRTif (miRNA:target interaction filter). The system is a support vector machine (SVM) classifier trained with 195 positive and 38 negative miRNA:target interaction pairs, all experimentally validated. Each miRNA:target interaction pair is divided into a seed and a non-seed region. The encoded feature vector contains various k-gram frequencies in the seed region, the non-seed region, and the entire interaction. Informative features are selected based on their discriminating abilities. Prediction accuracies are assessed using 10-fold cross-validation experiments. Our system achieves an AUC (area under the ROC curve) of 0.86, a sensitivity of 83.59%, and a specificity of 73.68%. More importantly, the system correctly identifies the majority of the false positive miRNA:target interactions (28 out of 38). The possibility of over-fitting due to the relatively small negative sample set has also been investigated using a set of non-validated and randomly selected targets (from miRBase).

Conclusion: MiRTif is designed as a post-processing filter that takes as input miRNA:target interactions predicted by other target prediction software such as TargetScanS, PicTar and miRanda, and estimates how likely a given interaction is to be real rather than spurious. MiRTif can be accessed from http://bsal.ym.edu.tw/mirtif.
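
The k-gram frequency encoding mentioned in the Results can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the miRNA:target duplex is represented, which values of k are used, or where the seed region ends, so the single-string input, the `seed_len=8` boundary, the `ks=(1, 2)` choice, the `ACGU` alphabet and all function names here are assumptions made for illustration. The last line only re-derives the reported specificity from the 28 of 38 negatives stated in the abstract.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACGU"  # assumption: plain RNA nucleotides, not the paper's actual symbol set


def kgram_frequencies(seq, k):
    """Relative frequency of every possible k-gram over ALPHABET within seq."""
    grams = ["".join(p) for p in product(ALPHABET, repeat=k)]
    windows = max(len(seq) - k + 1, 0)
    counts = Counter(seq[i:i + k] for i in range(windows))
    return {g: (counts[g] / windows if windows else 0.0) for g in grams}


def encode_pair(seq, seed_len=8, ks=(1, 2)):
    """Concatenate k-gram frequencies for the seed, non-seed and entire regions.

    seed_len and ks are hypothetical parameters, not values from the paper.
    """
    seed, non_seed = seq[:seed_len], seq[seed_len:]
    features = []
    for region in (seed, non_seed, seq):
        for k in ks:
            freqs = kgram_frequencies(region, k)
            features.extend(freqs[g] for g in sorted(freqs))
    return features


def specificity(true_negatives, total_negatives):
    """Fraction of negative examples that are correctly rejected."""
    return true_negatives / total_negatives


# Illustrative 22-nt input string; any RNA string over ACGU would do.
vector = encode_pair("UGAGGUAGUAGGUUGUAUAGUU")
print(len(vector))                           # 3 regions x (4 + 16) k-grams = 60
print(round(100 * specificity(28, 38), 2))   # 73.68, matching the abstract
```

A vector of this kind would then be passed to feature selection and an SVM classifier evaluated by 10-fold cross-validation, as the abstract describes; those steps are omitted here because their details are not given.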
Pages: 9