Classifying Force Spectroscopy of DNA Pulling Measurements Using Supervised and Unsupervised Machine Learning Methods

被引:6
|
作者
Karatay, Durmus U. [1 ]
Zhang, Jie [1 ]
Harrison, Jeffrey S. [1 ]
Ginger, David S. [1 ]
机构
[1] Univ Washington, Dept Chem, Seattle, WA 98195 USA
关键词
RANDOM FOREST; VALIDATION;
D O I
10.1021/acs.jcim.5b00722
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Dynamic force spectroscopy (DFS) measurements on biomolecules typically require classifying thousands of repeated force spectra prior to data analysis. Here, we study classification of atomic force microscope-based DFS measurements using machine-learning algorithms in order to automate selection of successful force curves. Notably, we collect a data set that has a testable positive signal using photoswitch-modified DNA before and after illumination with UV (365 nm) light. We generate a feature set consisting of six properties of force distance curves to train supervised models and use principal component analysis (PCA) for an unsupervised model. For supervised classification, we train random forest models for binary and multiclass classification of force distance curves. Random forest models predict successful pulls with an accuracy of 94% and classify them into five classes with an accuracy of 90%. The unsupervised method using Gaussian mixture models (GMM) reaches an accuracy of approximately 80% for binary classification.
引用
收藏
页码:621 / 629
页数:9
相关论文
共 50 条
  • [41] Unsupervised and supervised machine learning for total scattering and PDF analyses
    Billinge, S.
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2022, 78 : E244 - E244
  • [42] Artificial intelligence in dermatology: "unsupervised" versus "supervised" machine learning
    Pathania, Yashdeep Singh
    Budania, Anil
    INTERNATIONAL JOURNAL OF DERMATOLOGY, 2021, 60 (01) : E28 - E29
  • [43] Unsupervised machine learning technique for classifying production zones in unconventional reservoirs
    Abbas K.A.
    Gharavi A.
    Hindi N.A.
    Hassan M.
    Alhosin H.Y.
    Gholinezhad J.
    Ghoochaninejad H.
    Barati H.
    Buick J.
    Yousefi P.
    Alasmar R.
    Al-Saegh S.
    International Journal of Intelligent Networks, 2023, 4 : 29 - 37
  • [44] Supervised and Unsupervised Machine Learning Approaches for Bridge Damage Prediction
    Tamura, S.
    Zhang, B.
    Wang, Y.
    Chen, F.
    Nguyen, K.
    STRUCTURAL HEALTH MONITORING 2013, VOLS 1 AND 2, 2013, : 182 - 189
  • [45] Unsupervised change analysis using supervised learning
    Hido, Shohei
    Ide, Tsuyoshi
    Kashima, Hisashi
    Kubo, Harunobu
    Matsuzawa, Hirofumi
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 148 - 159
  • [46] Extractive summarization using supervised and unsupervised learning
    Mao, Xiangke
    Yang, Hui
    Huang, Shaobin
    Liu, Ye
    Li, Rongsheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 133 : 173 - 181
  • [47] Supervised and Unsupervised Learning by Using Petri Nets
    Shen, Victor R. L.
    Chang, Yue-Shan
    Juang, Tony Tong-Ying
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2010, 40 (02): : 363 - 375
  • [48] QUALES: Machine Translation Quality Estimation via Supervised and Unsupervised Machine Learning
    Etchegoyhen, Thierry
    Martinez Garcia, Eva
    Azpeitia, Andoni
    Alegria, Inaki
    Labaka, Gorka
    Otegi, Arantza
    Sarasola, Kepa
    Cortes, Itziar
    Jauregi, Amaia
    Ellakuria, Igor
    Calonge, Eusebi
    Martin, Maite
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2018, (61): : 143 - 146
  • [49] Classifying social media bots as malicious or benign using semi-supervised machine learning
    Mbona, Innocent
    Eloff, Jan H. P.
    JOURNAL OF CYBERSECURITY, 2023, 9 (01):
  • [50] An Analysis of Sindhi Annotated Corpus using Supervised Machine Learning Methods
    Ali, Mazhar
    Wagan, Asim Imdad
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2019, 38 (01) : 185 - 196