Classifying Force Spectroscopy of DNA Pulling Measurements Using Supervised and Unsupervised Machine Learning Methods

被引:6
|
作者
Karatay, Durmus U. [1 ]
Zhang, Jie [1 ]
Harrison, Jeffrey S. [1 ]
Ginger, David S. [1 ]
机构
[1] Univ Washington, Dept Chem, Seattle, WA 98195 USA
关键词
RANDOM FOREST; VALIDATION;
D O I
10.1021/acs.jcim.5b00722
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Dynamic force spectroscopy (DFS) measurements on biomolecules typically require classifying thousands of repeated force spectra prior to data analysis. Here, we study classification of atomic force microscope-based DFS measurements using machine-learning algorithms in order to automate selection of successful force curves. Notably, we collect a data set that has a testable positive signal using photoswitch-modified DNA before and after illumination with UV (365 nm) light. We generate a feature set consisting of six properties of force distance curves to train supervised models and use principal component analysis (PCA) for an unsupervised model. For supervised classification, we train random forest models for binary and multiclass classification of force distance curves. Random forest models predict successful pulls with an accuracy of 94% and classify them into five classes with an accuracy of 90%. The unsupervised method using Gaussian mixture models (GMM) reaches an accuracy of approximately 80% for binary classification.
引用
收藏
页码:621 / 629
页数:9
相关论文
共 50 条
  • [31] Signal Parameter Estimation and Classification Using Mixed Supervised and Unsupervised Machine Learning Approaches
    Katyara, Sunny
    Staszewski, Lukasz
    Leonowicz, Zbigniew
    IEEE ACCESS, 2020, 8 : 92754 - 92764
  • [32] Prediction of SGEMM GPU Kernel Performance using Supervised and Unsupervised Machine Learning Techniques
    Agrawal, Sanket
    Bansal, Akshay
    Rathor, Sandeep
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [33] Multi-Regional landslide detection using combined unsupervised and supervised machine learning
    Tehrani, Faraz S.
    Santinelli, Giorgio
    Herrera Herrera, Meylin
    GEOMATICS NATURAL HAZARDS & RISK, 2021, 12 (01) : 1015 - 1038
  • [34] Predicting Early Stage Drug Induced Parkinsonism using Unsupervised and Supervised Machine Learning
    Nair, Parvathy
    Trisno, Roth
    Baghini, Maryam Shojaei
    Pendharkar, Gita
    Chung, Hoam
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 776 - 779
  • [35] Workflow for Evaluating Normalization Tools for Omics Data Using Supervised and Unsupervised Machine Learning
    Chua, Aleesa E.
    Pfeifer, Leah D.
    Sekera, Emily R.
    Hummon, Amanda B.
    Desaire, Heather
    JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2023, 34 (12) : 2775 - 2784
  • [36] Angiographic prognosis and diagnosis of heart disease by using unsupervised and supervised Machine Learning techniques
    Sbirna, Sebastian
    Sbirna, Liana-Simona
    2020 24TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2020, : 84 - 91
  • [37] Railway defect detection based on track geometry using supervised and unsupervised machine learning
    Sresakoolchai, Jessada
    Kaewunruen, Sakdirat
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2022, 21 (04): : 1757 - 1767
  • [38] Comparison of supervised and unsupervised machine learning techniques for UXO classification using EMI data
    Bijamov, Alex
    Shubitidze, Fridon
    Fernandez, Juan Pablo
    Shamatava, Irma
    Barrowes, Benjamin E.
    O'Neill, Kevin
    DETECTION AND SENSING OF MINES, EXPLOSIVE OBJECTS, AND OBSCURED TARGETS XVI, 2011, 8017
  • [39] Supervised Learning Methods in Classifying Organized Behavior in Tweet Collections
    Begenilmis, Erdem
    Uskudarli, Susan
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2019, 28 (06)
  • [40] Using 'Supervised' Attribute Selection for Unsupervised Learning
    Tan, Swee Chuan
    2015 4TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE APPLICATIONS AND TECHNOLOGIES (ACSAT), 2015, : 198 - 201