A Practical Application of Data Mining Methods to Build Predictive Models for Autism Spectrum Disorder Based on Biosensor Data From Janssen Autism Knowledge Engine (JAKE®)

被引:5
|
作者
Jagannatha, Shyla [1 ]
Sargsyan, Davit [2 ]
Manyakov, Nikolay V. [3 ]
Skalkin, Andrew [2 ]
Bangerter, Abigail [1 ]
Ness, Seth [4 ]
Lewin, David [1 ]
Johnson, Kjell [5 ]
Durham, Kathryn [2 ]
Pandina, Gahan [1 ]
机构
[1] Janssen Res & Dev, 1125 Trenton Harbourton Rd, Titusville, NJ 08560 USA
[2] Janssen Res & Dev, Spring House, PA USA
[3] Janssen Res & Dev, Beerse, Belgium
[4] Janssen Res & Dev, Teaneck, NJ USA
[5] Stat Tenac, Ann Arbor, MI USA
来源
关键词
Diagnostic classifiers; JAKE; Predictive modeling; REGULARIZATION; RECOGNITION; CHILDREN;
D O I
10.1080/19466315.2018.1527247
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The Janssen Autism Knowledge Engine (JAKE) collects a large number of features from five biosensors across a range of tasks. The application of data mining methods to these data may be a useful approach to enable objective discrimination between autism spectrum disorder (ASD) and typically developing (TD) participants. Following a prospective observational study using JAKE, ASD participants classified as "moderate" or "severe" based on total scores on the Social Responsiveness Scale, and TD participants were used to build models, using repeated cross-validation, to identify biosensor features contributing to diagnosis. Four different models (partial least squares, random forest, elastic net, and C5.0) were chosen to build diagnostic classifiers using the training set, and the fitted models were evaluated on the test set. Model performance on the training set, based on receiver operating characteristics (ROC), was moderate (area under ROC curve = 0.61-0.72), and model performance on the test set based on kappa statistic was between 0.40 and 0.46 across the four models. Data mining methods applied to biosensor data can lead to models that discriminate ASD from TD. This method may prove useful in creating new diagnostic tests for ASD.
引用
收藏
页码:111 / 117
页数:7
相关论文
共 16 条
  • [1] The Janssen Autism Knowledge Engine (JAKE): Results From a Large, Prospective Observational Biosensor Study in Autism Spectrum Disorder
    Pandina, Gahan
    Nikolay, Manyakov
    Bangerter, Abigail
    Lewin, David
    Jagannatha, Shyla
    Boice, Matthew
    Skalkin, Andrew
    Ness, Seth
    NEUROPSYCHOPHARMACOLOGY, 2017, 42 : S349 - S349
  • [2] An Observational Study With the Janssen Autism Knowledge Engine (JAKE®) in Individuals With Autism Spectrum Disorder
    Ness, Seth L.
    Bangerter, Abigail
    Manyakov, Nikolay V.
    Lewin, David
    Boice, Matthew
    Skalkin, Andrew
    Jagannatha, Shyla
    Chatterjee, Meenakshi
    Dawson, Geraldine
    Goodwin, Matthew S.
    Hendren, Robert
    Leventhal, Bennett
    Shic, Frederick
    Frazier, Jean A.
    Janvier, Yvette
    King, Bryan H.
    Miller, Judith S.
    Smith, Christopher J.
    Tobe, Russell H.
    Pandina, Gahan
    FRONTIERS IN NEUROSCIENCE, 2019, 13
  • [3] BUILDING PREDICTIVE MODELS FOR AUTISM SPECTRUM DISORDER BASED ON BIOSENSOR DATA
    Jagannatha, S.
    Sargsyan, D.
    Manyakov, N. V.
    Skalkin, A.
    Bangerter, A.
    Ness, S.
    Lewin, D.
    Dawson, Geraldine
    Shic, F.
    Goodwin, M. S.
    Hendren, R.
    Leventhal, Bennett L.
    Pandina, G.
    JOURNAL OF THE AMERICAN ACADEMY OF CHILD AND ADOLESCENT PSYCHIATRY, 2017, 56 (10): : S213 - S213
  • [4] The Janssen Autism Knowledge Engine (JAKE™): Results From a Clinical Validation Study of a Set of Tools and Technologies to Assess Potential Biomarkers for Autism Spectrum Disorder
    Ness, Seth
    Nikolay, Manyakov
    Bangerter, Abigail
    Lewin, David
    Jagannatha, Shyla
    Boice, Matthew
    Skalkin, Andrew
    Dawson, Geraldine
    Goodwin, Matthew
    Hendren, Robert
    Leventhal, Bennett
    Shic, Frederic
    Cioccia, Walter
    Pandina, Gahan
    NEUROPSYCHOPHARMACOLOGY, 2016, 41 : S475 - S475
  • [5] JAKE® Multimodal Data Capture System: Insights from an Observational Study of Autism Spectrum Disorder
    Ness, Seth L.
    Manyakov, Nikolay V.
    Bangerter, Abigail
    Lewin, David
    Jagannatha, Shyla
    Boice, Matthew
    Skalkin, Andrew
    Dawson, Geraldine
    Janvier, Yvette M.
    Goodwin, Matthew S.
    Hendren, Robert
    Leventhal, Bennett
    Shic, Frederick
    Cioccia, Walter
    Pandina, Gahan
    FRONTIERS IN NEUROSCIENCE, 2017, 11
  • [6] Computational Methods for Predicting Autism Spectrum Disorder from Gene Expression Data
    Zhang, Junpeng
    Thin Nguyen
    Buu Truong
    Liu, Lin
    Li, Jiuyong
    Thuc Duy Le
    ADVANCED DATA MINING AND APPLICATIONS, 2020, 12447 : 395 - 409
  • [7] A Data Mining Based Approach to Predict Autism Spectrum Disorder Considering Behavioral Attributes
    Shuvo, Shaon Bhatta
    Ghosh, Joyoshree
    Oyshi, Atia Sujana
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [8] On the Utility of Parents' Historical Data to Investigate the Causes of Autism Spectrum Disorder: A Data Mining-Based Framework
    Halim, Zahid
    Khan, Gohar
    Shah, Babar
    Naseer, Rabia
    Anwar, Sajid
    Shah, Ahsan
    IRBM, 2023, 44 (04)
  • [9] Comparison of methods for identifying phenotype subgroups using categorical features data with application to autism spectrum disorder
    Gebregziabher, Mulugeta
    Shotwell, Matthew S.
    Charles, Jane M.
    Nicholas, Joyce S.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (01) : 114 - 125
  • [10] A new Autism Spectrum Disorder Discovery (ASDD) strategy using data mining techniques based on blood tests
    Saleh, Ahmed I.
    Rabie, Asmaa H.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 81