Semi-Supervised Approach to Predictive Analysis Using Temporal Data

被引:1
|
作者
Shenk, Kimberly [1 ]
Bertsimas, Dimitris [2 ]
Markuzon, Natasha [3 ]
机构
[1] Hickam AFB, Hickam Field, HI USA
[2] MIT, Cambridge, MA 02139 USA
[3] Draper Lab, Cambridge, MA USA
关键词
Feature vectors - Large volumes - Medical claims - Myocardial Infarction - Predictive power - Semi-supervised - Spatiotemporal characteristics - Supervised and unsupervised learning;
D O I
10.5711/1082598319137
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Predicting a target event from temporal data using supervised learning alone presents a number of challenges. It assumes that members falling into the same class have similar historical characteristics, which is a too strong an assumption. Additionally, it can be difficult for the algorithm to underline the differences from a large volume of data and multitude of temporal projections. In such situations, a combination of supervised and unsupervised learning proved to be superior in performance as compared to supervised learning alone. In the proposed methodology, we develop feature vectors of temporal events that are subsequently split into groups by similarity of spatio-temporal characteristics using a clustering algorithm. We then apply a supervised learning methodology to predict the class within each of these subpopulations. We show a dramatic improvement in predictive power of this joint methodology as compared to supervised learning alone. The case study that we use to demonstrate the methodology utilizes medical claims data to predict a patient's short-term risk of myocardial infarction. In particular, we identify groups of people with temporal diagnostic patterns associated with a high-risk of myocardial infarction in the coming three months. We use these patterns as a profile reference for assessing the state of new patients. We demonstrate that the newly developed combined approach yields improved predictions for myocardial infarction over using classification alone.
引用
收藏
页码:37 / 50
页数:14
相关论文
共 50 条
  • [11] AN APPROACH TO SEMI-SUPERVISED CLASSIFICATION USING THE HUNGARIAN ALGORITHM
    Albalate, Amparo
    Suchindranath, Aparna
    Minker, Wolfgang
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2011, : 424 - 433
  • [12] Semi-supervised learning for big social data analysis
    Hussain, Amir
    Cambria, Erik
    NEUROCOMPUTING, 2018, 275 : 1662 - 1673
  • [13] A Semi-Supervised Approach for Temporal Information Extraction from Clinical Text
    Moharasan, Gandhimathi
    Tu Bao Ho
    2016 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING & COMMUNICATION TECHNOLOGIES, RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2016, : 7 - 12
  • [14] Semi-Supervised Learning on Data Streams via Temporal Label Propagation
    Wagner, Tal
    Guha, Sudipto
    Kasiviswanathan, Shiva Prasad
    Mishra, Nina
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [15] A new data selection approach for semi-supervised acoustic modeling
    Zhang, Rong
    Rudnicky, Alexander I.
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 421 - 424
  • [16] ClusterClean: a Weak Semi-Supervised Approach for Cleaning Data Labels
    Dimitriadou, Kyriaki
    Manghwani, Rahul
    Hoad, Timothy C.
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4589 - 4595
  • [17] A novel semi-supervised classification approach for evolving data streams
    Liao, Guobo
    Zhang, Peng
    Yin, Hongpeng
    Deng, Xuanhong
    Li, Yanxia
    Zhou, Han
    Zhao, Dandan
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 215
  • [18] A semi-supervised approach to projected clustering with applications to microarray data
    Yip, Kevin Y.
    Cheung, Lin
    Cheung, David W.
    Jing, Liping
    Ng, Michael K.
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2009, 3 (03) : 229 - 259
  • [19] Semi-supervised oblique predictive clustering trees
    Stepisnik, Tomaz
    Kocev, Dragi
    PEERJ COMPUTER SCIENCE, 2021,
  • [20] Semi-supervised oblique predictive clustering trees
    Stepišnik T.
    Kocev D.
    PeerJ Computer Science, 2021, 7 : 1 - 20