Semi-Supervised Approach to Predictive Analysis Using Temporal Data

Cited by: 1
Authors
Shenk, Kimberly [1 ]
Bertsimas, Dimitris [2 ]
Markuzon, Natasha [3 ]
Affiliations
[1] Hickam AFB, Hickam Field, HI USA
[2] MIT, Cambridge, MA 02139 USA
[3] Draper Lab, Cambridge, MA USA
Keywords
Feature vectors - Large volumes - Medical claims - Myocardial Infarction - Predictive power - Semi-supervised - Spatiotemporal characteristics - Supervised and unsupervised learning;
DOI
10.5711/1082598319137
Chinese Library Classification (CLC) number
C93 [Management Science]; O22 [Operations Research];
Discipline classification code
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
Abstract
Predicting a target event from temporal data using supervised learning alone presents a number of challenges. It assumes that members falling into the same class have similar historical characteristics, which is too strong an assumption. Additionally, it can be difficult for the algorithm to pick out the distinguishing differences within a large volume of data and a multitude of temporal projections. In such situations, a combination of supervised and unsupervised learning proves superior in performance to supervised learning alone. In the proposed methodology, we develop feature vectors of temporal events, which are then split into groups by similarity of spatio-temporal characteristics using a clustering algorithm. We then apply a supervised learning methodology to predict the class within each of these subpopulations. We show a dramatic improvement in the predictive power of this joint methodology compared with supervised learning alone. The case study that we use to demonstrate the methodology uses medical claims data to predict a patient's short-term risk of myocardial infarction. In particular, we identify groups of people with temporal diagnostic patterns associated with a high risk of myocardial infarction in the coming three months. We use these patterns as a reference profile for assessing the state of new patients. We demonstrate that the newly developed combined approach yields better predictions of myocardial infarction than classification alone.
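The cluster-then-classify idea described in the abstract can be illustrated with a minimal sketch. The abstract does not name the clustering algorithm or the classifier, so everything here is an assumption made for illustration: k-means with a deterministic farthest-point initialization stands in for the clustering step, a nearest-class-mean rule stands in for the per-cluster supervised model, and the toy feature vectors and the labels (1 = event, 0 = no event) are hypothetical, not taken from the paper.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean(points):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(points) for col in zip(*points)]

def nearest(centroids, p):
    """Index of the centroid closest to p."""
    return min(range(len(centroids)), key=lambda i: sq_dist(centroids[i], p))

def kmeans(X, k, iters=50):
    """Plain k-means; farthest-point initialization is an assumption of this sketch."""
    centroids = [list(X[0])]
    while len(centroids) < k:
        far = max(X, key=lambda p: min(sq_dist(p, c) for c in centroids))
        centroids.append(list(far))
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in X:
            groups[nearest(centroids, p)].append(p)
        new = [mean(g) if g else centroids[i] for i, g in enumerate(groups)]
        if new == centroids:
            break
        centroids = new
    return centroids

def class_means(points, labels):
    """Mean feature vector per class label."""
    by_class = {}
    for p, label in zip(points, labels):
        by_class.setdefault(label, []).append(p)
    return {label: mean(pts) for label, pts in by_class.items()}

def fit(X, y, k):
    """Cluster the feature vectors, then fit one simple classifier per cluster."""
    centroids = kmeans(X, k)
    assigned = [nearest(centroids, p) for p in X]
    models = []
    for c in range(k):
        pts = [p for p, a in zip(X, assigned) if a == c]
        lbl = [l for l, a in zip(y, assigned) if a == c]
        models.append(class_means(pts, lbl))
    fallback = class_means(X, y)  # used only if a cluster has no training members
    return centroids, models, fallback

def predict(model, p):
    """Route p to its cluster, then classify with that cluster's model."""
    centroids, models, fallback = model
    means = models[nearest(centroids, p)] or fallback
    return min(means, key=lambda label: sq_dist(means[label], p))

# Hypothetical toy data: two subpopulations in which the second feature
# relates to the outcome in opposite directions.
X = [[0, 0], [1, 0], [0, 2], [1, 2], [10, 0], [11, 0], [10, 2], [11, 2]]
y = [0, 0, 1, 1, 1, 1, 0, 0]
model = fit(X, y, k=2)
print(predict(model, [0.5, 1.8]), predict(model, [10.5, 1.8]))
```

In this toy data the second feature points toward opposite outcomes in the two subpopulations, so a single global nearest-class-mean rule cannot separate the classes, while routing each point to its cluster's own model can.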
Pages: 37-50
Number of pages: 14
Related papers
50 in total
  • [21] A Semi-supervised Data Augmentation Approach Using 3D Graphical Engines
    Liu, Shuangjun
    Ostadabbas, Sarah
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 395 - 408
  • [22] SSSA: low data sentiment analysis using boosting semi-supervised approach and deep feature learning network
    Rashidi, Shima
    Tanha, Jafar
    Sharifi, Arash
    Hosseinzadeh, Mehdi
    APPLIED INTELLIGENCE, 2025, 55 (04)
  • [23] Semi-supervised discriminant analysis
    Cai, Deng
    He, Xiaofei
    Han, Jiawei
    2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 222 - 228
  • [24] Semi-supervised Tuning from Temporal Coherence
    Maltoni, Davide
    Lomonaco, Vincenzo
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2509 - 2514
  • [25] SEMI-SUPERVISED REGRESSION WITH TEMPORAL IMAGE SEQUENCES
    Xie, Ling
    Carreira-Perpinan, Miguel A.
    Newsam, Shawn
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2637 - 2640
  • [26] Semi-supervised Component Analysis
    Watanabe, Kenji
    Wada, Toshikazu
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 3011 - 3016
  • [27] Fraud Detection in Big Data using Supervised and Semi-supervised Learning Techniques
    Melo-Acosta, German E.
    Duitama-Munoz, Freddy
    Arias-Londono, Julian D.
    2017 IEEE COLOMBIAN CONFERENCE ON COMMUNICATIONS AND COMPUTING (COLCOM), 2017,
  • [28] Self-Supervised and Semi-Supervised Polyp Segmentation using Synthetic Data
    Moreu, Enric
    Arazo, Eric
    McGuinness, Kevin
    O'Connor, Noel E.
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [29] Spectral clustering: A semi-supervised approach
    Chen, Weifu
    Feng, Guocan
    NEUROCOMPUTING, 2012, 77 (01) : 229 - 242
  • [30] Semi-supervised classification of iEEG using temporal autoencoder neural network
    Nejedly, P.
    Kremen, V.
    Lepkova, K.
    Mivalt, F.
    Sladky, V.
    Balzekas, I.
    Pridalova, T.
    Klimes, P.
    Plesinger, F.
    Brazdil, M.
    Jurak, P.
    Worrell, G.
    EPILEPSIA, 2022, 63 : 81 - 82