A comprehensive investigation of variational auto-encoders for population synthesis

被引:0
|
作者
Sane, Abdoul Razac [1 ]
Vandanjon, Pierre-Olivier [1 ]
Belaroussi, Rachid [2 ]
Hankach, Pierre [3 ]
机构
[1] Univ Gustave Eiffel, AME SPLOTT, All Ponts & Chaussees, F-44340 Bouguenais, France
[2] Univ Gustave Eiffel, COSYS GRETTIA, 5 Bd Descartes, F-77420 Champs Sur Marne, France
[3] Univ Gustave Eiffel, MAST LAMES, All Ponts & Chaussees, F-44340 Bouguenais, France
来源
关键词
Synthetic population; Machine learning; Deep generative model; Variational autoencoders; Sampling zeros; Structural zeros; BAYESIAN NETWORK; IMPACT; AGENT; AREA;
D O I
10.1007/s42001-024-00332-0
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
The use of synthetic populations has grown considerably over the recent years, in revolutionizing studies conducted within various fields, including social science research, urban planning, public health and transportation modeling. These synthetic populations prove to be valuable, as substitutes for the often missing or sensitive real data, and moreover are capable of preserving both privacy and representativeness. They are typically constructed from aggregate and/or sample data. Recently, new methods for generating synthetic populations based on deep learning, notably Variational Autoencoders (VAEs), have been developed. Such methods serve to overcome the limitations of traditional methods, such as Iterative Proportional Fitting (IPF), which are unable to generate agents with cross-modalities not found in the sample data. As such, IPF requires large samples to generate a synthetic population closely resembling the actual one. Conversely, the advantage of VAE lies in their ability to generate agents not found in the sample data, albeit with the risk of creating agents not existing in the actual population. However, the practical documentation as well as detailed analyses of the architectures and results from implementation of these deep learning approaches, in particular VAE, are limited, thus making these methods difficult to appropriate for practitioners. This paper focuses on generating synthetic populations using VAE. First, an in-depth and accessible theoretical explanation of how VAEs function is provided. Next, a detailed study of these methods is carried out by testing the various architectures, parameters, sample sizes and evaluation indicators necessary to guarantee high-quality results. Highlighted herein is the ability of VAEs to generate large datasets with a small training sample, in addition to VAE performance in generating new realistic individuals not present in the learning base. Certain limitations are identified, including the difficulties encountered by VAEs in managing numerical attributes and the need for post-processing to eliminate unrealistic individuals. In conclusion, despite a number of limitations, VAE constitutes a very promising methodology for generating synthetic populations, in offering practitioners numerous advantages. This paper is accompanied by a Python notebook to assist interested readers implement this new methodology.
引用
收藏
页数:34
相关论文
共 50 条
  • [21] Description Generation Using Variational Auto-Encoders for Precursor microRNA
    Petkovic, Marko
    Menkovski, Vlado
    ENTROPY, 2024, 26 (11)
  • [22] Ensemble kalman variational objective: a variational inference framework for sequential variational auto-encoders
    Ishizone, Tsuyoshi
    Higuchi, Tomoyuki
    Nakamura, Kazuyuki
    IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2023, 14 (04): : 691 - 717
  • [23] Understanding Instance-based Interpretability of Variational Auto-Encoders
    Kong, Zhifeng
    Chaudhuri, Kamalika
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [24] Generation and Extraction of Color Palettes with Adversarial Variational Auto-Encoders
    Moussa, Ahmad
    Watanabe, Hiroshi
    PROCEEDINGS OF SIXTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2021), VOL 2, 2022, 236 : 889 - 897
  • [25] Learning Generative Factors of EEG Data with Variational Auto-Encoders
    Zhdanov, Maksim
    Steinmann, Saskia
    Hoffmann, Nico
    DEEP GENERATIVE MODELS, DGM4MICCAI 2022, 2022, 13609 : 45 - 54
  • [26] Transforming Auto-Encoders
    Hinton, Geoffrey E.
    Krizhevsky, Alex
    Wang, Sida D.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2011, PT I, 2011, 6791 : 44 - 51
  • [27] On Disentanglement and Mutual Information in Semi-Supervised Variational Auto-Encoders
    Gordon Rodriguez, Elliott
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1257 - 1262
  • [28] Unsupervised Phonocardiogram Analysis With Distribution Density Based Variational Auto-Encoders
    Li, Shengchen
    Tian, Ke
    FRONTIERS IN MEDICINE, 2021, 8
  • [29] VARIATIONAL AUTO-ENCODERS WITHOUT GRAPH COARSENING FOR FINE MESH LEARNING
    Vercheval, Nicolas
    De Bie, Hendrik
    Pizurica, Aleksandra
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2681 - 2685
  • [30] Micro and Macro Level Graph Modeling for Graph Variational Auto-Encoders
    Zahirnia, Kiarash
    Schulte, Oliver
    Naddaf, Parmis
    Li, Ke
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,