A comprehensive investigation of variational auto-encoders for population synthesis

被引:0
|
作者
Sane, Abdoul Razac [1 ]
Vandanjon, Pierre-Olivier [1 ]
Belaroussi, Rachid [2 ]
Hankach, Pierre [3 ]
机构
[1] Univ Gustave Eiffel, AME SPLOTT, All Ponts & Chaussees, F-44340 Bouguenais, France
[2] Univ Gustave Eiffel, COSYS GRETTIA, 5 Bd Descartes, F-77420 Champs Sur Marne, France
[3] Univ Gustave Eiffel, MAST LAMES, All Ponts & Chaussees, F-44340 Bouguenais, France
来源
关键词
Synthetic population; Machine learning; Deep generative model; Variational autoencoders; Sampling zeros; Structural zeros; BAYESIAN NETWORK; IMPACT; AGENT; AREA;
D O I
10.1007/s42001-024-00332-0
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
The use of synthetic populations has grown considerably over the recent years, in revolutionizing studies conducted within various fields, including social science research, urban planning, public health and transportation modeling. These synthetic populations prove to be valuable, as substitutes for the often missing or sensitive real data, and moreover are capable of preserving both privacy and representativeness. They are typically constructed from aggregate and/or sample data. Recently, new methods for generating synthetic populations based on deep learning, notably Variational Autoencoders (VAEs), have been developed. Such methods serve to overcome the limitations of traditional methods, such as Iterative Proportional Fitting (IPF), which are unable to generate agents with cross-modalities not found in the sample data. As such, IPF requires large samples to generate a synthetic population closely resembling the actual one. Conversely, the advantage of VAE lies in their ability to generate agents not found in the sample data, albeit with the risk of creating agents not existing in the actual population. However, the practical documentation as well as detailed analyses of the architectures and results from implementation of these deep learning approaches, in particular VAE, are limited, thus making these methods difficult to appropriate for practitioners. This paper focuses on generating synthetic populations using VAE. First, an in-depth and accessible theoretical explanation of how VAEs function is provided. Next, a detailed study of these methods is carried out by testing the various architectures, parameters, sample sizes and evaluation indicators necessary to guarantee high-quality results. Highlighted herein is the ability of VAEs to generate large datasets with a small training sample, in addition to VAE performance in generating new realistic individuals not present in the learning base. Certain limitations are identified, including the difficulties encountered by VAEs in managing numerical attributes and the need for post-processing to eliminate unrealistic individuals. In conclusion, despite a number of limitations, VAE constitutes a very promising methodology for generating synthetic populations, in offering practitioners numerous advantages. This paper is accompanied by a Python notebook to assist interested readers implement this new methodology.
引用
收藏
页数:34
相关论文
共 50 条
  • [41] Directed Graph Auto-Encoders
    Kollias, Georgios
    Kalantzis, Vasileios
    Ide, Tsuyoshi
    Lozano, Aurelie
    Abe, Naoki
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7211 - 7219
  • [42] Graph Attention Auto-Encoders
    Salehi, Amin
    Davulcu, Hasan
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 989 - 996
  • [43] Conservativeness of Untied Auto-Encoders
    Im, Daniel Jiwoong
    Belghazi, Mohamed Ishmael
    Memisevic, Roland
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1694 - 1700
  • [44] Isometric Quotient Variational Auto-Encoders for Structure-Preserving Representation Learning
    Huh, In
    Jeong, Changwook
    Choe, Jae Myung
    Kim, Young-Gu
    Kim, Dae Sin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [45] Reconstruction probability-based anomaly detection using variational auto-encoders
    Iqbal T.
    Qureshi S.
    International Journal of Computers and Applications, 2023, 45 (03) : 231 - 237
  • [46] Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders
    Sadeghi, Mostafa
    Leglaive, Simon
    Alameda-Pineda, Xavier
    Girin, Laurent
    Horaud, Radu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1788 - 1800
  • [47] On the Quality of Deep Representations for Kepler Light Curves Using Variational Auto-Encoders
    Mena, Francisco
    Olivares, Patricio
    Bugueno, Margarita
    Molina, Gabriel
    Araya, Mauricio
    SIGNALS, 2021, 2 (04): : 706 - 728
  • [48] Preliminary Unknown Appliance Detection using Convolutional Variational Auto-Encoders for AAL
    de Diego-Oton, Laura
    Fuentes, David
    Pizarro, Daniel
    Hernandez, Alvaro
    Mari, Simone
    Nieto, Ruben
    2024 IEEE INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS, COINS 2024, 2024, : 289 - 292
  • [49] Salience Estimation via Variational Auto-Encoders for Multi-Document Summarization
    Li, Piji
    Wang, Zihao
    Lam, Wai
    Ren, Zhaochun
    Bing, Lidong
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3497 - 3503
  • [50] A robust generative classifier against transfer attacks based on variational auto-encoders
    Zhang, Chen
    Tang, Zhuo
    Zuo, Youfei
    Li, Kenli
    Li, Keqin
    INFORMATION SCIENCES, 2021, 550 : 57 - 70