Publication

Hybrid Simulator for Capturing Dynamics of Synthetic Populations

Michel Bierlaire, Marija Kukic
2024
Conference paper
Abstract

This paper presents a novel hybrid framework for generating and updating a synthetic population. We call it hybrid because it combines model-based and data-driven approaches. Existing generators produce a snapshot of synthetic data that becomes outdated over time, requiring complete regeneration using the newest datasets for updates. By leveraging regularly collected data, we propose a method that provides up-to-date synthetic populations at any given moment without using complete re-generation. Our approach generates a baseline synthetic population once, using the Markov Chain Monte Carlo simulation, and projects it over time. In scenarios where disaggregated real data are unavailable, we project the synthetic sample by simulating life-changing events. When new disaggregated real data become available, we calibrate the projected sample using resampling to account for data collection biases and projection errors. We implement and test our approach on 2010, 2015, and 2021 Swiss mobility and transport micro-census data. To generate the baseline sample we use data from 2010 and project it to 2021. We compare the projections of our hybrid approach to existing methods, namely dynamic projection and resampling. The results demonstrate that the synthetic sample generated by the hybrid approach improves the fit to the real data compared to the dynamic projection, and improves heterogeneity compared to the resampling.

About this result
This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.