Submission information
Submission Number: 88
Submission ID: 125
Submission UUID: d10bf9ba-27ac-487a-9cce-d2c58ef5d419
Submission URI: /form/project
Created: Wed, 02/17/2021 - 18:27
Completed: Wed, 02/17/2021 - 18:43
Changed: Thu, 05/05/2022 - 03:40
Remote IP address: 173.76.15.219
Submitted by: Janelle Hammond
Language: English
Is draft: No
Webform: Project
Uncertainty Quantification for Urban Air Quality Modeling in Clermont-Ferrand, France
Complete
Project Leader
Project Personnel
Project Information
With increased pollutant emissions and exposure due to mass urbanization worldwide, air quality measurement campaigns and epidemiology studies on air pollution and health effects have become increasingly common to estimate individual exposures and evaluate their association to various illnesses. Air quality simulations at urban scale are a key tool for the evaluation of population exposure to particulate matter and gaseous air pollutants. While modeling methods of various sophistication exist, these models depend on varying parameters such as traffic demand and emissions, and meteorological conditions, often unknown at the micro scale. In addition, atmospheric dispersion models generally make assumptions of simplified or unknown physics, and have high computational costs. The simulations are thus expensive and subject to high uncertainties that originate from various sources, especially from unknown or sparsely represented input data. The quantification of the resulting uncertainties is an important topic, particularly in the context of the assimilation of observational data. This task is no simple feat, considering the computational expense of detailed, small-scale numerical models. The objective of our work is to study the uncertainties in a simulation chain, which includes a dynamic traffic assignment model, an emission model and an atmospheric dispersion model at urban scale, using meta-modeling by projection-based dimensional reduction throughout the chain to reduce computational costs.
The domain of study in this work is the metropolitan area of Clermont-Ferrand (France). In this rich case study, we have access to two years of hourly observation data on traffic flow and pollutant concentrations. We have a representation of the road network and urban geometry, data on traffic demand, vehicle fleet, meteorological observations, background pollutant concentrations and background surface emissions, provided by the city of Clermont-Ferrand and the French company NUMTECH, a leader in environmental modeling and data science. An agreement with NUMTECH has been signed to continue this important collaboration.
We began with the meta-modeling of the computationally intensive simulation chain, using reduced basis of the model output spaces, and surrogate approximations of the projection of a state onto the basis. The traffic assignment model has been meta-modeled in [1], and after dimensional reduction of the input space by a reduced basis representing pollutant emissions, the atmospheric dispersion model is replaced by a meta-model for concentration of NO2 at street scale [2]. This meta-modeling technique has the advantage of relying on a reduced basis which represents the predominant behaviors of the model solutions, but with a non-intrusive method of approximating projection coefficients and implementation at much lower computational cost than the full model.
Uncertainty in inputs can be represented by probability density functions (PDFs) on each varying model parameter, throughout the simulation chain, such as traffic demand for the traffic model, composition of the vehicle fleet for emission estimation, and meteorology for the atmospheric dispersion, as well as uncertainty propagated from the traffic and emissions models to the inputs of the dispersion model. We also consider the uncertainty propagation due to the dimensional reduction necessary for computational costs of numerous model runs. We study the propagation of uncertainty in the complete chain, using Markov chain Monte Carlo (MCMC) simulation and probabilistic scores. The simulations are compared to observations, mainly pollutant concentrations at air quality monitoring stations, and traffic observations at loop counters used in the computation of traffic demand inputs. The goal of this uncertainty quantification is to find a representation of the distributions of the model input parameters, which are considered to be of stochastic nature with deterministic approximations in practice, based on this comparison to observation data. MCMC methods are computationally very expensive, precluding application to large-scale operational models for urban air quality. Our meta-model chain, in combination with extensive observation data, allows us to study these otherwise inaccessible methods. We want to take full advantage of the wealth of data available in this case study by using many observations to determine parameter distributions, which nevertheless requires significant computational resources despite the great reduction in approximation time by the meta-models.
References:
[1] Chen, Ruiwei, et al. Metamodeling of a Dynamic Traffic Assignment Model at Metropolitan Scale.
[2] Hammond, J. K., et al. “Meta-Modeling of a Simulation Chain for Urban Air Quality.” Advanced Modeling and Simulation in Engineering Sciences, vol. 7, no. 1, Sept. 2020, p. 37. BioMed Central, doi:10.1186/s40323-020-00173-2.
The domain of study in this work is the metropolitan area of Clermont-Ferrand (France). In this rich case study, we have access to two years of hourly observation data on traffic flow and pollutant concentrations. We have a representation of the road network and urban geometry, data on traffic demand, vehicle fleet, meteorological observations, background pollutant concentrations and background surface emissions, provided by the city of Clermont-Ferrand and the French company NUMTECH, a leader in environmental modeling and data science. An agreement with NUMTECH has been signed to continue this important collaboration.
We began with the meta-modeling of the computationally intensive simulation chain, using reduced basis of the model output spaces, and surrogate approximations of the projection of a state onto the basis. The traffic assignment model has been meta-modeled in [1], and after dimensional reduction of the input space by a reduced basis representing pollutant emissions, the atmospheric dispersion model is replaced by a meta-model for concentration of NO2 at street scale [2]. This meta-modeling technique has the advantage of relying on a reduced basis which represents the predominant behaviors of the model solutions, but with a non-intrusive method of approximating projection coefficients and implementation at much lower computational cost than the full model.
Uncertainty in inputs can be represented by probability density functions (PDFs) on each varying model parameter, throughout the simulation chain, such as traffic demand for the traffic model, composition of the vehicle fleet for emission estimation, and meteorology for the atmospheric dispersion, as well as uncertainty propagated from the traffic and emissions models to the inputs of the dispersion model. We also consider the uncertainty propagation due to the dimensional reduction necessary for computational costs of numerous model runs. We study the propagation of uncertainty in the complete chain, using Markov chain Monte Carlo (MCMC) simulation and probabilistic scores. The simulations are compared to observations, mainly pollutant concentrations at air quality monitoring stations, and traffic observations at loop counters used in the computation of traffic demand inputs. The goal of this uncertainty quantification is to find a representation of the distributions of the model input parameters, which are considered to be of stochastic nature with deterministic approximations in practice, based on this comparison to observation data. MCMC methods are computationally very expensive, precluding application to large-scale operational models for urban air quality. Our meta-model chain, in combination with extensive observation data, allows us to study these otherwise inaccessible methods. We want to take full advantage of the wealth of data available in this case study by using many observations to determine parameter distributions, which nevertheless requires significant computational resources despite the great reduction in approximation time by the meta-models.
References:
[1] Chen, Ruiwei, et al. Metamodeling of a Dynamic Traffic Assignment Model at Metropolitan Scale.
[2] Hammond, J. K., et al. “Meta-Modeling of a Simulation Chain for Urban Air Quality.” Advanced Modeling and Simulation in Engineering Sciences, vol. 7, no. 1, Sept. 2020, p. 37. BioMed Central, doi:10.1186/s40323-020-00173-2.
Project Information Subsection
• Simulating urban air quality over the Clermont-Ferrand agglomeration in France using a pre-constructed chain of meta-models.
• MCMC simulations using the air quality meta-modeling chain with observation data on traffic flow and pollutant concentrations, to estimate the distribution of the input parameters. Various simulations will be necessary to calibrate the appropriate initial parameter distributions and the maximum likelihood estimator.
• Uncertainty quantification method coded in Python which will be transferable to other computational domains given appropriate data and pre-constructed meta-model.
• Maps and graphics representing urban air quality and uncertain distributions of model parameters.
• Results to be presented at scientific conferences.
• MCMC simulations using the air quality meta-modeling chain with observation data on traffic flow and pollutant concentrations, to estimate the distribution of the input parameters. Various simulations will be necessary to calibrate the appropriate initial parameter distributions and the maximum likelihood estimator.
• Uncertainty quantification method coded in Python which will be transferable to other computational domains given appropriate data and pre-constructed meta-model.
• Maps and graphics representing urban air quality and uncertain distributions of model parameters.
• Results to be presented at scientific conferences.
{Empty}
{Empty}
{Empty}
Can work with any level
{Empty}
Stonehill College
{Empty}
NE-MGHPCC
05/10/2021
No
Already behind5Start date is flexible
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
- Milestone Title: File transfer
Milestone Description: We will be able to copy files to and from the necessary location.
Completion Date Goal: 2021-05-12 - Milestone Title: HPC runs
Milestone Description: We will be able to launch computations, including parallel computing, and access any output logs or error messages.
Completion Date Goal: 2021-06-15 - Milestone Title: Initial results
Milestone Description: We will run current MCMC uncertainty quantification code over parameters representing a period of one month, and analyze the results to determine the next steps.
Completion Date Goal: 2021-06-30
{Empty}
{Empty}
The results of this project will lead to an article on Uncertainty Quantification for Urban Air Quality Modeling, which we plan to submit to a leading journal in the field. I expect that a significant portion of the work towards this publication has already been done, but need access to computational resources to continue exploring numerical results. An article has recently been published [cite] on the meta-modeling chain used in this study.
In addition to experience helping researchers transition from individual computing resources to off-site cluster resources, the student facilitator will have the opportunity to learn about the computational requirements and methods used in this project. This includes model order reduction, which remains a topic of interest despite increases in computational capacity, data assimilation for the calibration and improvement of models, and uncertainty quantification by Markov chain Monte Carlo methods. This project also involves international collaboration with the French company NUMTECH, a leader in environmental modeling and data science.
{Empty}
{Empty}
My goal is to perform MCMC simulations for the input parameters of a month-long hourly air quality meta-model simulation to study the uncertainty in these parameters. In practice MCMC simulations are generally done with at least 100,000 iterations, where the nth simulation depends on the previous, precluding the possibility of pseudo-parallel computations by dividing the iterations among multiple workstation machines. Each iteration calls a chain of meta-models, for which the initialization requires approximately 20 minutes and should thus be done only once at the initialization of the MCMC loop. Each iteration of the MCMC chain should compute one month of hourly meta-model simulations, requiring up to approximately 42GB of RAM. The MCMC iterations produce outputs of shape (100000,2520,469), and without parallelization, each iteration requires 2520 meta-model computations totaling 191.5 seconds. Adjustments to the meta-model could reduce the RAM requirements, but would simultaneously reduce the precision of the reduced-order approximations. The MCMC outputs could be written at each iteration (as opposed to holding the entire array in working memory) with correct method compatible with parallel computing.
{Empty}
Final Report
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}
{Empty}