Submission Number: 109
Submission ID: 201
Submission UUID: b9cb6032-a202-4c2d-a723-9ea1fa4aedeb
Submission URI: /form/project

Created: Mon, 09/06/2021 - 12:58
Completed: Mon, 09/06/2021 - 12:58
Changed: Tue, 01/10/2023 - 13:41

Remote IP address: 67.250.115.253
Submitted by: Winslow Hansen
Language: English

Is draft: No
Webform: Project
Project Title Simulating 21st century boreal forests and fire with a state-of-the-art process-based model
Program CAREERS
Project Image image.png
Tags biology (515), cluster-support (244), dependencies (218), deployment (451), netcdf (370), performance-tuning (17), r (32)
Status Finishing Up
Project Leader Winslow Hansen
Email hansenw@caryinstitute.org
Mobile Phone 406-580-8890
Work Phone 845 677-7600 x138
Mentor(s) Neil McGlohon
Student-facilitator(s) Xuejing Wang
Mentee(s)
Project Description Background
Temperature in the North American boreal forest is rising 2.5 times faster than the global average. Warming has caused boreal wildfires to become more frequent, larger, and severe than at any point in the last 10,000 years, which is eroding the resilience of forests, causing abrupt ecological change. Fires also increasingly threaten people, including Alaska Native and Canadian First-Nations communities. With climate change, the risk of fire and potential for forest degradation will only accelerate in the future. Advanced predictive modeling to evaluate and prioritize conservation and adaptation strategies hold great potential for mitigating the impact of increased fire by reducing risk, cost, and damage. My team develops advanced simulation models of boreal forests and fire. We use these models to identify leverage points that could alter trajectories toward sustainable social, economic, and ecological outcomes.

Projection description
The purpose of this project is to deploy a state-of-the-art process-based simulation model of forests and the necessary dependencies in an HPC environment at Rensselaer Polytechnic Institute. Tasks will include compiling the model on "bare metal", optimizing and benchmarking performance, and developing a workflow for managing 100s of replicate runs. The key to project success will be replicable workflows that are well documented so a non CS expert can repeat the processes.
Project Deliverables 1. step by step documentation of the project so a non CS expert can repeat and adapt the processes
1. simulation model and dependencies deployed in RPI HPC environment
2. Input datasets uploaded
3. workflow established for running 100s to 1000s of replication model runs
4. workflow established for analyzing model output in R statistical environment
Project Deliverables
Student Research Computing Facilitator Profile Undergrad/ grad student with interest in facilitating scientific HPC.
Student must have exceptional written and oral communication skills, especially with scientists lacking CS expertise.
Experience with geospatial analysis in R on an HPC a plus.
Mentee Research Computing Profile
Student Facilitator Programming Skill Level Practical applications
Mentee Programming Skill Level
Project Institution Cary Institute
Project Address
Anchor Institution CR-Rensselaer Polytechnic Institute
Preferred Start Date
Start as soon as possible. Yes
Project Urgency Already behind2Start date is flexible
Expected Project Duration (in months) 3-6 months
Launch Presentation
Launch Presentation Date 06/08/2022
Wrap Presentation
Wrap Presentation Date
Project Milestones
  • Milestone Title: Unpack iLand from the Enroot container and compile directly on AIMOS.
    Milestone Description: The student will compile iLand and its QT library dependency directly on AIMOS. The student will evaluate whether there are methods for optimizing model runs, and will conduct benchmarks to quantify computing performance.
  • Milestone Title: Optimize performance and benchmark gains
    Milestone Description: The student will evaluate whether there are methods for optimizing model runs, and will conduct benchmarks to quantify computing performance with and without optimization.
  • Milestone Title: Develop a workflow for managing 100s of replicate model runs.
    Milestone Description: Once optimized, the student will develop a bash workflow for managing 100s of replicate runs. This will include submissions and labeling and moving output databases into organized folders for later analysis.
Github Contributions
Planned Portal Contributions (if any)
Planned Publications (if any) This project will support an NSF funded project to PI Hansen to simulate boreal forests and fire in western North America. We anticipate 4-6 peer reviewed scientific publications from the project.
What will the student learn? The student will learn how to deploy bespoke software and dependencies, how to manage large datasets that input to simulation models, and how to setup pipelines for the analysis of big data as outputs from the models. Such skillsets will be highly sought after at National Labs and other research centers that run global climate models and Earth System Models.
What will the mentee learn?
What will the Cyberteam program learn from this project?
HPC resources needed to complete this project? HPC resources have already been secured from Rensselaer Polytechnic Institute.
Notes This project submission was encouraged by RPI CCI Director Dr. Chris Carothers.
What is the impact on the development of the principal discipline(s) of the project?
What is the impact on other disciplines?
Is there an impact physical resources that form infrastructure?
Is there an impact on the development of human resources for research computing?
Is there an impact on institutional resources that form infrastructure?
Is there an impact on information resources that form infrastructure?
Is there an impact on technology transfer?
Is there an impact on society beyond science and technology?
Lessons Learned
Overall results