Submission information
Submission Number: 95
Submission ID: 3441
Submission UUID: beb19e4c-3bb9-4635-acf3-6f805b8243d1
Submission URI: /form/resource
Created: Wed, 03/15/2023 - 13:56
Completed: Wed, 03/15/2023 - 13:58
Changed: Fri, 03/14/2025 - 11:43
Remote IP address: 73.229.137.18
Submitted by: Daniel Howard
Language: English
Is draft: No
Webform: Knowledge Base Resources
Approved: Yes
Title: Using Dask on HPC Systems
Category: Learning
Skill Level:
Beginner (304), Intermediate (305)
Description:
A tutorial on the effective use of Dask on HPC resources. The four-hour
tutorial will be split into two sections, with early topics focused on novice
Dask users and later topics focused on intermediate usage on HPC and
associated best practices. The knowledge areas covered include (but are not
limited to):
Beginner section
High-level collections including dask.array and dask.dataframe
Distributed Dask clusters using HPC job schedulers
Earth Science data analysis using Dask with Xarray
Using the Dask dashboard to understand your computation
Intermediate section
Optimizing the number of workers and memory allocation
Choosing appropriate chunk shapes and sizes for Dask collections
Querying resource usage and debugging errors
Link to Resource:
- Dask Tutorial Github Page (https://github.com/NCAR/dask-tutorial)
- Video Recording of Tutorial - Part 1 (https://youtu.be/wJHosuzqLaU)
- Video Recording of Tutorial - Part 2 (https://youtu.be/E4utSzWgJEo)
Tags:
training (381), jupyterhub (214), python (69)
Domain:
ACCESS CSSN (780), Campus Champions (572), CAREERS (323), CCMNet (835), Great Plains (311), Kentucky (322), Northeast (308)
Would you like to associate this resource with an Affinity Group?: {Empty}