Skip to main content

Breadcrumb

  1. ACCESS Home
  2. Support
  3. Knowledge Base
  4. Knowledge Base Resources

Knowledge Base Resources

These resources are contributed by researchers, facilitators, engineers, and HPC admins. Please upvote resources you find useful!
Add a Resource

Filters

Topics

  • machine-learning (38)
  • ai (30)
  • data-analysis (30)
  • training (26)
  • documentation (22)
  • deep-learning (19)
  • big-data (18)
  • neural-networks (16)
  • parallelization (14)
  • visualization (14)
  • workforce-development (14)
  • community-outreach (12)
  • image-processing (12)
  • programming (11)
  • cybersecurity (10)
  • gpu (10)
  • professional-development (10)
  • r (10)
  • pytorch (9)
  • slurm (9)
  • c (7)
  • mpi (7)
  • plotting (7)
  • cuda (6)
  • gis (6)

Topics

  • machine-learning (38)
  • ai (30)
  • data-analysis (30)
  • training (26)
  • documentation (22)
  • deep-learning (19)
  • big-data (18)
  • neural-networks (16)
  • parallelization (14)
  • visualization (14)
  • workforce-development (14)
  • community-outreach (12)
  • image-processing (12)
  • programming (11)
  • cybersecurity (10)
  • gpu (10)
  • professional-development (10)
  • r (10)
  • pytorch (9)
  • slurm (9)
  • c (7)
  • mpi (7)
  • plotting (7)
  • cuda (6)
  • gis (6)

Programming Language

  • python (32)
  • c++ (11)

Programming Language

  • python (32)
  • c++ (11)

Science Domain

  • data-science (12)
  • bioinformatics (6)
  • computational-chemistry (6)

Science Domain

  • data-science (12)
  • bioinformatics (6)
  • computational-chemistry (6)

Skill Level

  • (-) intermediate (191)
  • beginner (189)
  • advanced (111)

Content Type

  • learning (59)
  • website (47)
  • documentation (43)
  • tool (25)
  • video_link (10)
  • presentation (5)
  • mailing_list (1)
  • video (1)

Skill Level

  • (-) intermediate (191)
  • beginner (189)
  • advanced (111)

Content Type

  • learning (59)
  • website (47)
  • documentation (43)
  • tool (25)
  • video_link (10)
  • presentation (5)
  • mailing_list (1)
  • video (1)
The Carpentries
5
  • Carpentries.org
We teach foundational coding and data science skills to researchers worldwide.
administering-hpctraining
5 Likes

Login to like
Type
website
Level
Beginner, Intermediate, Advanced
HPC University
3
  • HPC University Resources
A comprehensive list of training resources from the HPC University. HPCU is a virtual organization whose primary goal is to provide a cohesive, persistent, and sustainable on-line environment to share educational and training materials for a continuum of high performance computing environments that span desktop computing capabilities to the highest-end of computing facilities offered by HPC centers.
debugginghpc-operationsprofessional-developmenttrainingworkforce-developmentcompilingmatlabpythonrmpi
3 Likes

Login to like
Type
learning
Level
Beginner, Intermediate, Advanced
Open OnDemand
2
  • Open Ondemand Home Page
Open OnDemand is an easy-to-use web portal that lets students, researchers, and industry professionals use supercomputers from anywhere. It is installed on supercomputing resources at hundreds of sites. By eliminating the need for client software or command-line interface, Open OnDemand empowers users of all skill levels and significantly speeds up the time to their first computing.
open-ondemandadministering-hpccluster-managementcluster-supporthpc-operationsbatch-jobskubernetes
2 Likes

Login to like
Type
website
Level
Beginner, Intermediate, Advanced
An Introduction to Cryptography with Python
2
  • Workshop Tutorial
This comprehensive workshop is designed to guide participants through the world of cryptography, from foundational concepts to advanced implementations. Starting with the basics of encryption, decryption, and hashing, the workshop discusses real-world applications like SSL, blockchain, and digital signatures. Interactive Python-based coding examples, such as symmetric and asymmetric encryption, will provide hands-on experience. Participants will also learn to identify cryptographic vulnerabilities and perform attacks like length extension. Finally, the workshop also explores future trends such as quantum cryptography and zero-knowledge proofs, providing participants with the knowledge to apply cryptography in securing modern digital systems. Ideal for beginners and intermediate learners alike, this workshop is a step-by-step journey into mastering cryptographic principles and practices.
pythondata-securitycybersecurityencryptionsecure-data-architecture
2 Likes

Login to like
Type
website
Level
Beginner, Intermediate
Useful R Packages for Data Science and Statistics
1
  • https://www.udacity.com/blog/2021/01/best-r-packages-for-data-science.html
This Udacity article listed the most frequently used R packages for data science and statistics. For each package, the article provided the link to its official documentation. It will be a great start point if you want to start your data science journey in R.
plottingvisualizationdata-analysismachine-learningdata-sciencer
1 Like

Login to like
Type
documentation
Level
Beginner, Intermediate, Advanced
ACCESS HPC Workshop Series
1
  • ACESS HPC Workshop Series
  • MPI Workshop
  • OpenMP Workshop
  • GPU Programming Using OpenACC
  • Summer Boot Camp
  • Big Data and Machine Learning
Monthly workshops sponsored by ACCESS on a variety of HPC topics organized by Pittsburgh Supercomputing Center (PSC). Each workshop will be telecast to multiple satellite sites and workshop materials are archived.
deep-learningmachine-learningneural-networksbig-datatensorflowgputrainingopenmpicc++fortranopenmpprogrammingmpispark
1 Like

Login to like
Type
learning
Level
Beginner, Intermediate
HPC Carpentry
1
  • HPC Carpentry
An HPC focused Carpentry community. Trainings include: HPC fundamentals, python, chapel, LAMMPS, parallelization with python, scaling studies, etc.
software-carpentrytraining
1 Like

Login to like
Type
website
Level
Beginner, Intermediate, Advanced
ACCESS Pegasus Documentation
1
  • ACCESS Pegasus Documentation
The documentation provides an overview of using Pegasus, a workflow management system, on ACCESS resources for high throughput computing (HTC) workloads, covering logging in, workflow creation, resource configuration, and monitoring options.
pegasus
1 Like

Login to like
Type
documentation
Level
Beginner, Intermediate, Advanced
Open OnDemand Documentation Repository
1
  • Open OnDemand Documentation repo
This is the main documentation repo for the Open OnDemand Portal which enables researchers to access HPC resources from a familiar web interface.
documentationopen-ondemand
1 Like

Login to like
Type
documentation
Level
Beginner, Intermediate, Advanced
Data Visualization tools for Python
1
  • MatPlotLib Docs
Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python. It makes analyzing and presenting your data extremely easy and works with Python which many people already know.
documentationpython
1 Like

Login to like
Type
documentation
Level
Beginner, Intermediate
GIS: Geocoding Services
1
  • Tidy Geocoder: geocoder services
  • GeoPy: Gecoders
Geocoding is the process of taking a street address and converting it into coordinates that can be plotted on a map. This conversion typically requires an API call to a remote server hosted by an organization/institution. The remote server will take the address attributes provided by you and the remote server will compare it to the data it contains and return a best estimate on the coordinates for that location. There are many geocoding services available with different world coverages, quality of result, and set different rate limits for access. For R, a package called "tidygeocoder" provides an easy way to connect to these different services. As an additional benefit, their documentation provides a good summary of geocoding services available and links to their documentation. The link to the documentation for gecoding services accessible by "tidygeocoder" is provided below. For Python, geopy package is a library that provides connection to various geocoding services. The link to the documentation for this package is also included below.
gis
1 Like

Login to like
Type
documentation
Level
Beginner, Intermediate
NCSA HPC Training Moodle
1
  • NCSA HPC Training Moodle Site
Self-paced tutorials on high-end computing topics such as parallel computing, multi-core performance, and performance tools. Other related topics include 'Cybersecurity for End Users' and 'Developing Webinar Training.' Some of the tutorials also offer digital badges. Many of these tutorials were previously offered on CI-Tutor. A list of open access training courses are provided below. Parallel Computing on High-Performance Systems Profiling Python Applications Using an HPC Cluster for Scientific Applications Debugging Serial and Parallel Codes Introduction to MPI Introduction to OpenMP Introduction to Visualization Introduction to Performance Tools Multilevel Parallel Programming Introduction to Multi-core Performance Using the Lustre File System
performance-tuningprofilingparallelizationlustretrainingworkforce-developmentopenmppythonmpicybersecurity
1 Like

Login to like
Type
learning
Level
Beginner, Intermediate
Attention, Transformers, and LLMs: a hands-on introduction in Pytorch
1
  • Landing Page
  • Preparing data for LLM training
  • Small Language Models: an introduction to autoregressive language modeling
  • Attention is all you need
  • Other LLM Topics
This workshop focuses on developing an understanding of the fundamentals of attention and the transformer architecture so that you can understand how LLMs work and use them in your own projects.
aideep-learningmachine-learningneural-networkspytorch
1 Like

Login to like
Type
learning
Level
Intermediate
Cornell Virtual Workshop
1
  • Roadmaps in Cornell Virtual Workshop
  • Search for topics
Cornell Virtual Workshop is a comprehensive training resource for high performance computing topics. The Cornell University Center for Advanced Computing (CAC) is a leader in the development and deployment of Web-based training programs. Our Cornell Virtual Workshop learning platform is designed to enhance the computational science skills of researchers, accelerate the adoption of new and emerging technologies, and broaden the participation of underrepresented groups in science and engineering. Over 350,000 unique visitors have accessed Cornell Virtual Workshop training on programming languages, parallel computing, code improvement, and data analysis. The platform supports learning communities around the world, with code examples from national systems such as Frontera, Stampede2, and Jetstream2.
jetstreammatlabcloud-computingdata-analysisperformance-tuningparallelizationfile-transferglobusslurmtrainingcudamatlabpythonrmpi
1 Like

Login to like
Type
learning
Level
Beginner, Intermediate, Advanced
DARWIN Documentation Pages
1
  • DARWIN Documentation
DARWIN (Delaware Advanced Research Workforce and Innovation Network) is a big data and high performance computing system designed to catalyze Delaware research and education
darwinbig-data
1 Like

Login to like
Type
documentation
Level
Beginner, Intermediate, Advanced
Using Linux commands in a python script (and the difference between the subprocess and os python modules)
1
  • Using Linux Commands in a Python Script
Learn how to use Linux commands in a python script. Specifically, learn how to use the subprocess and os modules in python to run shell commands (which run Linux commands) in a python script that is run on a cluster.
cluster-managementprogrammingpython
1 Like

Login to like
Type
learning
Level
Beginner, Intermediate
Introduction to Deep Learning in Pytorch
1
  • Landing Page
  • Pytorch Quickstart
  • Pytorch Basics
  • Pytorch GPU Support
  • Regression and Classification with Fully Connected Neural Networks
  • High Dimensional Data
  • Datasets and data loading
  • Building the network
  • Computer Vision and Convolutional Neural Networks
This workshop series introduces the essential concepts in deep learning and walks through the common steps in a deep learning workflow from data loading and preprocessing to training and model evaluation. Throughout the sessions, students participate in writing and executing simple deep learning programs using Pytorch – a popular Python library for developing, training, and deploying deep learning models.
aideep-learningimage-processingmachine-learningneural-networkspytorchgpu
1 Like

Login to like
Type
learning
Level
Beginner, Intermediate
Feed Forward NNs and Gradient Descent
0
  • Feed-Forward and SGD
Feed-forward neural networks are a simple type of network that simply rely on data to be "fed-forward" through a series of layers that makes decisions on how to categorize datum. Gradient descent is a type of optimization tool that is often used to train machines. These two areas in ML are good starting points and are the easiest types of neural network/optimization to understand.
deep-learningmachine-learningneural-networks
0 Likes

Login to like
Type
website
Level
Intermediate
Fine-tuning LLMs with PEFT and LoRA
0
  • Fine-tuning LLMs with PEFT and LoRA
As LLMs get larger fine-tuning to the full extent can become difficult to train on consumer hardware. Storing and deploying these tuned models can also be quite expensive and difficult to store. With PEFT (parameter -efficent fine tuning), it approaches fine-tune on a smaller scale of model parameters while freezing most parameters of the pretrained LLMs. Basically it is providing full performance that which is similar if not better than full fine tuning while only having a small number of trainable parameters. This source explains that as well as going over LORA diagrams and a code walk through.
fasteroptimizationperformance-tuningtuning
0 Likes

Login to like
Type
video_link
Level
Intermediate, Advanced
Benchmarking with a cross-platform open-source flow solver, PyFR
0
  • PyFR documentation
  • PyFR source code from Github
  • Discourse channel for discussions and help
What is PyFR and how does it solve fluid flow problems? PyFR is an open-source Computational Fluid Dynamics (CFD) solver that is based on Python and employs the high-order Flux Reconstruction technique. It effectively solves fluid flow problems by utilizing streaming architectures, making it suitable for complex fluid dynamics simulations. How does PyFR achieve scalability on clusters with CPUs and GPUs? PyFR achieves scalability by leveraging distributed memory parallelism through the Message Passing Interface (MPI). It implements persistent, non-blocking MPI requests using point-to-point (P2P) communication and organizes kernel calls to enable local computations while exchanging ghost states. This design approach allows PyFR to efficiently operate on clusters with heterogeneous architectures, combining CPUs and GPUs. Why is PyFR valuable for benchmarking clusters? PyFR's exceptional performance has been recognized by its selection as a finalist in the ACM Gordon Bell Prize for High-Performance Computing. It demonstrates strong-scaling capabilities by effectively utilizing low-latency inter-GPU communication and achieving strong-scaling on unstructured grids. PyFR has been successfully benchmarked with up to 18,000 NVIDIA K20X GPUs on Titan, showcasing its efficiency in handling large-scale simulations.
finite-element-analysisbenchmarkingparallelizationgithubfluid-dynamicsopenmpic++cudampi
0 Likes

Login to like
Type
tool
Level
Intermediate
Online Bachelor's in Data Science Program Guide - TechGuide
0
  • Online Data Science Degree Programs
The realm of data science is one that onlookers regard with curiosity and respect. There are a lot of unknowns in this area of study that only recently became hugely relevant. It is important to get the facts on how expertise in data science is transforming the world. This article features what a bachelor’s degree means in today’s market and the future.
machine-learningbig-datadata-analysisdata-science
0 Likes

Login to like
Type
website
Level
Intermediate
Using Dask on HPC Systems
0
  • Dask Tutorial Github Page
  • Video Recording of Tutorial - Part 1
  • Video Recording of Tutorial - Part 2
A tutorial on the effective use of Dask on HPC resources. The four-hour tutorial will be split into two sections, with early topics focused on novice Dask users and later topics focused on intermediate usage on HPC and associated best practices. The knowledge areas covered include (but are not limited to): Beginner section High-level collections including dask.array and dask.dataframe Distributed Dask clusters using HPC job schedulers Earth Science data analysis using Dask with Xarray Using the Dask dashboard to understand your computation Intermediate section Optimizing the number of workers and memory allocation Choosing appropriate chunk shapes and sizes for Dask collections Querying resource usage and debugging errors
trainingjupyterhubpython
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate
MDAnalysis - Python library for the analysis of molecular dynamics simulations
0
  • MDAnalysis
MDAnalysis is a python based library of tools for the analysis of molecular dynamics simulations. It is able to read and write many popular simulation formats including CHARMM, LAMMPS, GROMACS, and AMBER and more. This link contains the documentation pages of all MDAnalysis functions and has links to tutorials using Jupyter Notebooks.
computational-chemistrymaterials-sciencepython
0 Likes

Login to like
Type
tool
Level
Beginner, Intermediate, Advanced
Data Visualization Tools for Julia
0
  • Visualizations in Julia Using Plots.jl
  • Plotting Options Using Julia
Plots.jl is the most widely used plotting library for the Julia programming language. It's known for being especially powerful in its versatility and intuitiveness. It's limited set of dependencies and wide applicability across different graphics packages make it especially helpful in visualizing the results of your latest Julia implementation. However, there are still multiple options available for Julia programmers to visualize their datasets. The second link details a comparison against a variety of Julia packages.
plottingvisualizationjulia
0 Likes

Login to like
Type
tool
Level
Beginner, Intermediate

Pagination

  • Current page 1
  • Page 2
  • Page 3
  • Page 4
  • Page 5
  • Page 6
  • Page 7
  • Page 8
  • Next page ››
  • Last page Last »