Knowledge Base Resources

These resources are contributed by researchers, facilitators, engineers, and HPC admins. Please upvote resources you find useful!

Topics

  • machine-learning (50)
  • ai (45)
  • training (41)
  • data-analysis (40)
  • deep-learning (28)
  • documentation (28)
  • big-data (26)
  • neural-networks (24)
  • workforce-development (21)
  • professional-development (18)
  • visualization (18)
  • parallelization (16)
  • community-outreach (14)
  • programming (14)
  • image-processing (13)
  • cybersecurity (12)
  • gpu (12)
  • r (12)
  • pytorch (11)
  • slurm (10)
  • c (9)
  • cloud-computing (9)
  • compiling (9)
  • mpi (9)
  • plotting (9)
  • administering-hpc (8)


RMACC Systems Administrator Workshop Slides
  • RMACC Sys Admin Workshop '24 Slides
A compilation of the slides from the 2024 RMACC Sys Admin Workshop.

RMACC Sys Admin Workshop Schedule:

Tuesday
  • 12:00 PM Sign-in
  • 1:00 PM Introductions
  • 1:30 PM Lightning Talk - HPC Survival Guide
  • 2:00 PM Node Management - Scott Serr
  • 2:30 PM Lightning Talk - Warewulf
  • 3:00 PM Urgent HPC - Coltran Hophan-Nichols and Alexander Salois

Wednesday
  • 9:00 AM Breakfast
  • 10:00 AM Round Table Sites - BYU, INL, UMT, ASU, MSU
  • 11:00 AM Open OnDemand Setup - Dean Anderson
  • 11:30 AM Lightning Talk - Long-Term Hardware Support
  • 12:00 PM Lunch
  • 1:00 PM HPC Security - Matt Bidwell
  • 2:00 PM Lightning Talk - Security
  • 2:30 PM ACCESS Resources - Couso
  • 3:00 PM EasyBuild Tutorial - Alexander Salois
  • 3:30 PM General Q & A

Thursday
  • 9:00 AM Breakfast
  • 10:00 AM Lightning Talk - Containers and Virtual Machines
  • 11:00 AM University of Montana - Hellgate Site Tour
  • 11:30 AM Closing Remarks
administering-hpc, hpc-tools, cluster-support, hardware, hpc-cluster-architecture, hpc-operations, hpc-storage, networking, serverless-hpc, professional-development
Type
presentation
Level
Intermediate, Advanced
Numba: Compiler for Python
  • Numba Compiler
Numba is a just-in-time (JIT) compiler for Python that accelerates numerical and array operations, letting users improve application performance by writing high-performance functions in Python itself. It uses LLVM to translate pure Python code into optimized machine code, reaching speeds comparable to C, C++, and Fortran. Notable features include dynamic code generation at import time or at runtime, support for both CPU and GPU hardware, and integration with the Python scientific software ecosystem, particularly NumPy.
vectorization, optimization, performance-tuning, parallelization
Type
documentation
Level
Intermediate, Advanced
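
As a rough illustration of the Numba entry above, the sketch below compiles a plain Python loop with Numba's `njit` decorator. The function name `sum_of_squares` is made up for this example, and the `try`/`except` fallback (a do-nothing stand-in decorator) is an assumption added only so the snippet still runs where Numba is not installed; it is not part of Numba.

```python
try:
    from numba import njit  # Numba's nopython-mode JIT decorator
except ImportError:
    # Hypothetical fallback for this sketch only: a decorator that
    # returns the function unchanged when Numba is unavailable.
    def njit(func=None, **kwargs):
        if func is None:
            return lambda f: f
        return func


@njit
def sum_of_squares(n):
    # A plain numeric loop: the kind of code Numba can compile
    # to machine code instead of interpreting it.
    total = 0.0
    for i in range(n):
        total += i * i
    return total


if __name__ == "__main__":
    print(sum_of_squares(10))
```

With Numba installed, the first call triggers compilation, so any speedup shows up on later calls; actual gains depend on the workload and hardware.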
GPU Acceleration in Python
  • GPU Acceleration in Python
This tutorial explains how to use Python for GPU acceleration with libraries like CuPy, PyOpenCL, and PyCUDA. It shows how these libraries can speed up tasks like array operations and matrix multiplication by using the GPU. Examples include replacing NumPy with CuPy for large datasets and using PyOpenCL or PyCUDA for more control with custom GPU kernels. It focuses on practical steps to integrate GPU acceleration into Python programs.
machine-learning, big-data, data-analysis, optimization, parallelization, gpu, cuda, python
Type
learning
Level
Beginner, Intermediate
RMACC Website
  • RMACC.org
Rocky Mountain Advanced Computing Consortium Website
community-outreach
Type
website
Level
Beginner, Intermediate, Advanced
AWS Tutorial For Beginners
  • AWS Tutorial For Beginners
An AWS Tutorial for Beginners is a course that teaches the basics of Amazon Web Services (AWS), a cloud computing platform that offers a wide range of services, including compute, storage, networking, databases, analytics, machine learning, and artificial intelligence.
aws
Type
video_link
Level
Beginner, Intermediate
GDAL Multi-threading
  • GDAL Multi-threading
Multi-threading guidance when using GDAL.
parallelization, gis
Type
learning
Level
Intermediate
Introductory Python Lecture Series
  • Python Handbook Series
A lecture series with accompanying notes for teaching introductory Python. It starts with how to download and begin using Python, then moves on to basic syntax for lists, arrays, loops, and methods.
documentation, programming, python
Type
learning
Level
Beginner
Official Documentation for PyTorch and NumPy
  • Official PyTorch Documentation
  • Official NumPy Documentation
The official documentation for PyTorch, a tensor-based machine learning framework, and NumPy, whose ndarrays are useful for building tensors when implementing neural networks. Both libraries can be installed with pip.
deep-learning, neural-networks, pytorch, python
Type
documentation
Level
Beginner
Using Dask on HPC Systems
  • Dask Tutorial Github Page
  • Video Recording of Tutorial - Part 1
  • Video Recording of Tutorial - Part 2
A tutorial on the effective use of Dask on HPC resources. The four-hour tutorial will be split into two sections, with early topics focused on novice Dask users and later topics focused on intermediate usage on HPC and associated best practices. The knowledge areas covered include (but are not limited to):

Beginner section
  • High-level collections including dask.array and dask.dataframe
  • Distributed Dask clusters using HPC job schedulers
  • Earth Science data analysis using Dask with Xarray
  • Using the Dask dashboard to understand your computation

Intermediate section
  • Optimizing the number of workers and memory allocation
  • Choosing appropriate chunk shapes and sizes for Dask collections
  • Querying resource usage and debugging errors
training, jupyterhub, python
Type
learning
Level
Beginner, Intermediate
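
One way to reason about the chunk-size topic in the Dask entry above is simple arithmetic: how many bytes one chunk occupies and how many chunks an array splits into. The sketch below is only that arithmetic. The helper names `chunk_nbytes` and `n_chunks` are hypothetical, the default 8-byte item size assumes float64 data, and nothing here comes from the tutorial itself or calls the Dask API.

```python
from math import prod


def chunk_nbytes(chunk_shape, itemsize=8):
    # Memory footprint of one chunk: element count times bytes per element.
    # itemsize=8 assumes float64 values (an assumption of this sketch).
    return prod(chunk_shape) * itemsize


def n_chunks(array_shape, chunk_shape):
    # Number of chunks along each axis is a ceiling division;
    # the total is the product across axes.
    return prod(-(-a // c) for a, c in zip(array_shape, chunk_shape))


if __name__ == "__main__":
    array_shape = (10_000, 10_000)
    chunk_shape = (1_000, 1_000)
    print(chunk_nbytes(chunk_shape) / 2**20, "MiB per chunk")
    print(n_chunks(array_shape, chunk_shape), "chunks in total")
```

Very small chunks mean many tasks for the scheduler to track, while very large chunks may not fit in a worker's memory, so a quick estimate like this can help when comparing candidate chunk shapes.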
C Programming
  • C Programming Notes
"These notes are part of the UW Experimental College course on Introductory C Programming. They are based on notes prepared (beginning in Spring, 1995) to supplement the book The C Programming Language, by Brian Kernighan and Dennis Ritchie, or K&R as the book and its authors are affectionately known. (The second edition was published in 1988 by Prentice-Hall, ISBN 0-13-110362-8.) These notes are now (as of Winter, 1995-6) intended to be stand-alone, although the sections are still cross-referenced to those of K&R, for the reader who wants to pursue a more in-depth exposition."

C is a low-level programming language, and learning it provides a deep understanding of how a computer's memory and hardware work. This knowledge can be valuable when optimizing apps for performance or when dealing with resource-constrained environments. C is often used as the foundation for creating cross-platform libraries and frameworks. Learning C can allow you to develop libraries that can be used across different platforms, including iOS, Android, and desktop environments.
c, c++, compiling, programming, programming-best-practices
Type
learning
Level
Beginner
MPI Resources
  • Easy MPI Tutorial
  • Open MPI documentation
A workshop for beginner and intermediate MPI students that includes helpful exercises, along with the Open MPI documentation.
parallelization, mpi
Type
learning
Level
Beginner, Intermediate
RaftLib: Open Source library for concurrent data processing pipelines
  • RaftLib
RaftLib is an open-source C++ library that provides a framework for implementing parallel and concurrent data processing pipelines. It is designed to simplify the development of high-performance data processing applications by abstracting away the complexities of parallelism, concurrency, and data flow management. It enables stream/data-flow parallel computation by linking parallel compute kernels together with simple right-shift operators (>>), similar to the way C++ streams are used for string manipulation. RaftLib eliminates the need for explicit use of traditional threading libraries such as pthreads, std::thread, or OpenMP, which can lead to non-deterministic behavior when misused.
parallelization, pthreads, openmp
Type
tool
Level
Intermediate, Advanced
fast.ai
  • fast.ai Homepage
fast.ai offers many resources for people working with machine learning and artificial intelligence, including tutorials on PyTorch, its own library built on PyTorch, news articles, and other material for exploring the field.
ai, machine-learning, pytorch, training
Type
website
Level
Beginner, Intermediate, Advanced
Use Windows Subsystem for Linux for HPC Command Line Access from Windows
  • Install Linux on Windows with WSL
Windows Subsystem for Linux (WSL) provides a Linux environment that lets Windows users access HPC resources quickly and efficiently.
workflow, ssh
Type
tool
Level
Beginner
Big Data Research at the University of Colorado Boulder
  • Big Data Research at the University of Colorado Boulder
Background: Big data, defined as having high volume, complexity or velocity, have the potential to greatly accelerate research discovery. Such data can be challenging to work with and require research support and training to address technical and ethical challenges surrounding big data collection, analysis, and publication.

Methods: The present study was conducted via a series of semi-structured interviews to assess big data methodologies employed by CU Boulder researchers across a broad sample of disciplines, with the goal of illuminating how they conduct their research; identifying challenges and needs; and providing recommendations for addressing them.

Findings: Key results and conclusions from the study indicate: gaps in awareness of existing big data services provided by CU Boulder; open questions surrounding big data ethics, security and privacy issues; a need for clarity on how to attribute credit for big data research; and a preference for a variety of training options to support big data research.
big-data
Type
documentation
Level
Beginner
The Official Documentation of Pandas
  • pandas documentation
Pandas is one of the most essential Python libraries for data analysis and manipulation. It provides high-performance, easy-to-use data structures and data analysis tools for the Python programming language. The official documentation serves as an in-depth guide to the library, including explanations and examples.
plotting, visualization
Type
documentation
Level
Beginner, Intermediate
Language models and using HPC resources
  • AI-Generated Text Detection In 2023
Documentation and research on NLP methods for detecting AI-generated text, current as of 2023.
natural-language-processing
Type
learning
Level
Intermediate
Neural Networks in Julia
  • Neural Networks in Julia using Flux.jl
Making a neural network has never been easier! The link points to the Flux.jl package, a straightforward way to program neural networks in the Julia programming language. The contributor describes Julia as the fastest-growing language for AI/ML and Flux.jl as a faster alternative to Python's TensorFlow and PyTorch, with 100% native Julia code and GPU support.
ai, deep-learning, machine-learning, neural-networks, julia
Type
tool
Level
Intermediate, Advanced
Neurodesk
  • Neurodesk
Neurodesk provides a containerised data analysis environment to facilitate reproducible analysis of neuroimaging data. Analysis pipelines for neuroimaging data typically rely on specific versions of packages and software, and are dependent on their native operating system. These dependencies mean that a working analysis pipeline may fail or produce different results on a new computer, or even on the same computer after a software update. Neurodesk provides a platform in which anyone, anywhere, using any computer can reproduce your original research findings given the original data and analysis code.
psychology, containers, software-installation, version-control
Type
website
Level
Beginner, Intermediate, Advanced
ACCESS Events and Training
  • Events and Training
A listing of upcoming ACCESS-related events and training activities.
professional-development, training, workforce-development
Type
website
Level
Beginner
Workshop on LangChain and GPT
  • Zoom Recording of Workshop on LangChain and GPT
  • Code
  • Data

This interactive workshop introduces participants to the power of GPT and LangChain for solving domain-specific scientific challenges. Participants will learn how to use these tools to address real research problems, such as predicting molecular properties or analyzing large-scale datasets in genomics. Through guided tutorials and hands-on project development, attendees will leave with a working application tailored to their own research needs.

ai, llm, data-analysis, python
Type
video_link
Level
Beginner
MATLAB bioinformatics toolbox
  • https://www.mathworks.com/products/bioinfo.html
Bioinformatics Toolbox provides algorithms and apps for Next Generation Sequencing (NGS), microarray analysis, mass spectrometry, and gene ontology. Using toolbox functions, you can read genomic and proteomic data from standard file formats such as SAM, FASTA, CEL, and CDF, as well as from online databases such as the NCBI Gene Expression Omnibus and GenBank.
visualization, data-analysis, bioinformatics, genomics, matlab
Type
tool
Level
Beginner, Intermediate, Advanced
Spack Documentation
  • Spack Documentation
  • Spack Home Page
Spack is a package manager for supercomputers that can help administrators install scientific software and libraries for multiple complex software stacks.
spack
Type
documentation
Level
Intermediate
Slurm Tutorials
  • Slurm Tutorials
Introduction to the Slurm Workload Manager for users and system administrators, plus some material for Slurm programmers.
administering-hpc, cluster-management, hpc-cluster-architecture, training
Type
learning
Level
Beginner
