Skip to main content

Breadcrumb

  1. ACCESS Home
  2. Support
  3. Knowledge Base
  4. Knowledge Base Resources

Knowledge Base Resources

These resources are contributed by researchers, facilitators, engineers, and HPC admins. Please upvote resources you find useful!
Add a Resource

Filters

Topics

  • machine-learning (50)
  • ai (43)
  • training (40)
  • data-analysis (39)
  • deep-learning (28)
  • documentation (28)
  • big-data (26)
  • neural-networks (24)
  • workforce-development (21)
  • visualization (18)
  • professional-development (17)
  • parallelization (16)
  • community-outreach (14)
  • programming (14)
  • image-processing (13)
  • cybersecurity (12)
  • gpu (12)
  • r (12)
  • pytorch (11)
  • slurm (10)
  • c (9)
  • cloud-computing (9)
  • compiling (9)
  • mpi (9)
  • plotting (9)
  • administering-hpc (8)

Topics

  • machine-learning (50)
  • ai (43)
  • training (40)
  • data-analysis (39)
  • deep-learning (28)
  • documentation (28)
  • big-data (26)
  • neural-networks (24)
  • workforce-development (21)
  • visualization (18)
  • professional-development (17)
  • parallelization (16)
  • community-outreach (14)
  • programming (14)
  • image-processing (13)
  • cybersecurity (12)
  • gpu (12)
  • r (12)
  • pytorch (11)
  • slurm (10)
  • c (9)
  • cloud-computing (9)
  • compiling (9)
  • mpi (9)
  • plotting (9)
  • administering-hpc (8)

If you'd like to use more filters, please login to view them all.

Spack Documentation
0
  • Spack Documentation
  • Spack Home Page
Spack is a package manager for supercomputers that can help administrators install scientific software and libraries for multiple complex software stacks.
spack
0 Likes

Login to like
Type
documentation
Level
Intermediate
Using Dask on HPC Systems
0
  • Dask Tutorial Github Page
  • Video Recording of Tutorial - Part 1
  • Video Recording of Tutorial - Part 2
A tutorial on the effective use of Dask on HPC resources. The four-hour tutorial will be split into two sections, with early topics focused on novice Dask users and later topics focused on intermediate usage on HPC and associated best practices. The knowledge areas covered include (but are not limited to): Beginner section High-level collections including dask.array and dask.dataframe Distributed Dask clusters using HPC job schedulers Earth Science data analysis using Dask with Xarray Using the Dask dashboard to understand your computation Intermediate section Optimizing the number of workers and memory allocation Choosing appropriate chunk shapes and sizes for Dask collections Querying resource usage and debugging errors
trainingjupyterhubpython
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate
Language models and using HPC resources
0
  • AI-Generated Text Detection In 2023
Documentation and research based on the latest NLP text generation detection methods for 2023.
natural-language-processing
0 Likes

Login to like
Type
learning
Level
Intermediate
Fundamentals of Cloud Computing
0
  • Fundamentals of Cloud Computing
An introduction to Cloud Computing
cloud-computing
0 Likes

Login to like
Type
website
Level
Beginner
MPI Resources
0
  • Easy MPI Tutorial
  • Open MPI documentation
Workshop for beginners and intermediate students in MPI which includes helpful exercises. Open MPI documentation.
parallelizationmpi
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate
Use Windows Subsystem for Linux for HPC Command Line Access from Windows
0
  • Install Linux on Windows with WSL
Windows Subsystem for Linux (WSL) provides a Linux environment for Windows users to access HPC resources fast and efficiently.
workflowssh
0 Likes

Login to like
Type
tool
Level
Beginner
Set Up VSCode for Python and Github
0
  • VSCode for Python plus Github Integration
VSCode is a popular IDE that runs on Windows, MacOS, and Linux. This tutorial will explain how to get set up with VSCode to code in Python. It will also provide a tutorial on how to set up Github integration within VSCode.
gitpython
0 Likes

Login to like
Type
learning
Level
Big Data Research at the University of Colorado Boulder
0
  • Big Data Research at the University of Colorado Boulder
Background: Big data, defined as having high volume, complexity or velocity, have the potential to greatly accelerate research discovery. Such data can be challenging to work with and require research support and training to address technical and ethical challenges surrounding big data collection, analysis, and publication. Methods: The present study was conducted via a series of semi-structured interviews to assess big data methodologies employed by CU Boulder researchers across a broad sample of disciplines, with the goal of illuminating how they conduct their research; identifying challenges and needs; and providing recommendations for addressing them. Findings: Key results and conclusions from the study indicate: gaps in awareness of existing big data services provided by CU Boulder; open questions surrounding big data ethics, security and privacy issues; a need for clarity on how to attribute credit for big data research; and a preference for a variety of training options to support big data research.
big-data
0 Likes

Login to like
Type
documentation
Level
Beginner
Fine-tuning LLMs with PEFT and LoRA
0
  • Fine-tuning LLMs with PEFT and LoRA
As LLMs get larger fine-tuning to the full extent can become difficult to train on consumer hardware. Storing and deploying these tuned models can also be quite expensive and difficult to store. With PEFT (parameter -efficent fine tuning), it approaches fine-tune on a smaller scale of model parameters while freezing most parameters of the pretrained LLMs. Basically it is providing full performance that which is similar if not better than full fine tuning while only having a small number of trainable parameters. This source explains that as well as going over LORA diagrams and a code walk through.
fasteroptimizationperformance-tuningtuning
0 Likes

Login to like
Type
video_link
Level
Intermediate, Advanced
ACES: Charliecloud Containers for Scientific Workflows (Tutorial)
0
  • ACES: Charliecloud Containers for Scientific Workflows (Video)
  • ACES: Charliecloud Containers for Scientific Workflows (Slides)
This tutorial introduces the use of Containers using the Charliecloud software suite. This tutorial will provide participants with background and hands-on experience to use basic Charliecloud containers for HPC applications. We discuss what containers are, why they matter for HPC, and how they work. We'll give an overview of Charliecloud, the unprivileged container solution from Los Alamos National Laboratory's HPC Division. Students will learn how to build toy containers and containerize real HPC applications, and then run them on a cluster. Exercises are demonstrated using the ACES cluster, a composable accelerator testbed at Texas A&M University. Students with an allocation on the ACES cluster can follow along with the ACES-specific exercises.
ACESTAMUscratchlammpstensorflowopen-ondemandgpunfsslurmbashtrainingpythoncontainers
0 Likes

Login to like
Type
learning
Level
Beginner
Neurodesk
0
  • Neurodesk
Neurodesk provides a containerised data analysis environment to facilitate reproducible analysis of neuroimaging data. Analysis pipelines for neuroimaging data typically rely on specific versions of packages and software, and are dependent on their native operating system. These dependencies mean that a working analysis pipeline may fail or produce different results on a new computer, or even on the same computer after a software update. Neurodesk provides a platform in which anyone, anywhere, using any computer can reproduce your original research findings given the original data and analysis code.
psychologycontainerssoftware-installationversion-control
0 Likes

Login to like
Type
website
Level
Beginner, Intermediate, Advanced
ACCESS Events and Training
0
  • Events and Training
Listing of upcoming ACCESS related events and training activities.
professional-developmenttrainingworkforce-development
0 Likes

Login to like
Type
website
Level
Beginner
InsideHPC
0
  • InsideHPC HomePage
InsideHPC is an informational site offers videos, research papers, articles, and other resources focused on machine learning and quantum computing among other topics within high performance computing.
aimachine-learningcommunity-outreach
0 Likes

Login to like
Type
website
Level
Beginner, Intermediate, Advanced
Python
0
  • Introduction to Python - Texas A&M
Python course offered by Texas A&M HPRC
python
0 Likes

Login to like
Type
learning
Level
Beginner
Slurm Tutorials
0
  • Slurm Tutorials
Introduction to the Slurm Workload Manager for users and system administrators, plus some material for Slurm programmers.
administering-hpccluster-managementhpc-cluster-architecturetraining
0 Likes

Login to like
Type
learning
Level
Beginner
Data visualization with Matplotlib
0
  • Guide to data visualization with matplotlib
Data visualization is a critical aspect of data analysis. It allows for a clear and concise representation of data, making it easier for users to understand and interpret complex datasets. One of the most popular libraries for data visualization in Python is Matplotlib. The included website aims to provide a brief overview of Matplotlib, its features, and examples/exercises to dive deeper into its functionalities.
plottingvisualization
0 Likes

Login to like
Type
website
Level
Beginner
Master's in Data Science Program Guide - TechGuide
0
  • Masters in Data Science Program Guide
A master’s degree in data science helps prepare professionals to take the next career step. This article will focus primarily on data science, a graduate degree in this field, and a data scientist or data analyst career. With many employers preferring a master’s degree in data science for those seeking to fill roles as data scientists or analysts, we will discuss the data science master’s degree in detail.
big-datadata-analysisdata-science
0 Likes

Login to like
Type
website
Level
Advanced
Official Documentation of VisIt
0
  • Vislt github
  • Writing a File Format Reader
VisIt is a prominent open-source, interactive parallel visualization and graphical analysis tool predominantly used for viewing scientific data. Its GitHub repository offers a detailed insight into the software's source code, documentation, and contribution guidelines. In particular, it offers useful examples on how it
visItnovel-acceleratorsparticle-physics
0 Likes

Login to like
Type
documentation
Level
Intermediate, Advanced
marimo | a next generation python notebook
0
  • marimo | a next generation python notebook
Introduction seminar for new reactive python notebook from marimo ambassador.
aicomputer-graphicsplottingvisualizationbig-datadata-analysisgitpython
0 Likes

Login to like
Type
tool
Level
Beginner, Intermediate, Advanced
Ask.CI Q&A Platform for Research Computing
0
  • Ask.CI
resourcesprogramming-best-practices
0 Likes

Login to like
Type
website
Level
Beginner, Intermediate, Advanced
Python Data and Viz Training (CCEP Program)
0
5 Days of recordings of Python data analysis and visualization training.
data-sciencepython
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate
National Public Radio (NPR)
0
  • The right mentor can change your career. Here's how to find one.
Pluses and challenges of mentor selection. Offers tips for acquiring a mentor (finding, asking). And how to be a good mentee. SMART framework mentioned. Discrimination mentioned. Difference between mentor and sponsor underlined. More than one mentor encouraged. Good tips.
mentorshipprofessional-development
0 Likes

Login to like
Type
learning
Level
Beginner
Gaussian 16
0
  • Gaussian 16 HomePage
  • About Gaussian 16
Gaussian 16 is a computational chemistry package that is used in predicting molecular properties and understanding molecular behavior at a quantum mechanical level.
gaussiancomputational-chemistry
0 Likes

Login to like
Type
tool
Level
Intermediate, Advanced
Oakridge Leadership Computing Facility (OLCF) Training Events and Archive
0
  • OLCF User Training Main Site
  • OLCF Training Calendar
  • OLCF Training Archive
  • OLCF Training Github
Upcoming training events and archives of training materials detailing general HPC best practices as well as how to use OLCF resources and services.
training
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate, Advanced
Factor Graphs and the Sum-Product Algorithm
0
  • https://ieeexplore.ieee.org/document/910572
A tutorial paper that presents a generic message-passing algorithm, the sum-product algorithm, that operates in a factor graph. Following a single, simple computational rule, the sum-product algorithm computes either exactly or approximately various marginal functions derived from the global function. A wide variety of algorithms developed in artificial intelligence, signal processing, and digital communications can be derived as specific instances of the sum-product algorithm, including the forward/backward algorithm, the Viterbi algorithm, the iterative "turbo" decoding algorithm, Pearl's (1988) belief propagation algorithm for Bayesian networks, the Kalman filter, and certain fast Fourier transform (FFT) algorithms
ACCESS-accountaimachine-learning
0 Likes

Login to like
Type
documentation
Level
Intermediate

Pagination

  • First page « First
  • Previous page ‹‹
  • …
  • Page 4
  • Page 5
  • Page 6
  • Page 7
  • Current page 8
  • Page 9
  • Page 10
  • Page 11
  • Page 12
  • Next page ››
  • Last page Last »