Knowledge Base Resources

Contributed by cyberinfrastructure professionals (researchers, research computing facilitators, research software engineers and HPC system administrators), these resources are shared through the ConnectCI community platform. Add resources you find helpful!

Implementing Markov Processes with Julia

Markov Decision Processes in Julia

The following link provides an easy method of implementing Markov Decision Processes (MDPs) in the Julia programming language. MDPs are a mathematical framework for sequential decision making in stochastic situations where the actor has some level of control. For example, at a low level MDPs can be used to control an inverted pendulum, while in higher-level decision making they can decide when to take evasive action in air traffic management. MDPs can also be extended to the partially observable domain to form the Partially Observable Markov Decision Process (POMDP). The link contains a wealth of information showing how to implement basic POMDP and MDP algorithms and apply well-known online and offline solvers.
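To make the idea of an MDP concrete, here is a minimal value-iteration sketch in plain Python. It is not taken from the linked Julia material and does not use its API; the two states, the actions, the transition probabilities, the rewards, and the discount factor are all made-up assumptions for illustration.

```python
# Toy two-state MDP. Every state, action, probability, and reward below is hypothetical.
# transitions[state][action] = list of (probability, next_state, reward)
transitions = {
    "low": {
        "wait": [(1.0, "low", 0.0)],
        "charge": [(0.9, "high", -1.0), (0.1, "low", -1.0)],
    },
    "high": {
        "wait": [(1.0, "high", 0.0)],
        "work": [(0.7, "high", 2.0), (0.3, "low", 2.0)],
    },
}


def action_value(outcomes, values, gamma):
    """Expected discounted return of one action, given current state values."""
    return sum(p * (r + gamma * values[s2]) for p, s2, r in outcomes)


def value_iteration(transitions, gamma=0.9, tol=1e-8):
    """Repeat Bellman optimality backups until the values stop changing."""
    values = {s: 0.0 for s in transitions}
    while True:
        delta = 0.0
        for s, actions in transitions.items():
            best = max(action_value(o, values, gamma) for o in actions.values())
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            return values


def greedy_policy(transitions, values, gamma=0.9):
    """Pick, in each state, the action with the highest expected return."""
    return {
        s: max(actions, key=lambda a: action_value(actions[a], values, gamma))
        for s, actions in transitions.items()
    }


values = value_iteration(transitions)
policy = greedy_policy(transitions, values)
print(policy)
```

In this toy problem the greedy policy charges in the "low" state and works in the "high" state. Dedicated solver libraries handle much larger and partially observable problems; this sketch only shows the underlying backup step.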

ai machine-learning julia

Type

tool

Active inference textbook

Active Inference: The Free Energy Principle in Mind, Brain, and Behavior

This textbook is the first comprehensive treatment of active inference, an integrative perspective on brain, cognition, and behavior used across multiple disciplines including computational neuroscience, machine learning, artificial intelligence, and robotics. It was published in 2022 and is open access at this time. It should be educational for those who want to understand how the free energy principle is applied to the normative behavior of living organisms and who want to widen their knowledge of sequential decision making under uncertainty.

ai machine-learning neural-networks

Type

learning

Pandas - Python

Pandas Docs

pandas is a fast, powerful, flexible, and easy-to-use open source data analysis and manipulation tool, built on top of the Python programming language. It lets you store data in data frames that are easy to manage and display, with column names and data types.

documentation ai big-data data-analysis

Type

documentation

Handwritten Digits Tutorial in PyTorch

Handwritten Digits PyTorch

This tutorial is essentially the "hello world" of image recognition and feed-forward neural networks (using PyTorch). Using the MNIST database (filled with images of handwritten digits), the tutorial shows how to build a feed-forward neural network that can recognize handwritten digits. A solid understanding of feed-forward networks and back-propagation is recommended.
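As a rough picture of what a feed-forward pass over one digit image involves, here is a sketch in plain Python. It deliberately does not use PyTorch and is not the tutorial's code: the hidden-layer size, the random untrained weights, and the fake input image are assumptions. The only MNIST-specific facts used are the 28 x 28 = 784 pixel input and the 10 digit classes.

```python
import math
import random


def relu(vector):
    """Element-wise rectified linear unit."""
    return [max(0.0, x) for x in vector]


def softmax(vector):
    """Turn raw scores into probabilities (shifted by the max for stability)."""
    top = max(vector)
    exps = [math.exp(x - top) for x in vector]
    total = sum(exps)
    return [e / total for e in exps]


def dense(inputs, weights, biases):
    """Fully connected layer: one weight row and one bias per output unit."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases (untrained, for illustration only)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


rng = random.Random(0)
w1, b1 = init_layer(784, 16, rng)   # 784 pixels -> 16 hidden units (hypothetical size)
w2, b2 = init_layer(16, 10, rng)    # 16 hidden units -> 10 digit classes


def forward(pixels):
    """Input -> hidden layer with ReLU -> output layer with softmax."""
    hidden = relu(dense(pixels, w1, b1))
    return softmax(dense(hidden, w2, b2))


fake_image = [rng.random() for _ in range(784)]  # stand-in for a flattened MNIST image
probs = forward(fake_image)
predicted_digit = max(range(10), key=probs.__getitem__)
print(predicted_digit, round(sum(probs), 6))
```

Because the weights are untrained, the prediction is meaningless. Training would adjust `w1`, `b1`, `w2`, and `b2` by back-propagation, which a framework such as PyTorch automates.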

ai visualization deep-learning image-processing machine-learning neural-networks pytorch python

Type

website

How-To Video: Apply for an ACCESS Allocation

How-To Video

ACCESS Allocations website tour, and how to apply for allocations.

ACCESS-account ACCESS-allocations ACCESS-website adding-users allocation-management allocations-proposal

Type

video_link

Representation Learning in Deep Learning

Representation learning is a fundamental concept in machine learning and artificial intelligence, particularly in the field of deep learning. At its core, representation learning involves the process of transforming raw data into a form that is more suitable for a specific task or learning objective. This transformation aims to extract meaningful and informative features or representations from the data, which can then be used for various tasks like classification, clustering, regression, and more.

deep-learning image-processing machine-learning neural-networks

Type

documentation

Managing and Optimizing Your Jobs on HPC

An overview of tools and methods to manage and optimize jobs and HPC workflows.

memory optimization batch-jobs job-accounting job-submission resources slurm

Type

video_link

Advanced Compilers: The Self-Guided Online Course

Cornell's Advanced Compilers

This is a self-guided online course on compilers. The topics covered throughout the course include universal compiler topics like intermediate representations, data flow, and “classic” optimizations as well as more research-focused topics such as parallelization, just-in-time compilation, and garbage collection.

optimization parallelization training compiling

Type

learning

UCLA Extended Reality (XR) collaboration resources and Workshop

Extended Reality (XR) Resource workshop/Guide for Building collaboration

Comprehensive collaboration resources for building high-performance extended reality (XR), augmented reality (AR), virtual reality (VR), and mixed reality campus teams. The tags listed are a small subset of the topics covered.

documentation neural-networks

Type

presentation

EasyBuild Documentation

EasyBuild is a software installation framework that allows administrators to easily build and install software on high-performance computing (HPC) systems. It supports a wide range of software packages, toolchains, and compilers. Supported software is found in the EasyConfigs repository, one of several repositories in the EasyBuild project.

easybuild

Type

documentation

Texas A&M HPRC Training Site

Texas A&M Research Computing Training Resources

Training Resources and Courses offered by Texas A&M's Research Computing Group

ACES TAMU

Type

learning

Automated Machine Learning Book

Automated Machine Learning: Methods, Systems, Challenges

The authoritative book on automated machine learning, which allows practitioners without ML expertise to develop and deploy state-of-the-art machine learning approaches. It describes the background of the techniques used in detail, along with tools that are available for free.

ai data-analysis deep-learning machine-learning neural-networks python r

Type

learning

ACCESS KB Guide - Expanse

ACCESS KB Guide

Expanse at SDSC is a cluster designed by Dell and SDSC that delivers 5.16 peak petaflops and offers Composable Systems and Cloud Bursting. This documentation describes how to use the Expanse cluster, with some specific information for people with ACCESS accounts.

expanse composable-systems gpu

Type

documentation

CHARMM Links to Install, Run, and Troubleshoot MD Simulations

CHARMM (Chemistry at HARvard Macromolecular Mechanics) is a widely distributed molecular simulation program with a broad array of applications. CHARMM can set up and run simulations on both biological and materials systems, contains a comprehensive set of analysis tools, and has high performance on a variety of platforms. Here you will find links to the CHARMM website, forum, and registration/download page.

charmm molecular-dynamics namd computational-chemistry

Type

learning

Practical Machine Learning with Python

Regression forecasting and predicting - Practical Machine Learning Tutorial with Python p.5

This video series provides a holistic understanding of machine learning, covering the theory, application, and inner workings of supervised, unsupervised, and deep learning algorithms. It covers topics such as linear regression, K Nearest Neighbors, Support Vector Machines (SVM), flat clustering, hierarchical clustering, and neural networks. It goes over the high-level intuitions behind the algorithms and how they are logically meant to work, then applies them in code to real-world data sets using a module such as Scikit-Learn.
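As an illustration of the "inner workings" side of linear regression and forecasting, the following sketch fits a line by ordinary least squares in plain Python and predicts one step ahead. It is not the series' code and does not use Scikit-Learn; the data points are made up.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance of x and y, and variance of x (both unnormalized).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical observations: x could be a day index, y a measured value.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [3.1, 4.9, 7.2, 8.8, 11.0]

slope, intercept = fit_line(xs, ys)
forecast = slope * 6.0 + intercept  # predict the next, unseen x value
print(slope, intercept, forecast)
```

A library estimator wraps the same calculation behind a fit-and-predict interface and generalizes it to many input features.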

machine-learning programming python

Type

video_link

TensorFlow for Deep Neural Networks

TensorFlow Docs

TensorFlow is a powerful framework for deep learning, developed by Google. This resource covers its Python package, which is easy to use and can train very powerful models.

documentation faster tensorflow

Type

tool

Intro to Statistical Computing with Stan

The Stan language is used to specify a (Bayesian) statistical model with an imperative program calculating the log probability density function. Here are some useful links to start your exploration of this statistical programming language, and a Python interface to Stan.
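To show what "an imperative program calculating the log probability density function" can look like, here is a sketch in plain Python rather than Stan. The model (a normal likelihood with known scale and a wide normal prior on the mean), the data, and the grid search are all assumptions chosen for illustration; the accumulator is named `target` only to echo the running log density that a Stan program builds up.

```python
import math


def normal_lpdf(y, mu, sigma):
    """Log density of a normal distribution evaluated at y."""
    return (-0.5 * math.log(2.0 * math.pi) - math.log(sigma)
            - 0.5 * ((y - mu) / sigma) ** 2)


def log_density(mu, data, sigma=1.0, prior_mu=0.0, prior_sigma=10.0):
    """Unnormalized log posterior of mu: log prior plus log likelihood."""
    target = normal_lpdf(mu, prior_mu, prior_sigma)  # prior on the mean
    for y in data:
        target += normal_lpdf(y, mu, sigma)          # one term per observation
    return target


data = [4.8, 5.1, 5.3, 4.9]                # hypothetical measurements
grid = [i / 100 for i in range(0, 1001)]   # candidate means from 0.00 to 10.00
best_mu = max(grid, key=lambda mu: log_density(mu, data))
print(best_mu)
```

A sampler would explore this function to draw from the posterior instead of scanning a grid; the grid is used here only to keep the example short.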

data-analysis machine-learning monte-carlo python

Type

documentation

Online Bachelor's in Data Science Program Guide - TechGuide

Online Data Science Degree Programs

The realm of data science is one that onlookers regard with curiosity and respect. There are a lot of unknowns in this area of study that only recently became hugely relevant. It is important to get the facts on how expertise in data science is transforming the world. This article features what a bachelor’s degree means in today’s market and the future.

machine-learning big-data data-analysis data-science

Type

website

GPU Acceleration in Python

This tutorial explains how to use Python for GPU acceleration with libraries like CuPy, PyOpenCL, and PyCUDA. It shows how these libraries can speed up tasks like array operations and matrix multiplication by using the GPU. Examples include replacing NumPy with CuPy for large datasets and using PyOpenCL or PyCUDA for more control with custom GPU kernels. It focuses on practical steps to integrate GPU acceleration into Python programs.

machine-learning big-data data-analysis optimization parallelization gpu cuda python

Type

learning

Federated CI Resources

How do you add your institutional HPC cluster to the Open Science Grid (OSG)?

Discussion about contributing cycles to the Open Science Grid.

open-science-grid

Type

learning

Language models and using HPC resources

AI-Generated Text Detection In 2023

Documentation and research based on the latest NLP text generation detection methods for 2023.

natural-language-processing

Type

learning

Docker Tutorial for Beginners

A Docker tutorial for beginners is a course that teaches the basics of Docker, a containerization platform that allows you to package your application and its dependencies into a standardized unit for development, shipment, and deployment.

docker

Type

video_link

MOPAC

Examples of I/O Files for MOPAC

MOPAC (Molecular Orbital PACkage) is a semi-empirical quantum chemistry package used to compute molecular properties and structures by using approximations of the Schrödinger equation. This tutorial explains the process of using MOPAC for different forms of calculations.

computational-chemistry

Type

tool

Header-only C++ JSON library

JSON is a lightweight format for storing and transporting data, for example in a config file. This C++ library is header-only and has easy-to-read documentation.
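For readers unfamiliar with the format itself, the sketch below reads and rewrites a small JSON config using Python's standard `json` module. It illustrates JSON only, not the C++ library described above, and the config keys and values are hypothetical.

```python
import json

# A made-up config file body; every key and value here is illustrative only.
config_text = """
{
  "job_name": "demo",
  "nodes": 2,
  "modules": ["gcc", "openmpi"],
  "debug": false
}
"""

config = json.loads(config_text)   # parse JSON text into Python dicts and lists
config["nodes"] += 1               # edit a value like any other dictionary entry

# Serialize back to text; indent and sort_keys make the output easy to diff.
round_tripped = json.dumps(config, indent=2, sort_keys=True)
print(round_tripped)
```

JSON objects, arrays, numbers, strings, and booleans map onto native types in most languages, which is why the format is a common choice for config files.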

resources c++

Type

learning

Trusted CI Resources Page

A very helpful list of external resources from Trusted CI.

cybersecurity

Type

website