Knowledge Base Resources

Use these links “vetted” by the community. Additional CI links are always welcome.

Resource to active inference

Active inference institute website

Active inference is an emerging study field in machine learning and computational neuroscience. This website in particular introduces "active inference institute", which has established a couple of years ago, and contains a wide variety of resources for understanding the theory of active inference and for participating a worldwide active inference community.

0 Likes

Type

website

Level

Flag as

Numpy - a Python Library

NumPY Docs

Numpy is a python package that leverages types and compiled C code to make many math operations in Python efficient. It is especially useful for matrix manipulation and operations.

documentation big-data data-analysis deep-learning opencv pytorch tensorflow data-science

0 Likes

Type

tool

Level

Flag as

Implementing Markov Processes with Julia

Markov Decision Processes in Julia

The following link provides an easy method of implementing Markov Decision Processes (MDP) in the Julia computing language. MDPs are a class of algorithms designed to handle stochastic situations where the actor has some level of control. For example, used at a low level, MDPs can be used to control an inverted pendulum, but applied in higher level decision making the can also decide when to take evasive action in air traffic management. MDPs can also be extended to the partially observable domain to form the Partially Observable Markov Decision Process (POMDP). This link contains a wealth of information to show one can easily implement basic POMDP and MDP algorithms and apply well known online and offline solvers.

ai machine-learning julia

0 Likes

Type

tool

Level

Flag as

Rockfish at Johns Hopkins University

Rockfish Resources and Documentation

Resources and User Guide available at Rockfish

rockfish

0 Likes

Type

documentation

Level

Flag as

QGIS Processing Executor

QGIS processing from the command line

Running QGIS tools from the command line

gis

0 Likes

Type

documentation

Level

Flag as

MATLAB bioinformatics toolbox

https://www.mathworks.com/products/bioinfo.html

Bioinformatics Toolbox provides algorithms and apps for Next Generation Sequencing (NGS), microarray analysis, mass spectrometry, and gene ontology. Using toolbox functions, you can read genomic and proteomic data from standard file formats such as SAM, FASTA, CEL, and CDF, as well as from online databases such as the NCBI Gene Expression Omnibus and GenBank.

visualization data-analysis bioinformatics genomics matlab

0 Likes

Type

tool

Level

Flag as

What is fairness in ML?

Building ML models for everyone: understanding fairness in machine learning

This article discusses the importance of fairness in machine learning and provides insights into how Google approaches fairness in their ML models. The article covers several key topics: Introduction to fairness in ML: It provides an overview of why fairness is essential in machine learning systems, the potential biases that can arise, and the impact of biased models on different communities. Defining fairness: The article discusses various definitions of fairness, including individual fairness, group fairness, and disparate impact. It explains the challenges in achieving fairness due to trade-offs and the need for thoughtful considerations. Addressing bias in training data: It explores how biases can be present in training data and offers strategies to identify and mitigate these biases. Techniques like data preprocessing, data augmentation, and synthetic data generation are discussed. Fairness in ML algorithms: The article examines the potential biases that can arise from different machine learning algorithms, such as classification and recommendation systems. It highlights the importance of evaluating and monitoring models for fairness throughout their lifecycle. Fairness tools and resources: It showcases various tools and resources available to practitioners and developers to help measure, understand, and mitigate bias in machine learning models. Google's TensorFlow Extended (TFX) and What-If Tool are mentioned as examples. Google's approach to fairness: The article highlights Google's commitment to fairness and the steps they take to address fairness challenges in their ML models. It mentions the use of fairness indicators, ongoing research, and partnerships to advance fairness in AI. Overall, the article provides a comprehensive overview of fairness in machine learning and offers insights into Google's approach to building fair ML models.

ai visualization data-analysis deep-learning machine-learning

0 Likes

Type

documentation

Level

Flag as

Set Up VSCode for Python and Github

VSCode for Python plus Github Integration

VSCode is a popular IDE that runs on Windows, MacOS, and Linux. This tutorial will explain how to get set up with VSCode to code in Python. It will also provide a tutorial on how to set up Github integration within VSCode.

git python

0 Likes

Type

learning

Level

Flag as

Fine-tuning LLMs with PEFT and LoRA

Fine-tuning LLMs with PEFT and LoRA

As LLMs get larger fine-tuning to the full extent can become difficult to train on consumer hardware. Storing and deploying these tuned models can also be quite expensive and difficult to store. With PEFT (parameter -efficent fine tuning), it approaches fine-tune on a smaller scale of model parameters while freezing most parameters of the pretrained LLMs. Basically it is providing full performance that which is similar if not better than full fine tuning while only having a small number of trainable parameters. This source explains that as well as going over LORA diagrams and a code walk through.

faster optimization performance-tuning tuning

0 Likes

Type

video_link

Level

Flag as

MDAnalysis - Python library for the analysis of molecular dynamics simulations

MDAnalysis

MDAnalysis is a python based library of tools for the analysis of molecular dynamics simulations. It is able to read and write many popular simulation formats including CHARMM, LAMMPS, GROMACS, and AMBER and more. This link contains the documentation pages of all MDAnalysis functions and has links to tutorials using Jupyter Notebooks.

computational-chemistry materials-science python

0 Likes

Type

tool

Level

Flag as

Fundamentals of Cloud Computing

Fundamentals of Cloud Computing

An introduction to Cloud Computing

cloud-computing

0 Likes

Type

website

Level

Flag as

ACCESS Getting Started Quick-Guide

Getting Started Quick-Guide

A step-by-step guide to getting your first allocation for Access computing and storage resources.

access-account ACCESS-credits allocations-proposal

0 Likes

Type

website

Level

Flag as

Trusted CI Resources Page

Trusted CI Resources Page

Very helpful list of external resources from Trusted CI

cybersecurity

0 Likes

Type

website

Level

Flag as

ACCESS KB Guide - Expanse

ACCESS KB Guide

Expanse at SDSC is a cluster designed by Dell and SDSC delivering 5.16 peak petaflops, and offers Composable Systems and Cloud Bursting.

expanse composable-systems gpu

0 Likes

Type

documentation

Level

Flag as

UNIX/command line basics tutorial

UNIX/command line basics tutorial

Introductory training materials for working on the UNIX command line.

bash

0 Likes

Type

learning

Level

Flag as

Mechanism and Implementation of Various MPI Libraries

There is a detailed explanation about communication routines and managing methods of different MPI libraries, as well as several exercises designed for users to get familiar with the implementation of MPI build process.

compiling mpi

0 Likes

Type

website

Level

Flag as

Data visualization with Matplotlib

Guide to data visualization with matplotlib

Data visualization is a critical aspect of data analysis. It allows for a clear and concise representation of data, making it easier for users to understand and interpret complex datasets. One of the most popular libraries for data visualization in Python is Matplotlib. The included website aims to provide a brief overview of Matplotlib, its features, and examples/exercises to dive deeper into its functionalities.

plotting visualization

0 Likes

Type

website

Level

Flag as

ACES: Charliecloud Containers for Scientific Workflows (Tutorial)

This tutorial introduces the use of Containers using the Charliecloud software suite. This tutorial will provide participants with background and hands-on experience to use basic Charliecloud containers for HPC applications. We discuss what containers are, why they matter for HPC, and how they work. We'll give an overview of Charliecloud, the unprivileged container solution from Los Alamos National Laboratory's HPC Division. Students will learn how to build toy containers and containerize real HPC applications, and then run them on a cluster. Exercises are demonstrated using the ACES cluster, a composable accelerator testbed at Texas A&M University. Students with an allocation on the ACES cluster can follow along with the ACES-specific exercises.

ACES TAMU scratch lammps tensorflow ondemand gpu nfs slurm bash training python containers

0 Likes

Type

learning

Level

Flag as

NITRC

NITRC

The Neuroimaging Tools and Resources Collaboratory (NITRC) is a neuroimaging informatics knowledge environment for MR, PET/SPECT, CT, EEG/MEG, optical imaging, clinical neuroinformatics, imaging genomics, and computational neuroscience tools and resources.

data-analysis image-processing data-sharing

0 Likes

Type

website

Level

Flag as

Raftlib: Open Source library for concurrent data processing pipelines

RaftLib

Raftlib is an open-source C++ Library that provides a framework for implementing parallel and concurrent data processing pipelines. It is designed to simplify the development of high-performance data processing applications by abstracting away the complexities of parallelism, concurrency, and data flow management. It enables stream/data-flow parallel computation by linking parallel compute kernels together using simple right shift operators, similar to C++ streams for string manipulation. RaftLib eliminates the need for explicit usage of traditional threading libraries such as pthreads, std::thread, or OpenMP, which can lead to non-deterministic behavior when misused.

parallelization pthreads openmp

0 Likes

Type

tool

Level

Flag as

InsideHPC

InsideHPC HomePage

InsideHPC is an informational site offers videos, research papers, articles, and other resources focused on machine learning and quantum computing among other topics within high performance computing.

ai machine-learning community-outreach

0 Likes

Type

website

Level

Flag as