- AHPCC documentary0This link is a documentary website to use AHPCC.
- Understanding LLM Fine-tuning0With the recent uprising of LLM's many business are looking at way to adopt these LLMs and fine-tuning these models on specfic data sets to ensure accuracy. These models when fine-tuned can be optimal for fulfilling the specific needs of a company. This site explains explicitly when, how, and why models should be trained. It goes over various strategies for LLM fine -tuning.
- Active inference textbook0This textbook is the first comprehensive treatment of active inference, an integrative perspective on brain, cognition, and behavior used across multiple disciplines including computational neurosciences, machine learning, artificial intelligence, and robotics. It was published in 2022 and it's open access at this time. The contents in this textbook should be educational to those who want to understand how the free energy principle is applied to the normative behavior of living organisms and who want to widen their knowledge of sequential decision making under uncertainty.
- Open-Source Server Virtualization Platform0Proxmox Virtual Environment is a hyper-converged infrastructure open-source software. It is a hosted hypervisor that can run operating systems including Linux and Windows on x64 hardware.
- Why Mentoring Matters and How to Get Started0Describes effective mentorship (both ways).
- Spatial Data Science in the Cloud (Alpine HPC) using Python0Spatial Data Science is a growing field across a wide range of industries and disciplines. The open-source programming language Python has many libraries that support spatial analysis, but what do you do when your computer is unable to tackle the massive file sizes of high-resolution data and the computing power required in your analysis? There materials have been prepared to teach you spatial data science and how to execute your analysis using a high-performance computer (HPC).
- Trusted CI Resources Page0Very helpful list of external resources from Trusted CI
- Machine Learning in Astrophysics0Machine learning is becoming increasingly important in field with large data such as astrophysics. AstroML is a Python module for machine learning and data mining built on numpy, scipy, scikit-learn, matplotlib, and astropy allowing for a range of statistical and machine learning routines to analyze astronomical data in Python. In particular, it has loaders for many open astronomical datasets with examples on how to visualize such complicated and large datasets.
- File management of Visual Studio Code on clusters0Visual Studio Code, commonly known as VSCode, is a popular tool used by programmers worldwide. It serves as a text editor and an Integrated Development Environment (IDE) that supports a wide variety of programming languages. One of its key features is its extensive library of extensions. These extensions add on to the basic functionalities of VSCode, making coding more efficient and convenient. However, there's a catch. When these extensions are installed and used frequently, they generate a multitude of files. These files are typically stored in a folder named .vscode-extension within your home directory. On a cluster computing facility such as the FASTER and Grace clusters at Texas A&M University, there's a limitation on how many files you can have in your home directory. For instance, the file number limit could be 10000, while the .vscode-extension directory can hold around 4000 temporary files even with just a few extensions. Thus, if the number of files in your home directory surpasses this limit due to VSCode extensions, you might face some issues. This restriction can discourage users from taking full advantage of the extensive features and extensions offered by the VSCode editor. To overcome this, we can shift the .vscode-extension directory to the scratch space. The scratch space is another area in the cluster where you can store files and it usually has a much higher limit on the number of files compared to the home directory. We can perform this shift smoothly using a feature called symbolic links (or symlinks for short). Think of a symlink as a shortcut or a reference that points to another file or directory located somewhere else. Here's a step-by-step guide on how to move the .vscode-extension directory to the scratch space and create a symbolic link to it in your home directory: 1. Copy the .vscode-extension directory to the scratch space: Using the cp command, you can copy the .vscode-extension directory (along with all its contents) to the scratch space. Here's how: cp -r ~/.vscode-extension /scratch/user Don't forget to replace /scratch/user with the actual path to your scratch directory. 2. Remove the original .vscode-extension directory: Once you've confirmed that the directory has been copied successfully to the scratch space, you can remove the original directory from your home space. You can do this using the rm command: rm -r ~/.vscode-extension It's important to make sure that the directory has been copied to the scratch space successfully before deleting the original. 3. Create a symbolic link in the home directory: Lastly, you'll create a symbolic link in your home directory that points to the .vscode-extension directory in the scratch space. You can do this as follows: ln -s /scratch/user/.vscode-extension ~/.vscode-extension By following this process, all the files generated by VSCode extensions will be stored in the scratch space. This prevents your home directory from exceeding its file limit. Now, when you access ~/.vscode-extension, the system will automatically redirect you to the directory in the scratch space, thanks to the symlink. This method ensures that you can use VSCode and its various extensions without worrying about hitting the file limit in your home directory.
- AI for improved HPC research - Cursor and Termius - Powerpoint0These slides provide an introduction on how Termius and Cursor, two new and freemium apps that use AI to perform more efficient work, can be used for faster HPC research.
- Navier-Stokes Cahn-Hilliard (NSCH) for MOOSE Framework0The MOOSE Navier-Stokes Cahn-Hilliard (NSCH) application is a library for implementing simulation tools that solve the Navier-Stokes Cahn-Hilliard equations with non-matching densities using Galerkin finite element methods with a residual-based stabilization scheme.
- Geocomputation with R (Free Reference Book)0Below is a link for a book that focuses on how to use "sf" and "terra" packages for GIS computations. As of 5/1/2023, this book is up to date and examples are error free. The book has a lot of information but provides a good overview and example workflows on how to use these tools.
- Performance Engineering Of Software Systems0A class from MITOpenCourseware that gives a hands on approach to building scalable and high-performance software systems. Topics include performance analysis, algorithmic techniques for high performance, instruction-level optimizations, caching optimizations, parallel programming, and building scalable systems.
- Introduction to GPU/Parallel Programming using OpenACC0Introduction to the basics of OpenACC.
- Gesture Classifier Model using MediaPipe0MediaPipe is Google's open-source framework for building multimodal (e.g., video, audio, etc.) machine learning pipelines. It is highly efficient and versatile, making it perfect for tasks like gesture recognition. This is a tutorial on how to make a custom model for gesture recognition tasks based on the Google MediaPipe API. This tutorial is specifically for video-playback, though could be generalized to image and live-video feed recognition.
- UNIX/command line basics tutorial0Introductory training materials for working on the UNIX command line.
- Fundamentals of R Programming0This course is an introduction to the R programming language and covers the fundamental concepts needed to operate in the R environment. This course was taught for the ACCESS community on September 26, 2023, but the materials for the course are still available on the ACES cluster and can be completed independently. All materials are presented as learnR notebooks and cover several topics, including data types, variables, built-in functions, data structures, and plotting.
- AI Institutes Cyberinfrastructure Documents: SAIL Meeting0Materials from the SAIL meeting (https://aiinstitutes.org/2023/06/21/sail-2023-summit-for-ai-leadership/). A space where AI researchers can learn about using ACCESS resources for AI applications and research.
- A Practical Introduction to pymdp: A Two-Part Guide to Active Inference in Python0
- Active Inference ModelStream 007.1 ~ Conor Heins & Daphne Demekas ~ pymdp
- Active Inference ModelStream 007.2 ~ pymdp
Here are descriptions for a two-part video series from the Active Inference Institute. These videos introduce pymdp, a powerful Python package for researchers and developers working with active inference and the Free Energy Principle. 🧠The first video provides a high-level overview of the package, while the second dives into a practical coding demonstration. Together, they offer a comprehensive guide from theory to application.
Active Inference ModelStream 007.1 ~ Conor Heins & Daphne Demekas ~ pymdp
This video provides an overview of pymdp, a Python package designed for active inference. Active inference is a framework for modeling decision-making and planning based on the Free Energy Principle, which posits that intelligent agents act to minimize surprise or uncertainty about their world. The pymdp package allows researchers to simulate how agents achieve this through both perception and action. It was created as a more accessible, user-friendly alternative to traditional MATLAB tools. The package is modular, integrates with other Python libraries like PyTorch, and has future plans to incorporate JAX for deep learning applications, making it a powerful tool for neuroscience and AI researchers studying the Free Energy Principle.
Active Inference ModelStream 007.2 ~ pymdp
This video, the second part of a series, demonstrates how to implement an active inference agent using the pymdp library. The presentation focuses on building a generative model for a contextual multi-armed bandit task. Key concepts explained include the four main components of a POMDP—the A, B, C, and D matrices—which represent the agent's beliefs. The video also introduces the concept of factorized state spaces to manage computational complexity and details how an agent can learn by updating its model parameters based on experience. The ultimate goal is to show how these agents, following the Free Energy Principle, can engage in both optimal and information-seeking behavior.
- FreeSurfer Tutorials0The official MGH / Harvard tutorial page for FreeSurfer. The FreeSurfer group has provided and designed a series of tutorials for using FreeSurfer and for getting acquainted with the concepts needed to perform its various modes of analysis and processing of MRI data. The tutorials are designed to be followed along in a terminal window where commands can be copy/pasted instead of typed.
- Intro to Statistical Computing with Stan0
- https://mc-stan.org/users/documentation/
- https://vasishth.github.io/bayescogsci/book/ch-introstan.html
- https://pystan.readthedocs.io/en/latest/
The Stan language is used to specify a (Bayesian) statistical model with an imperative program calculating the log probability density function. Here are some useful links to start your exploration of this statistical programming language, and a Python interface to Stan. - Workshop on LangChain and GPT0
This interactive workshop introduces participants to the power of GPT and LangChain for solving domain-specific scientific challenges. Participants will learn how to use these tools to address real research problems, such as predicting molecular properties or analyzing large-scale datasets in genomics. Through guided tutorials and hands-on project development, attendees will leave with a working application tailored to their own research needs.
- OpenMP and Multithreaded Jobs in GRASS0Techniques and support for multithreaded geospatial data processing in GRASS.