- PetIGA, an open-source code for isogeometric analysis0This documentation provides an overview of the PetIGA framework, an open source code for solving multiphysics problems with isogeometric analysis. The documentation covers some simple tutorials and examples to help users get started with the framework and apply it to solve real-world problems in continuum mechanics, including solid and fluid mechanics.
- Fairness and Machine Learning0The "Fairness and Machine Learning" book offers a rigorous exploration of fairness in ML and is suitable for researchers, practitioners, and anyone interested in understanding the complexities and implications of fairness in machine learning.
- Pandas - Python0pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language. It lets you store data in easy to manage and display data frames, with column names and datatypes.
- United Nations Mentor Handbook0The United Nations (UN) is an international organization comprising 193 Member States, including the United States. As a global organization, the UN is the one place on Earth where the world's nations can gather to discuss common problems and find shared solutions that benefit all humanity. This handbook has been produced for UN staff of all backgrounds and levels and provides an overview of how to approach your participation in a mentorship program. This resource is quickly digestible and provides a basic structure that will be helpful to review before the first meeting with your mentee.
- Paraview UArizona HPC links (beginner)0
- University of Arizona Visualization homepage
- Getting Started with Paraview
- Paraview Cameras and Keyframes
- Graphs and Data Exporting
- Visualizing netcdf files
These links take you to visualization resources supported by the University of Arizona's HPC visualization consultant (rtdatavis.github.io). The following links are specific to the Paraview program and the workflows that have been used my researchers at the U of Arizona. Some of the pages linked are very beginner friendly: getting started, working with cameras and keyframes for rendering, visualizing external files (netcdf climate data), graphs and data exporting. Many of the workflows involve using remote desktops via the Open On Demand interface, but if this isn't set up at your university you can use paraview locally on a desktop. Feel free to post on access ci https://ask.cyberinfrastructure.org/ if you need assistance getting a paraview gui open for your work on HPC. - Intro to Statistical Computing with Stan0
- https://mc-stan.org/users/documentation/
- https://vasishth.github.io/bayescogsci/book/ch-introstan.html
- https://pystan.readthedocs.io/en/latest/
The Stan language is used to specify a (Bayesian) statistical model with an imperative program calculating the log probability density function. Here are some useful links to start your exploration of this statistical programming language, and a Python interface to Stan. - AI powered VsCode Editor0**Cursor: The AI-Powered Code Editor** Cursor is a cutting-edge, AI-first code editor designed to revolutionize the way developers write, debug, and understand code. Built upon the premise of pair-programming with artificial intelligence, Cursor harnesses the capabilities of advanced AI models to offer real-time coding assistance, bug detection, and code generation. **How Cursor Benefits High-Performance Computing (HPC) Work:** 1. **Efficient Code Development:** With AI-assisted code generation, researchers and developers in the HPC realm can quickly write optimized code for simulations, data processing, or modeling tasks, reducing the time to deployment. 2. **Debugging Assistance:** Handling complex datasets and simulations often lead to intricate bugs. Cursor's capability to automatically investigate errors and determine root causes can save crucial time in the HPC workflow. 3. **Tailored Code Suggestions:** Cursor's AI provides context-specific code suggestions by understanding the entire codebase. For HPC applications where performance is paramount, this means receiving recommendations that align with optimization goals. 4. **Improved Code Quality:** With AI-driven bug scanning and linter checks, Cursor ensures that HPC codes are not only fast but also robust and free of common errors. 5. **Easy Integration:** Being a fork of VSCode, Cursor allows seamless migration, ensuring that developers working in HPC can swiftly integrate their existing VSCode setups and extensions. In essence, for HPC tasks that demand speed, precision, and robustness, Cursor acts as an invaluable co-pilot, guiding developers towards efficient and optimized coding solutions. It is free if you provide your own OPEN AI API KEY.
- Bash shell tutorial0Training materials for using the bash (and zsh) shell.
- Wiki for Onboarding onto the C3DDB Cluster at MGHPCC0This is a resource for researchers and students looking to on-board onto the c3ddb cluster at MGHPCC. In the code section, there are example job submission scripts for the different queues on c3ddb.
- Numba: Compiler for Python0Numba is a Python compiler designed for accelerating numerical and array operations, enabling users to enhance their application's performance by writing high-performance functions in Python itself. It utilizes LLVM to transform pure Python code into optimized machine code, achieving speeds comparable to languages like C, C++, and Fortran. Noteworthy features include dynamic code generation during import or runtime, support for both CPU and GPU hardware, and seamless integration with the Python scientific software ecosystem, particularly Numpy.
- AWS Tutorial For Beginners0An AWS Tutorial for Beginners is a course that teaches the basics of Amazon Web Services (AWS), a cloud computing platform that offers a wide range of services, including compute, storage, networking, databases, analytics, machine learning, and artificial intelligence.
- RMACC Website0Rocky Mountain Advanced Computing Consortium Website
- ACCESS KB Guide - Expanse0Expanse at SDSC is a cluster designed by Dell and SDSC delivering 5.16 peak petaflops, and offers Composable Systems and Cloud Bursting. This documentation describes how to use the Expanse cluster with some specific information for people with ACCESS accounts.
- GDAL Multi-threading0Multi-threading guidance when using GDAL.
- Official Documentation for PyTorch and NumPy0The official documentation for PyTorch, a machine learning tensor-based framework, and NumPy, which allows for support for ndarrays which is useful to make tensors when implementing NNs. Both libraries can be installed with pip.
- C Programming0"These notes are part of the UW Experimental College course on Introductory C Programming. They are based on notes prepared (beginning in Spring, 1995) to supplement the book The C Programming Language, by Brian Kernighan and Dennis Ritchie, or K&R as the book and its authors are affectionately known. (The second edition was published in 1988 by Prentice-Hall, ISBN 0-13-110362-8.) These notes are now (as of Winter, 1995-6) intended to be stand-alone, although the sections are still cross-referenced to those of K&R, for the reader who wants to pursue a more in-depth exposition." C is a low-level programming language that provides a deep understanding of how a computer's memory and hardware work. This knowledge can be valuable when optimizing apps for performance or when dealing with resource-constrained environments.C is often used as the foundation for creating cross-platform libraries and frameworks. Learning C can allow you to develop libraries that can be used across different platforms, including iOS, Android, and desktop environments.
- Using Dask on HPC Systems0A tutorial on the effective use of Dask on HPC resources. The four-hour tutorial will be split into two sections, with early topics focused on novice Dask users and later topics focused on intermediate usage on HPC and associated best practices. The knowledge areas covered include (but are not limited to): Beginner section High-level collections including dask.array and dask.dataframe Distributed Dask clusters using HPC job schedulers Earth Science data analysis using Dask with Xarray Using the Dask dashboard to understand your computation Intermediate section Optimizing the number of workers and memory allocation Choosing appropriate chunk shapes and sizes for Dask collections Querying resource usage and debugging errors
- ACCESS Getting Started Quick-Guide0A step-by-step guide to getting your first allocation for Access computing and storage resources.
- Neural Networks in Julia0Making a neural network has never been easier! The following link directs users to the Flux.jl package, the easiest way of programming a neural network using the Julia programming language. Julia is the fastest growing software language for AI/ML and this package provides a faster alternative to Python's TensorFlow and PyTorch with a 100% Julia native programming and GPU support.
- Language models and using HPC resources0Documentation and research based on the latest NLP text generation detection methods for 2023.
- Network Science Textbook0
"Network Science" by Albert-László Barabási is a textbook that introduces the interdisciplinary field of network science. This field explores the connections and relationships between different entities, which can be anything from people in a social network to computers on the internet.
Description of the Textbook
The book is designed for a broad audience, including students and professionals in physics, computer science, engineering, economics, and social sciences. It covers a wide range of topics, from the "six degrees of separation" concept to the spread of viruses like Ebola. The textbook is structured to be accessible to both undergraduate and graduate students, with more complex mathematical details separated into "Advanced Topics" sections. It also offers extensive online resources, including films and software for network analysis.
The core idea of the book is that networks are everywhere, and understanding their structure and dynamics can provide valuable insights into a variety of complex systems. It uses real-world examples to illustrate key concepts and emphasizes the analysis of real network data.
Role in AI and Machine Learning
Network science plays a significant role in AI and machine learning by providing a framework for analyzing and understanding complex, interconnected data. Here's how it helps:
- Data Representation: Many real-world datasets can be represented as networks, such as social networks, transaction networks, and biological networks. Network science provides the tools to model and analyze this data, which can then be used to train machine learning models.
- Feature Engineering: Network properties, such as a node's centrality or the structure of its local neighborhood, can be used as features in machine learning models. This can help improve the performance of tasks like fraud detection, recommendation systems, and churn prediction.
- Graph Neural Networks (GNNs): GNNs are a class of deep learning models that are specifically designed to work with graph-structured data. They are heavily influenced by concepts from network science, such as message passing and neighborhood aggregation. GNNs have achieved state-of-the-art results on a variety of tasks, including node classification, link prediction, and graph classification.
- Understanding Complex Systems: Network science can be used to understand the behavior of complex systems, such as the spread of information or disease. This understanding can then be used to build more accurate AI and machine learning models.
Who Can Benefit and How?
A wide range of people can benefit from reading "Network Science," including:
- Data Scientists and Machine Learning Engineers: This book provides a strong foundation in network science, which is becoming increasingly important for working with graph-structured data. It can help them develop new features, build more accurate models, and gain a deeper understanding of their data.
- Computer Scientists and Software Engineers: The book can help them design more robust and efficient networked systems, such as communication networks and distributed systems.
- Social Scientists and Economists: The book can help them understand the structure and dynamics of social and economic networks, which can be used to study a variety of phenomena, such as the spread of fads and the stability of financial markets.
- Biologists and Medical Researchers: The book can help them understand the structure and function of biological networks, such as gene regulatory networks and protein-protein interaction networks. This can lead to new insights into diseases and the development of new drugs.
In short, anyone who is interested in understanding the interconnectedness of the world around them can benefit from reading "Network Science." It provides a powerful set of tools and concepts that can be applied to a wide variety of problems.
- Raftlib: Open Source library for concurrent data processing pipelines0Raftlib is an open-source C++ Library that provides a framework for implementing parallel and concurrent data processing pipelines. It is designed to simplify the development of high-performance data processing applications by abstracting away the complexities of parallelism, concurrency, and data flow management. It enables stream/data-flow parallel computation by linking parallel compute kernels together using simple right shift operators, similar to C++ streams for string manipulation. RaftLib eliminates the need for explicit usage of traditional threading libraries such as pthreads, std::thread, or OpenMP, which can lead to non-deterministic behavior when misused.
- fast.ai0Fastai offers many tools to people working with machine learning and artifical intelligence including tutorials on PyTorch in addition to their own library built on PyTorch, news articles, and other resources to dive into this realm.
- Master’s in Cybersecurity Degree Essentials0Offers comprehensive information on various master's degree options in cybersecurity, including program details, admission requirements, and career opportunities, helping students make informed decisions about pursuing an advanced degree in cybersecurity.