Skip to main content

Breadcrumb

  1. ACCESS Home
  2. Support
  3. Knowledge Base
  4. Knowledge Base Resources

Knowledge Base Resources

These resources are contributed by researchers, facilitators, engineers, and HPC admins. Please upvote resources you find useful!
Add a Resource

Filters

Topics

  • Show all (154)
  • (-) data-analysis (39)
  • machine-learning (18)
  • big-data (15)
  • ai (14)
  • deep-learning (8)
  • visualization (8)
  • r (7)
  • documentation (6)
  • plotting (5)
  • image-processing (4)
  • training (4)
  • workforce-development (4)
  • cuda (3)
  • neural-networks (3)
  • git (2)
  • matlab (2)
  • monte-carlo (2)
  • parallelization (2)
  • slurm (2)
  • vectorization (2)
  • access-account (1)
  • aces (1)
  • allocation-value (1)
  • anaconda (1)

Topics

  • Show all (154)
  • (-) data-analysis (39)
  • machine-learning (18)
  • big-data (15)
  • ai (14)
  • deep-learning (8)
  • visualization (8)
  • r (7)
  • documentation (6)
  • plotting (5)
  • image-processing (4)
  • training (4)
  • workforce-development (4)
  • cuda (3)
  • neural-networks (3)
  • git (2)
  • matlab (2)
  • monte-carlo (2)
  • parallelization (2)
  • slurm (2)
  • vectorization (2)
  • access-account (1)
  • aces (1)
  • allocation-value (1)
  • anaconda (1)

If you'd like to use more filters, please login to view them all.

PyTorch for Deep Learning and Natural Language Processing
1
  • Introduction to PyTorch for Deep Learning
PyTorch is a Python library that supports accelerated GPU processing for Machine Learning and Deep Learning. In this tutorial, I will teach the basics of PyTorch from scratch. I will then explore how to use it for some ML projects such as Neural Networks, Multi-layer perceptrons (MLPs), Sentiment analysis with RNN, and Image Classification with CNN.
aibig-datadata-analysisdeep-learningmachine-learningneural-networks
1 Like

Login to like
Type
documentation
Level
Beginner
Cornell Virtual Workshop
1
  • Roadmaps in Cornell Virtual Workshop
  • Search for topics
Cornell Virtual Workshop is a comprehensive training resource for high performance computing topics. The Cornell University Center for Advanced Computing (CAC) is a leader in the development and deployment of Web-based training programs. Our Cornell Virtual Workshop learning platform is designed to enhance the computational science skills of researchers, accelerate the adoption of new and emerging technologies, and broaden the participation of underrepresented groups in science and engineering. Over 350,000 unique visitors have accessed Cornell Virtual Workshop training on programming languages, parallel computing, code improvement, and data analysis. The platform supports learning communities around the world, with code examples from national systems such as Frontera, Stampede2, and Jetstream2.
jetstreammatlabcloud-computingdata-analysisperformance-tuningparallelizationfile-transferglobusslurmtrainingcudamatlabpythonrmpi
1 Like

Login to like
Type
learning
Level
Beginner, Intermediate, Advanced
Useful R Packages for Data Science and Statistics
1
  • https://www.udacity.com/blog/2021/01/best-r-packages-for-data-science.html
This Udacity article listed the most frequently used R packages for data science and statistics. For each package, the article provided the link to its official documentation. It will be a great start point if you want to start your data science journey in R.
plottingvisualizationdata-analysismachine-learningdata-sciencer
1 Like

Login to like
Type
documentation
Level
Beginner, Intermediate, Advanced
Introduction to Python for Digital Humanities and Computational Research
1
  • Introduction to Python book
This documentation contains introductory material on Python Programming for Digital Humanities and Computational Research. This can be a go-to material for a beginner trying to learn Python programming and for anyone wanting a Python refresher.
aibig-datadata-analysisdeep-learningdata-sciencepython
1 Like

Login to like
Type
documentation
Level
Beginner
AI Institutes Cyberinfrastructure Documents: SAIL Meeting
0
  • Recording of CI ACCESS Talk
  • Flyer about ACCESS
  • Slides from CI ACCESS Talk
Materials from the SAIL meeting (https://aiinstitutes.org/2023/06/21/sail-2023-summit-for-ai-leadership/). A space where AI researchers can learn about using ACCESS resources for AI applications and research.
ACCESS-accountaidata-analysismachine-learning
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate, Advanced
FreeSurfer Tutorials
0
  • FreeSurfer Tutorials
The official MGH / Harvard tutorial page for FreeSurfer. The FreeSurfer group has provided and designed a series of tutorials for using FreeSurfer and for getting acquainted with the concepts needed to perform its various modes of analysis and processing of MRI data. The tutorials are designed to be followed along in a terminal window where commands can be copy/pasted instead of typed.
data-analysisimage-processingpsychology
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate
Optimizing Research Workflows - A Documentation of Snakemake
0
  • https://snakemake.readthedocs.io/en/stable/
Snakemake is a powerful and versatile workflow management system that simplifies the creation, execution, and management of data analysis pipelines. It uses a user-friendly, Python-based language to define workflows, making it particularly valuable for automating and reproducibly managing complex computational tasks in research and data analysis.
documentationdata-analysisdata-reproducibilityworkflowbioinformaticsdata-sciencepython
0 Likes

Login to like
Type
documentation
Level
Intermediate, Advanced
Scikit-Learn: Easy Machine Learning and Modeling
0
  • Scikit-learn
Scikit-learn is free software machine learning library for Python. It has a variety of features you can use on data, from linear regression classifiers to xg-boost and random forests. It is very useful when you want to analyze small parts of data quickly.
documentationaiplottingvisualizationbig-datadata-analysisdeep-learningimage-processingmachine-learningmonte-carloneural-networksvectorization
0 Likes

Login to like
Type
tool
Level
Beginner, Intermediate
Python Tools for Data Science
0
  • Python Tools for Data Science
Python has become a very popular programming language and software ecosystem for work in Data Science, integrating support for data access, data processing, modeling, machine learning, and visualization. In this webinar, we will describe some of the key Python packages that have been developed to support that work, and highlight some of their capabilities. This webinar will also serve as an introduction and overview of topics addressed in two Cornell Virtual Workshop tutorials, available at https://cvw.cac.cornell.edu/pydatasci1 and https://cvw.cac.cornell.edu/pydatasci2
aimachine-learningbig-datadata-analysisdata-wranglingdata-sciencetrainingworkforce-developmentpythonscikit-learnsql
0 Likes

Login to like
Type
video_link
Level
Intermediate
Online Bachelor's in Data Science Program Guide - TechGuide
0
  • Online Data Science Degree Programs
The realm of data science is one that onlookers regard with curiosity and respect. There are a lot of unknowns in this area of study that only recently became hugely relevant. It is important to get the facts on how expertise in data science is transforming the world. This article features what a bachelor’s degree means in today’s market and the future.
machine-learningbig-datadata-analysisdata-science
0 Likes

Login to like
Type
website
Level
Intermediate
What is fairness in ML?
0
  • Building ML models for everyone: understanding fairness in machine learning
This article discusses the importance of fairness in machine learning and provides insights into how Google approaches fairness in their ML models. The article covers several key topics: Introduction to fairness in ML: It provides an overview of why fairness is essential in machine learning systems, the potential biases that can arise, and the impact of biased models on different communities. Defining fairness: The article discusses various definitions of fairness, including individual fairness, group fairness, and disparate impact. It explains the challenges in achieving fairness due to trade-offs and the need for thoughtful considerations. Addressing bias in training data: It explores how biases can be present in training data and offers strategies to identify and mitigate these biases. Techniques like data preprocessing, data augmentation, and synthetic data generation are discussed. Fairness in ML algorithms: The article examines the potential biases that can arise from different machine learning algorithms, such as classification and recommendation systems. It highlights the importance of evaluating and monitoring models for fairness throughout their lifecycle. Fairness tools and resources: It showcases various tools and resources available to practitioners and developers to help measure, understand, and mitigate bias in machine learning models. Google's TensorFlow Extended (TFX) and What-If Tool are mentioned as examples. Google's approach to fairness: The article highlights Google's commitment to fairness and the steps they take to address fairness challenges in their ML models. It mentions the use of fairness indicators, ongoing research, and partnerships to advance fairness in AI. Overall, the article provides a comprehensive overview of fairness in machine learning and offers insights into Google's approach to building fair ML models.
aivisualizationdata-analysisdeep-learningmachine-learning
0 Likes

Login to like
Type
documentation
Level
Intermediate
Intro to Statistical Computing with Stan
0
  • https://mc-stan.org/users/documentation/
  • https://vasishth.github.io/bayescogsci/book/ch-introstan.html
  • https://pystan.readthedocs.io/en/latest/
The Stan language is used to specify a (Bayesian) statistical model with an imperative program calculating the log probability density function. Here are some useful links to start your exploration of this statistical programming language, and a Python interface to Stan.
data-analysismachine-learningmonte-carlopython
0 Likes

Login to like
Type
documentation
Level
Beginner, Intermediate
An Introduction to the Julia Programming Language
0
  • An Introduction to Julia
  • The Julia Computing Language
The Julia Programming Language is one of the fastest growing software languages for AI/ML development. It writes in manner that's similar to Python while being nearly as fast as C++, while being open source, and reproducible across platforms and environments. The following link provide an introduction to using Julia including the basic syntax, data structures, key functions, and a few key packages.
aidata-analysismachine-learningjulia
0 Likes

Login to like
Type
learning
Level
Beginner
FSL Lectures
0
  • FSL Courses
This is the official University of Oxford FSL group lecture page. This includes information on upcoming and past courses (online and in-person), as well as lecture materials. Available lecture materials includes slides and recordings on using FSL, MR physics, and applications of imaging data.
data-analysisimage-processingpsychology
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate, Advanced
Numpy - a Python Library
0
  • NumPY Docs
Numpy is a python package that leverages types and compiled C code to make many math operations in Python efficient. It is especially useful for matrix manipulation and operations.
documentationbig-datadata-analysisdeep-learningopencvpytorchtensorflowdata-science
0 Likes

Login to like
Type
tool
Level
Beginner, Intermediate
Applications of Machine Learning in Engineering and Parameter Tuning Tutorial
0
  • Applications of ML in Engineering and Parameter Tuning Tutorial (RMACC 2019)
Slides for a tutorial on Machine Learning applications in Engineering and parameter tuning given at the RMACC conference 2019.
data-analysismachine-learningpython
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate
Online Master's in Business Analytics Program Guide - TechGuide
0
  • Find Online Master's in Business Analytics
A degree in business analytics looks different in today’s world than it did a decade ago. In its most current application, business analytics uses modern data science and capabilities in machine learning (ML). The magic comes into play when these are leveraged for strategic planning.
machine-learningbig-datadata-analysisdata-science
0 Likes

Login to like
Type
website
Level
Advanced
Scipy Lecture Notes
0
  • https://lectures.scientific-python.org/
Comprehensive tutorials and lecture notes covering various aspects of scientific computing using Python and Scipy.
visualizationdata-analysismachine-learningpython
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate
Data Imputation Methods for Climate Data and Mortality Data
0
  • Data Imputation Methods for Climate Data and Mortality Data - Slices
  • Github repository
  • Data Imputation Methods for Climate Data and Mortality Data - Full Tutorial
This slices and videos introduced how to use K-Nearest-Neighbors method to impute climate data and how to use Bayesian Spatio-Temporal models in R-INLA to impute mortality data. The demos will be added soon.
allocation-valuedocumentationaiplottingvisualizationdata-analysismachine-learning
0 Likes

Login to like
Type
video_link
Level
Intermediate, Advanced
NITRC
0
  • NITRC
The Neuroimaging Tools and Resources Collaboratory (NITRC) is a neuroimaging informatics knowledge environment for MR, PET/SPECT, CT, EEG/MEG, optical imaging, clinical neuroinformatics, imaging genomics, and computational neuroscience tools and resources.
data-analysisimage-processingdata-sharing
0 Likes

Login to like
Type
website
Level
Beginner, Intermediate, Advanced
Discover Data Science
0
  • Discover Data Science
Discover Data Science is all about making connections between prospective students and educational opportunities in an exciting new, hot, and growing field – data science.
data-analysisworkforce-development
0 Likes

Login to like
Type
website
Level
Beginner
Fundamentals of R Programming
0
  • Fundamentals of R Programming
  • Instructions for Launching learnR notebook on TAMU ACES cluster
This course is an introduction to the R programming language and covers the fundamental concepts needed to operate in the R environment. This course was taught for the ACCESS community on September 26, 2023, but the materials for the course are still available on the ACES cluster and can be completed independently. All materials are presented as learnR notebooks and cover several topics, including data types, variables, built-in functions, data structures, and plotting.
ACESTAMUplottingdata-analysisr
0 Likes

Login to like
Type
learning
Level
Beginner
Automated Machine Learning Book
0
  • Automated Machine Learning: Methods, Systems, Challenges
The authoritative book on automated machine learning, which allows practitioners without ML expertise to develop and deploy state-of-the-art machine learning approaches. Describes the background of techniques used in detail, along with tools that are available for free.
aidata-analysisdeep-learningmachine-learningneural-networkspythonr
0 Likes

Login to like
Type
learning
Level
Intermediate, Advanced
Master's in Data Science Program Guide - TechGuide
0
  • Masters in Data Science Program Guide
A master’s degree in data science helps prepare professionals to take the next career step. This article will focus primarily on data science, a graduate degree in this field, and a data scientist or data analyst career. With many employers preferring a master’s degree in data science for those seeking to fill roles as data scientists or analysts, we will discuss the data science master’s degree in detail.
big-datadata-analysisdata-science
0 Likes

Login to like
Type
website
Level
Advanced

Pagination

  • Current page 1
  • Page 2
  • Next page ››
  • Last page Last »