Skip to main content

Breadcrumb

  1. ACCESS Home
  2. Support
  3. Knowledge Base
  4. Knowledge Base Resources

Knowledge Base Resources

These resources are contributed by researchers, facilitators, engineers, and HPC admins. Please upvote resources you find useful!
Add a Resource

Filters

Topics

  • Show all (73)
  • (-) visualization (18)
  • data-analysis (8)
  • plotting (7)
  • ai (6)
  • machine-learning (6)
  • image-processing (4)
  • deep-learning (3)
  • documentation (3)
  • big-data (2)
  • neural-networks (2)
  • r (2)
  • allocation-value (1)
  • computer-graphics (1)
  • computer-vision (1)
  • conda (1)
  • git (1)
  • julia (1)
  • jupyterhub (1)
  • matlab (1)
  • monte-carlo (1)
  • pytorch (1)
  • training (1)
  • vectorization (1)

Topics

  • Show all (73)
  • (-) visualization (18)
  • data-analysis (8)
  • plotting (7)
  • ai (6)
  • machine-learning (6)
  • image-processing (4)
  • deep-learning (3)
  • documentation (3)
  • big-data (2)
  • neural-networks (2)
  • r (2)
  • allocation-value (1)
  • computer-graphics (1)
  • computer-vision (1)
  • conda (1)
  • git (1)
  • julia (1)
  • jupyterhub (1)
  • matlab (1)
  • monte-carlo (1)
  • pytorch (1)
  • training (1)
  • vectorization (1)

If you'd like to use more filters, please login to view them all.

Useful R Packages for Data Science and Statistics
1
  • https://www.udacity.com/blog/2021/01/best-r-packages-for-data-science.html
This Udacity article listed the most frequently used R packages for data science and statistics. For each package, the article provided the link to its official documentation. It will be a great start point if you want to start your data science journey in R.
plottingvisualizationdata-analysismachine-learningdata-sciencer
1 Like

Login to like
Type
documentation
Level
Beginner, Intermediate, Advanced
Data Imputation Methods for Climate Data and Mortality Data
0
  • Data Imputation Methods for Climate Data and Mortality Data - Slices
  • Github repository
  • Data Imputation Methods for Climate Data and Mortality Data - Full Tutorial
This slices and videos introduced how to use K-Nearest-Neighbors method to impute climate data and how to use Bayesian Spatio-Temporal models in R-INLA to impute mortality data. The demos will be added soon.
allocation-valuedocumentationaiplottingvisualizationdata-analysismachine-learning
0 Likes

Login to like
Type
video_link
Level
Intermediate, Advanced
Handwritten Digits Tutorial in PyTorch
0
  • Handwritten Digits PyTorch
This tutorial is essentially the "hello world" of image recognition and feed-forward neural network (using PyTorch). Using the MNIST database (filled within images of handwritten digits), the tutorial will instruct how to build a feed-forward neural network that can recognize handwritten digits. A solid understanding of feed-forward and back-propagation is recommended.
aivisualizationdeep-learningimage-processingmachine-learningneural-networkspytorchpython
0 Likes

Login to like
Type
website
Level
Intermediate
marimo | a next generation python notebook
0
  • marimo | a next generation python notebook
Introduction seminar for new reactive python notebook from marimo ambassador.
aicomputer-graphicsplottingvisualizationbig-datadata-analysisgitpython
0 Likes

Login to like
Type
tool
Level
Beginner, Intermediate, Advanced
Scikit-Learn: Easy Machine Learning and Modeling
0
  • Scikit-learn
Scikit-learn is free software machine learning library for Python. It has a variety of features you can use on data, from linear regression classifiers to xg-boost and random forests. It is very useful when you want to analyze small parts of data quickly.
documentationaiplottingvisualizationbig-datadata-analysisdeep-learningimage-processingmachine-learningmonte-carloneural-networksvectorization
0 Likes

Login to like
Type
tool
Level
Beginner, Intermediate
Gesture Classifier Model using MediaPipe
0
  • Docs
MediaPipe is Google's open-source framework for building multimodal (e.g., video, audio, etc.) machine learning pipelines. It is highly efficient and versatile, making it perfect for tasks like gesture recognition. This is a tutorial on how to make a custom model for gesture recognition tasks based on the Google MediaPipe API. This tutorial is specifically for video-playback, though could be generalized to image and live-video feed recognition.
aicomputer-visionvisualizationimage-processing
0 Likes

Login to like
Type
documentation
Level
Intermediate
What is fairness in ML?
0
  • Building ML models for everyone: understanding fairness in machine learning
This article discusses the importance of fairness in machine learning and provides insights into how Google approaches fairness in their ML models. The article covers several key topics: Introduction to fairness in ML: It provides an overview of why fairness is essential in machine learning systems, the potential biases that can arise, and the impact of biased models on different communities. Defining fairness: The article discusses various definitions of fairness, including individual fairness, group fairness, and disparate impact. It explains the challenges in achieving fairness due to trade-offs and the need for thoughtful considerations. Addressing bias in training data: It explores how biases can be present in training data and offers strategies to identify and mitigate these biases. Techniques like data preprocessing, data augmentation, and synthetic data generation are discussed. Fairness in ML algorithms: The article examines the potential biases that can arise from different machine learning algorithms, such as classification and recommendation systems. It highlights the importance of evaluating and monitoring models for fairness throughout their lifecycle. Fairness tools and resources: It showcases various tools and resources available to practitioners and developers to help measure, understand, and mitigate bias in machine learning models. Google's TensorFlow Extended (TFX) and What-If Tool are mentioned as examples. Google's approach to fairness: The article highlights Google's commitment to fairness and the steps they take to address fairness challenges in their ML models. It mentions the use of fairness indicators, ongoing research, and partnerships to advance fairness in AI. Overall, the article provides a comprehensive overview of fairness in machine learning and offers insights into Google's approach to building fair ML models.
aivisualizationdata-analysisdeep-learningmachine-learning
0 Likes

Login to like
Type
documentation
Level
Intermediate
Astronomy data analysis with astropy
0
  • astropy
Astropy is a community-driven package that offers core functionalities needed for astrophysical computations and data analysis. From coordinate transformations to time and date handling, unit conversions, and cosmological calculations, Astropy ensures that astronomers can focus on their research without getting bogged down by the intricacies of programming. This guide walks you through practical usage of astropy from CCD data reduction to computing galactic orbits of stars.
visualizationimage-processingastrophysics
0 Likes

Login to like
Type
learning
Level
Intermediate, Advanced
R for Data Science
0
  • https://r4ds.had.co.nz/index.html
R for Data Science is a comprehensive resource for individuals looking to harness the power of the R programming language for data analysis, visualization, and statistical modeling. Whether you're a beginner or an experienced data scientist, this guide will help you unlock the full potential of R in the realm of data science.
visualizationdata-analysisdata-sciencer
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate, Advanced
Introduction to Vizualization on HPC Using Python
0
  • University of Arizona Workshop Series: Introduction to HPC, Visualization
This workshop has an introduction to the concepts of visualization followed by hands on exercises. The concepts section has Speaker Notes, and the hands on section has an accompanying Jupyter notebook. The workshop is one in a series of Introduction to HPC
visualizationdocumentationtrainingjupyterhub
0 Likes

Login to like
Type
learning
Level
Beginner
Paraview UArizona HPC links (advanced)
0
  • Getting started with the paraview terminal
  • Batch headless rendering with Paraview
These links take you to visualization resources supported by the University of Arizona's HPC visualization consultant ([rtdatavis.github.io](http://rtdatavis.github.io/)). The following links are specific to the Paraview program and the workflows that have been used my researchers at the U of Arizona. These links are distinct from the others posted in the beginner paraview access ci links from the University of Arizona in that they are for more complex workflows. The links included explain how to use the terminal with paraview (pvpython), and the steps to leverage HPC resources for headless batch rendering. The batch rendering tutorial is significantly more complex than the others so if you find yourself stuck please post on the https://ask.cyberinfrastructure.org/ and I will try to troubleshoot with you.
visualization
0 Likes

Login to like
Type
documentation
Level
Intermediate, Advanced
The Official Documentation of Pandas
0
  • pandas documentation
Pandas is one of the most essential Python libraries for data analysis and manipulation. It provides high-performance, easy-to-use data structures, and data analysis tools for the Python programming language. The official documentation serves as an in-depth guide to using this powerful tool including explanations and examples.
plottingvisualization
0 Likes

Login to like
Type
documentation
Level
Beginner, Intermediate
Scipy Lecture Notes
0
  • https://lectures.scientific-python.org/
Comprehensive tutorials and lecture notes covering various aspects of scientific computing using Python and Scipy.
visualizationdata-analysismachine-learningpython
0 Likes

Login to like
Type
learning
Level
Beginner, Intermediate
Data Visualization Tools for Julia
0
  • Visualizations in Julia Using Plots.jl
  • Plotting Options Using Julia
Plots.jl is the most widely used plotting library for the Julia programming language. It's known for being especially powerful in its versatility and intuitiveness. It's limited set of dependencies and wide applicability across different graphics packages make it especially helpful in visualizing the results of your latest Julia implementation. However, there are still multiple options available for Julia programmers to visualize their datasets. The second link details a comparison against a variety of Julia packages.
plottingvisualizationjulia
0 Likes

Login to like
Type
tool
Level
Beginner, Intermediate
Paraview UArizona HPC links (beginner)
0
  • University of Arizona Visualization homepage
  • Getting Started with Paraview
  • Paraview Cameras and Keyframes
  • Graphs and Data Exporting
  • Visualizing netcdf files
These links take you to visualization resources supported by the University of Arizona's HPC visualization consultant (rtdatavis.github.io). The following links are specific to the Paraview program and the workflows that have been used my researchers at the U of Arizona. Some of the pages linked are very beginner friendly: getting started, working with cameras and keyframes for rendering, visualizing external files (netcdf climate data), graphs and data exporting. Many of the workflows involve using remote desktops via the Open On Demand interface, but if this isn't set up at your university you can use paraview locally on a desktop. Feel free to post on access ci https://ask.cyberinfrastructure.org/ if you need assistance getting a paraview gui open for your work on HPC.
visualization
0 Likes

Login to like
Type
documentation
Level
Beginner
Data visualization with Matplotlib
0
  • Guide to data visualization with matplotlib
Data visualization is a critical aspect of data analysis. It allows for a clear and concise representation of data, making it easier for users to understand and interpret complex datasets. One of the most popular libraries for data visualization in Python is Matplotlib. The included website aims to provide a brief overview of Matplotlib, its features, and examples/exercises to dive deeper into its functionalities.
plottingvisualization
0 Likes

Login to like
Type
website
Level
Beginner
MATLAB bioinformatics toolbox
0
  • https://www.mathworks.com/products/bioinfo.html
Bioinformatics Toolbox provides algorithms and apps for Next Generation Sequencing (NGS), microarray analysis, mass spectrometry, and gene ontology. Using toolbox functions, you can read genomic and proteomic data from standard file formats such as SAM, FASTA, CEL, and CDF, as well as from online databases such as the NCBI Gene Expression Omnibus and GenBank.
visualizationdata-analysisbioinformaticsgenomicsmatlab
0 Likes

Login to like
Type
tool
Level
Beginner, Intermediate, Advanced