Workshop (6 hours): Clustering (Hdbscan, LCA, Hopach), dimension reduction (UMAP, GLRM), and anomaly detection (isolation forests).
Unsupervised machine learning is a class of algorithms that identifies patterns in unlabeled data, i.e. without considering an outcome or target. This workshop will describe and demonstrate powerful unsupervised learning algorithms used for clustering (hdbscan, latent class analysis, hopach), dimensionality reduction (umap, generalized low-rank models), and anomaly detection (isolation forests). Participants will learn how to structure unsupervised learning analyses and will gain familiarity with example code that can be adapted to their own projects.
Author: Chris Kennedy
This is an intermediate machine learning workshop. Participants should have significant prior experience with R and RStudio, including manipulation of data frames, installation of packages, and plotting.
Prerequisite workshops
Recommended workshops
Participants should have access to a computer with the following software:
To prepare for the workshop, please download the materials and work through the package installation in 0-install.Rmd
. Please report any errors to the GitHub issue queue.
There is also an RStudio Cloud workspace that can be used.
Please create a GitHub issue to report any errors or give feedback on this workshop.
Books