Course Description
My name is Peter Chen and I am the instructor for this course. I want to introduce you to the wonderful world of Unsupervised Machine Learning. Specifically, we will focus on Clustering algorithms and methods through practical examples and code. More importantly, it will get you up and running quickly with a clear conceptual understanding. The course has code & sample data for you to run and learn from. It also encourages you to explore your own datasets using Clustering algorithms.
Prerequisites:
Beginner knowledge of Python. It's used mostly for expository reasons. You do not need to be a Python expert. Basic math and comfortable with basic probability and statistics.
What am I going to get from this course?
* Understand the major types of clustering algorithms
* Know what, how, when to apply a k-means, GMM, and hierarchical clustering
* Understand the power of Gaussian Mixture Models(GMM) to go beyond simple clustering needs
* Determine the optimal number of clusters
* Gained an intuition behind the math of the underlying algorithms and be able to explain it
* Learn how to use Python scikit-learn library to build clustering machine learning models
* Apply Python code to their data sets to solve clustering various problems
* Evaluate the quality of clustering using Silhouette plots
* Learn about different industry applications of Clustering
Prerequisites and Target Audience
What will students need to know or do before starting this course?
Basic Python. Do not need to be an expert programmer. We use Python mainly for expository reasons. Basic probability math.
Who should take this course? Who should not?
Students who are interested in a practical introduction to clustering, a kind of unsupervised machine learning. Want an intuitive understanding of the theory behind clustering.
Students can use these methods and algorithms for hot applications such as marketing analytics, customer segmentation, anomaly detection, fraud detection, and other practical applications in their respective fields. Must like to play with data and code.
Curriculum
Module 1: Welcome & Introductions
06:04
Lecture 1
Welcome to the Course
03:20
Lecture 2
Course Overview and Introductions
02:44
Module 2: K-Means Clustering
18:54
Lecture 3
K-Means Clustering
03:08
Lecture 4
How does K-means do that?
07:51
Lecture 5
Similarity Measures
05:09
Lecture 6
Issues with K-Means
02:46
Module 3: Gaussian Mixture Models
17:42
Lecture 7
GMM Introductions
03:13
Lecture 8
GMM: Code Examples
06:29
Lecture 9
GMM as Density Estimators
02:41
Lecture 10
GMM: Optimal Number of Components
03:14
Lecture 11
GMM - Generate New Data
02:05
Module 4: Hierarchical Clustering
18:21
Lecture 12
Introductions to Hierarchical Clustering
03:50
Lecture 13
Linkage Methods
03:17
Lecture 14
Hierarchical Clustering Walk-Through
07:09
Lecture 15
Divisive Algorithm
01:43
Lecture 16
Hierarchical Clustering - Code Examples
02:22
Module 5: Methods for Selecting Number of Clusters
03:46
Lecture 17
Methods for Selecting Number of Clusters
03:46
Module 6: Evaluating the Quality of the Clustering
01:19
Lecture 18
Evaluating the Quality of Clustering
01:19
Module 7: Industry Applications
03:45
Lecture 19
Industry Applications
03:45
Module 8: Mini-Project: Pulling It All Together
Module 9: Mini-Project Solution Preview
01:42
Lecture 21
Solution Preview
01:42