Industry recognized certification enables you to add this credential to your resume upon completion of all courses

Need Custom Training for Your Team?
Get Quote
Call Us

Toll Free (844) 397-3739

Inquire About This Course
Anirban  Ghosh, Instructor - Clustering  and Association Rule Mining

Anirban Ghosh

Anirban Ghosh is a Machine Learning Scientist. He has been part of leading consulting, captive and R&D organizations, as well as startups. He's also active in teaching/mentoring in the Data Science realm. His experience spans over various sectors e.g. retail, telecom, travel/hospitality etc. Anirban has 10+ years of experience in Analytics and Data Science & holds a Bachelor's Degree in Statistics from St. Xavier's College, Calcutta and Master's in Applied Statistics & Computing from IIT, Bombay.

Instructor: Anirban Ghosh

Learn Clustering methods and Association Rule Mining Techniques

  • Learn concepts of Cluster Analysis and study most popular set of Clustering algorithms with end-to-end examples in R
  • Supported by office hours and hands-on practice exercises to be submitted at the end of the course 
  • Instructor is a Machine Learning Scientist with 10+ years of hands-on experience in predictive analytics and data science research at leading consulting, captive and R&D organizations.

Duration: 4h 44m

Course Description

Clustering and Association Rule Mining are two of the most frequently used Data Mining technique for various functional needs, especially in Marketing, Merchandising, and Campaign efforts. Clustering helps find natural and inherent structures amongst the objects, where as Association Rule is a very powerful way to identify interesting relations between objects in large commercial databases. The main motivation for the course is: i) This course specifically touches upon the scenarios where Clustering is necessary, and which Clustering technique is appropriate for which scenario. ii) This course also stresses on advantages as well as practical issues with different Clustering techniques

What am I going to get from this course?

  • Learn clustering through examples in R – that you immediately apply in your day-to-day work
  • Over 20 lectures and 5-6 hours of content, plus 2 practice exercises on Clustering and Market Basket Analysis
  • Learn practical Hierarchical, Non-Hierarchical, Density based clustering techniques. Also Association rules and Market Basket Analysis

Prerequisites and Target Audience

What will students need to know or do before starting this course?

  • Fundamental understanding of Statistics/Mathematics, specially Probability, Set Theory, Distance Measure, Matrix Algebra etc.
  • All examples in this course will be run in R. So, prior understanding of R is desirable.

Who should take this course? Who should not?

  • Beginners who want to change careers to Data Science and wish to enrich their horizon of knowledge with both theory and examples can also take this course
  • Experienced Domain Experts who intent to gain solid practical understanding on Clustering and use that to derive business insights in their current role
  • Analytics and Data Science professionals who want to refresh their skills in Clustering


Module 1: Overview of Unsupervised Learning

Lecture 1 Introduction to Unsupervised Learning

Review the basics of Unsupervised Learning

Lecture 2 Exploratory Data Analysis

Understand Exploratory Data Analysis (EDA) with R sessions

Lecture 3 R Session on Exploratory Data Analysis (Part 1)

Practical R-session on Correlation

Lecture 4 R Session on Exploratory Data Analysis (Part 2)

Practical R-session on Principal Components

Lecture 5 Introduction to Clustering

Get started to understand the Clustering basics

Module 2: K-Means Clustering

Lecture 6 Introductory Basics

Understand different mathematical basics and terminologies needed for K-Means

Lecture 7 K-Means Algorithm
Lecture 8 How to decide K?
Lecture 9 Example through R
Lecture 10 Further Discussions

Module 3: Hierarchical Clustering

Lecture 11 Introduction

Understand extended distance measures, dendrogram etc.

Lecture 12 Quick Algorithm
Lecture 13 Cluster number determination

Lecture 14 Example through R
Lecture 15 Further Discussions

Module 4: Density based Clustering

Lecture 16 Introduction & Terminologies
Lecture 17 DBSCAN Algorithm
Lecture 18 Choice of parameters

How do we empirically choose optimal parameters?

Lecture 19 Example through R
Lecture 20 Further Discussions

Module 5: Association Rules (AR)

Lecture 21 Introduction
Lecture 22 The Apriori Nature
Lecture 23 Market Basket Analysis
Lecture 24 Example through R
Lecture 25 Further Discussions
Lecture 26 Practice Exercises on Clustering & Association Rule

2 practice exercises, each on Clustering and Association Rule Mining


7 Reviews

David K

December, 2016

William C

May, 2017

This course is an all-encompassing and enthusiastic learning experience of most popular set of Cluster algorithms and analysis. It was educative and collaborative with end-to-end examples and hands-on practice exercises. It helped me learn quickly the data mining techniques in my functional needs in marketing and campaign efforts. This course specifically taught the various scenarios where clustering is necessary and showed very well clustering techniques appropriate for a scenario. The examples provided in the course were excellent and very useful. I find the lecturer friendly and professional. It was a nice participation experience for me.

Colin S

May, 2017

It is a fascinating course. Though I was an experience domain expert, I still took this course to gain practical understanding on clustering to derive business insights in my present role. Whether you are a beginner of experienced it is a great course to enrich your knowledge in analytics and data science.

Jeff J

July, 2017

A remarkable class that provides you a perfect introduction to clustering and gets you through to fitting an intermediary standard user. I very much recommend it if this is the area you're looking to get into or the experience you wish to include on your resume.

Sheetal C

July, 2017

Very good analysis. The lectures are straightforward and easy to understand. The subject matter is very suitable to practical application use.

Robert T

July, 2017

A fine preparatory study of clustering. Very resourceful. I just figure that there could be better help with reference to the practice tests. Otherwise the lectures by the instructor were really amazing

Jerry C

July, 2017

The instructor has masterfully communicated the elements and precisely crafted the curriculum components, therefore the course is both simple to keep up and you gain a tremendous amout of knowledge.