Introduction to Unsupervised Machine Learning: Clustering Techniques | by Sravanth | Medium

Member-only story
Introduction to Unsupervised Machine Learning: Clustering Techniques
Sravanth
·Follow
13 min read·
Sep 24, 2024
--
1. Introduction to Unsupervised Machine LearningDefinition:
Unsupervised learning deals with unlabeled data, finding hidden patterns or intrinsic structures without predefined outcomes.
Difference from Supervised Learning:
No target variable or ground truth labels.
Focuses on data exploration and pattern discovery.
General Applications:
Market segmentation, anomaly detection, recommendation systems, and more.
2. K-Means Clustering2.1. What is K-Means Clustering?Intuition:
Partitions data into K distinct, non-overlapping clusters based on feature similarity.
Algorithm Steps:
Initialization: Select K initial centroids randomly.
Assignment Step: Assign each data point to the nearest centroid.
Update Step: Recalculate centroids as the mean of assigned points.
Iteration: Repeat steps 2 and 3 until convergence.
2.2. Choosing K: The Elbow MethodConcept:
Plot the explained variance (or within-cluster sum of squares) against the number of clusters.
--
--
Written by Sravanth73 Followers
·16 Following
Data Scientist
No responses yet
Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams