The website is in Maintenance mode. We are in the process of adding more features.
Any new bookmarks, comments, or user profiles made during this time will not be saved.

Machine Learning Resources

How does the initial choice of centroids affect the K-Means algorithm?

Bookmark this question

The final cluster assignments of the K-Means algorithm can be sensitive to the location of the initial centroids. For example, it is possible that one observation could be far removed from any other points in its region, and in an extreme case, a cluster could end up having only one data point. On the flip side, if initial centroids are chosen in close proximity to one another, it might lead to clusters that have a lot of overlap and fail to separate points into distinguishable regions within the data. K-Means usually is repeated multiple times with different initializations, and the iteration that results in the most pure clusters is chosen. Further, more specific initialization strategies exist to improve the quality of clustering.

Leave your Comments and Suggestions below:

Please Login or Sign Up to leave a comment

Partner Ad  

Find out all the ways
that you can

Explore Questions by Topics

Partner Ad

Learn Data Science with Travis - your AI-powered tutor |