K-means algorithm falls under the clustering category.
Clustering of data is a method by which large sets of data is grouped into clusters of smaller sets of similar data.
After clustering the above balls will be arranged as follows.
K-means clustering algorithm -
In this algorithm, we decide how many clusters(K) we want.
We need to select K cluster instances which in such a way that they are "Farthest apart" from each other.
Our aim is to partition the points into k groups such that the sum of squares from points or distance to the assigned cluster centres is minimized.
This can be illustrated using Voronoi cells. From wikipedia,
Suppose we want to estimate the number of customers of a given shop. With all else being equal (price, products, quality of service, etc.), it is reasonable to assume that customers choose their preferred shop simply by distance considerations: they will go to the shop located nearest to them.
In order to achieve K-mean clustering by programmatic means, we use Lloyd's Algorithm in PAL which uses an iterative refinement technique.
Organizations can thus divide their customers into segments based on variants such as demography, customer behavior, customer profitability, measure of risk, etc.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
User | Count |
---|---|
7 | |
5 | |
5 | |
5 | |
4 | |
4 | |
4 | |
4 | |
3 | |
3 |