I am following along with the new SAP Press book Predictive Analysis which helps give some background.  I also read the wikipedia link on K-Means clustering.

I have a sample CSV file of rentals, go to the Predict tab, and select the R-K Means algorithm under Clustering.

/wp-content/uploads/2014/01/1figure_361354.png

Above shows I am selecting 5 clusters, and all features in column selection.

/wp-content/uploads/2014/01/2run_361412.png

Then I select “Run” to run the clustering analysis.

3distribution of clusters.png

In the results screen I can see various cluster representations.  Above shows a bar chart of clusters.

/wp-content/uploads/2014/01/4cluster_361414.png

In reading the book, the “thicker the density” the closer the clusters in the above chart.

/wp-content/uploads/2014/01/6cluster3_361416.png

Above is the parallel coordinates chart.

Are cluster 3 customers the most valuable ones, according to this chart?  To be continued…

To report this post you need to login first.

Be the first to leave a comment

You must be Logged on to comment or reply to a post.

Leave a Reply