Skip to Content
Author's profile photo Tammy Powlas

Learning about Predictive Analysis – Segmentation Example

Given the official announcement that BusinessObjects Predictive Analysis is now generally available, I am reviewing how I am learning about the tool following the examples from the official SAP Help (page 40) using the Retail Store Segmentation example.

Scenario as explained from SAP Help:

“The country manager of a retail chain (which has 150 stores) is finalizing plans for three sales

promotion strategies. Data pertaining to stores such as store location, sales turnover, store size, staff,

and profit margin is stored in a CSV file. The manager wants to segment 150 stores into three different

groups based on sales turnover, profit margin, store size, and staff size so that specific strategies can

be applied to each store segment.”

What is clustering, segmentation do?  I think this explanation from Microsoft explains it well:

“The algorithm uses iterative techniques to group cases in a dataset into clusters that contain similar characteristics. These groupings are useful for exploring data, identifying anomalies in the data, and creating predictions.”

First I acquire the data via a .CSV file.  Notice the integration with Visual Intelligence

/wp-content/uploads/2012/11/1acq_160359.png

While the help does not state it, I go ahead and “enrich” the attributes so turnover, size, staff, margin are now measures.

Now I select the Predict Panel (step 6 of the help) and select R-K Means

/wp-content/uploads/2012/11/2rkmeans_160360.png

Now configure R-K means

/wp-content/uploads/2012/11/3configrkmeans_160397.png

I am configuring for 3 clusters as shown in above

Now going to the Data Writers tab (step 10 of help) I select the CSV writer to write the results of the R-K means analysis.

/wp-content/uploads/2012/11/4configcsv_160398.png

Then I run the Analysis and see the success message as shown below:

/wp-content/uploads/2012/11/5success_160399.png

Deviating from the steps in the help I look at the output of the CSV file as a result of running the analysis.  Now I have a column for “Cluster”:

/wp-content/uploads/2012/11/6csvlooks_160400.png

Here is how the results look in the Grid pane:

/wp-content/uploads/2012/11/7results_160401.png

Now the nice part is the built-in charts offered:

/wp-content/uploads/2012/11/8charts_160402.png

In the upper left above you can see the size of the clusters in a bar chart

In the upper right you see that cluster 1 has the lowest/weakest density and cluster 3 has the highest

Lower left shows that you can select the variables and use the slider for the clusters

Lower right shows a radar chart of the clusters; it also has a slider for clusters

/wp-content/uploads/2012/11/10algorithmsummary_160403.png

The above shows the algorithm summary – that is the middle icon in the Predict panel.

Additionally, you could use the visualization features from Visual Intelligence to view the results other ways.

If you are interested in learning more, there is an December 12th Webcast titled “Exciting New Release of SAP Predictive Analysis from a Leader in Analytics” (open to everyone) with SAP – register here

In January, SAP is offering a Roadmap webcast on Predictive Analysis in January 8th with Vishwanath Belur of SAP  – it is open to all with an SMP logon – register here

Assigned Tags

      7 Comments
      You must be Logged on to comment or reply to a post.
      Author's profile photo Andy Silvey
      Andy Silvey

      Hi Tammy,

      sitting here with a coffee skimming through the latest blogs on the SDN, I read this one motivated by curiosity as to what is, ' BusinessObjects Predictive Analysis'.

      Well I have to say, thanks to your fantastic all encompassing blog I now have quite a good intro to the subject.

      Thanks.

      All the best,

      Andy.

      Author's profile photo Tammy Powlas
      Tammy Powlas
      Blog Post Author

      Thank you, Andy, for taking the time to read and comment

      Drinking coffee + reading SCN sounds like a great combination to me as well

      All the best,

      Tammy

      Author's profile photo Sathish Kumar
      Sathish Kumar

      I understand that we can upload data using csv files. But what directly connecting to HANA Tables in our application? Also, is there a web version of the tool, so we can integrate it directly from UI5?

      Author's profile photo Tammy Powlas
      Tammy Powlas
      Blog Post Author

      Sathish, please post this question as a discussion so other experts can respond.

      regards,

      Tammy

      Author's profile photo Former Member
      Former Member

      Hi Powlas,

      A very nice blog. I have some questions.

      1. It is mentioned that we are dividing 150 stores into 3 clusters. What are the properties of each cluster? Is there a place in the tool to configure what each cluster stands for?

      2. In the last picture that is uploaded, Could you please elaborate on the charts a little bit especially the right top chart, which is about cluster density and distance?

      Thanks

      Premnath

      Author's profile photo Tammy Powlas
      Tammy Powlas
      Blog Post Author

      Hello Premnath,

      Thank you for your comment.

      There have been several updates to this and I was only using a trial version.

      I recommend you post this question in the discussion forum.

      Thank you,

      Tammy

      Author's profile photo Former Member
      Former Member

      Hello, I agree with Premnath.

      The graphs don't show the"features" of the clusters. In statistical analysis, the clusters are renamed to their interpretation.  Thank