Skip to Content

As part of my participation in the Data Geek Challenge 2.0 in the summer of 2013. I stumbled upon the records of the Titanic sinking from the University of Toronto. During the analysis with SAP Predictive Analysis it shows very quickly the extent of the tragedy and the low chance to survice for each passanger.


To report this post you need to login first.

18 Comments

You must be Logged on to comment or reply to a post.

    1. Boris Bader Post author

      Hi Benedict,

      I used SnagIt to capture the video and Windows Movie Maker to cut it. Hope this solves your question.

      greeting

      boris

      (0) 
  1. Bimal Mehta

    Nice one, it will be great if you could add some explanation on what the the tree diagram is trying to show. Me being from statistics backgound understood it, but my friend from a non statistic background did not get it so had to explain it by running an actual scenario using the data file from the source you mentioned

    (0) 
    1. Boris Bader Post author

      Hi Bimal,

      as you wish a small comment about the decision tree diagramm. The decision tree is a special tool for classification in DM systems. I used the R CNR Tree Methode in the titanic szenario. The Generation is based on the top-down principle. The Starting point (the root) contains all records of the training set which is divided with the aid of the rules defined by the variables in two or more sub-nodes (sons / daughters).

      As a measure for determining the best classification tree the Gini Index is used.

      By analysing the solution of the diagramm you get two profiles which had a chance to survive:

      Profil 1: female and part of the first class or a crew member

      Profil 2: a male child and part of the second class or higher.

      The quality of the solution is displayed by a chi-squared four field test, which has in this case, a hit rate of 79%

      Hope this helps you and your partner.

      greetings

      boris

      (0) 

Leave a Reply