Using Data Services Text Data Processing with SAP Lumira to Analyze Tweets
Time for some self-introspection – what have I been tweeting about the last 3+ years? At first I suspected that the ASUG hashtag would be the one that I tweeted most. I was wrong.
I first downloaded an archive of my tweets. Then I used the Data Services to read the 140 characters of unstructured data – what hashtags have I tweeted the most about?
You can see on the left I am reading in my tweets, and using the Base Entity Transform from Data Services.
There are several options in the Text Data Processing transform – people, product, person, organization, etc. For now I am just selecting social media.
After running the Data Services batch job, it outputs it to a file which I take to SAP Lumira.
Above is the initial heat map. It’s hard to read so next I filter down to the highest number of tweets.
So you can see that ASUG is not my most tweeted hashtag, as I thought, but it’s SCN, then SAP, then ASUG.
When I look at the grid view, you can see case matters with the hashtags – note #SAPTechEd SAPTechED (wrong) and sapteched.
Finally, what’s text data processing without a tag cloud?
SAP TechEd Las Vegas | October 21–25, 2013 | ASUG Pre-Conference Seminars – BI4.1 hands-on includes SAP BusinessObjects Lumira