Skip to Content
Author's profile photo Chandrashekhar Mahajan

Data Geek Challenge – Aadhaar Data Analysis Using SAP Lumira

From the time I saw Data Geek challenge,  I was thinking to participate into it with my data story and today is the time I am going to share it with you.


I choose Aadhaar Dataset which is available at Aadhaar public data portal at https://data.uidai.gov.in/uiddatacatalog/dataCatalogHome.do


Reason I selected this dataset is I was looking for data which will let me do geographical analysis 🙂 as well as I want to get answers to below few questions such as, 

  • Which states are generating more number of aadhaar cards?
  • Which Enrolment agencies are doing better?
  • What is the number of residents providing email, mobile number?
  • Top districts where enrolment numbers are high?
  • Aadhaar generated by States and gender?
  • Top districts in Maharashtra state where enrolments is high? and etc.etc…


To get answers, First I downloaded “Enrolment Processed in Detail” data set from https://data.uidai.gov.in/uiddatacatalog/getDatsetInfo.do?dataset=UIDAI-ENR-DETAIL

Here I provided date range from 1st August till 15th August and selected dataset with maximum number of records which I got for 6th August with around 1 million records!


After importing this data, I selected option to create geography hierarchy where I found out that at state and district level, names resolution did not found proper propositions and hence I cleaned data at least for Maharashtra State.  After data cleaning activity, I re imported CSV file as below.

/wp-content/uploads/2013/08/lumira1_264498.jpg

With below screen, it is clear that number of residents providing mobile number is quite high than those providing email id.

/wp-content/uploads/2013/08/lumira2_264499.jpg

/wp-content/uploads/2013/08/lumira3_264503.jpg

Maharashtra state is having maximum number of residents providing email during aadhaar registration.

/wp-content/uploads/2013/08/lumira4_264504.jpg

But with the combination of enrolment agency, registrar and state, Karnataka is having maximum number of residents providing email.

/wp-content/uploads/2013/08/lumira5_264505.jpg

/wp-content/uploads/2013/08/lumira6_264509.jpg

With below screenshot, we can easily identify top districts where enrolment number is high. Warangal is topmost district followed by Pune. for this view, Lumira took lots of time. I guess this is because of big number of rows.

/wp-content/uploads/2013/08/lumira7_264510.jpg

/wp-content/uploads/2013/08/lumira8_264511.jpg

/wp-content/uploads/2013/08/lumira9_264518.jpg

/wp-content/uploads/2013/08/lumira10_264519.jpg

Within Maharashtra state, Pune is at the top of the list!

/wp-content/uploads/2013/08/lumira11_264520.jpg

Now if we go one level down i.e. at district level, In the Pune district, Pune City sub-division is having maximum number of residents which provides email. again as Pune city is having maximum number of residents, I am not surprised by this outcome.

/wp-content/uploads/2013/08/lumira12_264524.jpg

/wp-content/uploads/2013/08/lumira13_264525.jpg

/wp-content/uploads/2013/08/lumira14_264526.jpg

/wp-content/uploads/2013/08/lumira15_264530.jpg

In Maharashtra state, Mumbai is the district where maximum number of residents are providing email.

/wp-content/uploads/2013/08/lumira16_264531.jpg

/wp-content/uploads/2013/08/lumira17_264532.jpg

/wp-content/uploads/2013/08/lumira18_264533.jpg

Please note – This analysis is based on data generated on 6th August 2013.

I also tried my hands on with SAP Predictive Analysis and was curious to know the difference between SAP Lumira and SAP Predictive Analysis. and below is what I can say about it.

SAP Lumira = Prepare + Share

SAP Predictive Analysis = Prepare + Predict + Share

And hence SAP Predictive Analysis = SAP Lumira + Predict 😉

I hope you enjoyed reading this blog. Please feel free to share your comments! Have a great day and Happy Analyzing 🙂 🙂


Assigned Tags

      9 Comments
      You must be Logged on to comment or reply to a post.
      Author's profile photo Jitendra Kansal
      Jitendra Kansal

      SAP Lumira = Prepare + Share

      SAP Predictive Analysis = Prepare + Predict + Share

      And hence SAP Predictive Analysis = SAP Lumira + Predict 😉

      nice !!!

      I feel you could share some points about Aadhar card.

      Rgrds,

      Jitendra

      Author's profile photo Former Member
      Former Member

      Awesome analysis! 🙂 Thank you !

      Author's profile photo Chandrashekhar Mahajan
      Chandrashekhar Mahajan
      Blog Post Author

      Thanks !

      Author's profile photo Former Member
      Former Member

      Hi Chandrasekar,

      Good Day!

      Wonderful Job! Keep posting good things like this.

      I appreciate.

      Have a nice day!

      Regards,

      Hari Suseelan

      Author's profile photo Chandrashekhar Mahajan
      Chandrashekhar Mahajan
      Blog Post Author

      Thanks Hari !

      Author's profile photo Vishnu Pankajakshan
      Vishnu Pankajakshan

      mind blowing doc.

      Regards,

      Vishnu

      Author's profile photo Reshoi R
      Reshoi R

      Well Done Chandrashekhar

      Regards

      Re$h

      Author's profile photo Gurbhej Singh
      Gurbhej Singh

      Mr Mahajan

      you have done a significant analysis , please keep up the good work

      Regards

      Guru

      Author's profile photo Former Member
      Former Member

      Very good information about Aadhaar Data Analysis. I learn new things from your post. Everyday I Learn like this and increase my knowledge. Like this, I also gain my knowledge on www.teachlane.com/ very informative information I get. thanks