Skip to Content

This was done using Data Services 4.0 SP1

Text data processing takes unstructured data and formats it into a table (as an example) to ease reporting.  This example takes e-mails and formats them into a table.

First create a batch job and a data flow. 

Create a source from the flat files in the Local Object library

/wp-content/uploads/2012/12/1fig_170258.png

Then select the unstructured text type

/wp-content/uploads/2012/12/2fig_170259.png

Select the e-mails as a file input (this is the unstructured data)

Then from the transforms tab, select Entity_Extraction > Base_EntityExtraction

/wp-content/uploads/2012/12/3fig_170260.png

Then in the input schema pane we’ll drag Data column into the Text column

/wp-content/uploads/2012/12/4fig_170261.png

In the options tab select ENGLISH as the language

/wp-content/uploads/2012/12/5fig_170262.png

In the output tab place checkboxes next the to the fields below

/wp-content/uploads/2012/12/6fig_170264.png

Then drag FileName from the Schema In to the Schema Out as shown below

/wp-content/uploads/2012/12/6fig_170264.png

Then add a table to the output table type (which you can use to report from)

/wp-content/uploads/2012/12/8fig_170265.png

After executing the batch job, you can view the results of the table

/wp-content/uploads/2012/12/9fig_170266.png

Now you can use this to report on unstructured data.

To report this post you need to login first.

Be the first to leave a comment

You must be Logged on to comment or reply to a post.

Leave a Reply