Skip to Content

XI: Read data from PDF file in Sender Adapter

In my previous weblog (XI: Generate PDF file out of file adapter), I had discussed how to generate a PDF file out of receiver adapter. In this weblog, we will see how to read a PDF file and convert it to XML document, by using a module processor exit bean in the sender file adapter

h5. Design In our example, we will use an open source utility called PDFBox (http://www.pdfbox.org) to extract text information from a PDF file. h5. Configuration The above module parses the PDF file and frames a XML document with the text data in node “TextData”. This module bean will be called in the sender adapter as shown. image

4 Comments
You must be Logged on to comment or reply to a post.
  • Dear Ramamoorthy,

    I like your solution of using module processing. We wrote a own adapter to receive pdf files. For the conversion of PDF file to PDF XML data we prefer to use the Adobe Document Service of the Netweaver Stack. You can use this to convert XML data to a pdf and other way around. I think this is a better solution (SAP only) then using the open source solution of PDFBOX. If you want to known more about this solution please contact me
    robert.eijpe@nl4b.com

    • Hi Robert,
      Where can i find Adobe Document service in NW stack. Please let me know about this in a step-by-step manner. What should be given in Source, Target Data types, Message Types,Interface,Interface Mappings,…..all from IR,ID to Testing.
      Thanks&Regards,
      Ganesh
  • Hi Ramamoorthy and Robert,
    I need to send a PDF file to XI and then XI wud Create a Webservice with respect to that.I wanted to know wether we can send PDF as a Payload to XI and then XI does the Webservice stuff? 

    Robert can u give more inofrmation on extracting text from PDF using Adobe Document service of NetWeaver stack.

    Awaitng Response.
    any kind of guidence wud be Appriciated .

  • Hi Ram,
    What should be given in parameter Value?
    What should i give in IR->Message Type
    Also In my sender Adaper->Module (tab) there is no Module Key. I can see only Module Name and Type in Module Processing,Parameter name and Parameter Value in Module configuration
    Help me please,
    Regards,
    Ganesh