PDF Metadata to XML - InterFormNG2

PDF Metadata to XML

The advanced, utilities workflow component, PDF Metadata to XML can help you to extract both metadata as well as key/values for form fields from a PDF file. The PDF must be loaded into the payload of the workflow before running this component. The component returns the result as XML into the payload.

 

The component has only a comment as parameter, which is optional and it has not effect on the component:

    

 

The resulting XML file can look like this:

    NG2PDFMetadata2XML0004

 

The component can e.g. be used as shown in the workflow below:

    NG2PDFMetadata2XML0002

 

In this example these components are used:

Read from file

    

In this component we read PDF files (with extension .pdf) from a directory, that are to be processed in this workflow.

 

PDF Metadata to XML

This exacts the metadata as described above - without any parameters.

 

To filesystem

Here we save the resulting XML file:

    

 

If we would want to refer to the name of the input PDF file, then we can find it as one of the predefined variables for input files.

    • Related Articles

    • PDF/UA (Accessibility)

      Introduction To support accessibility requirements, InterFormNG2 can create PDF documents that are PDF/UA-1 compliant in accordance with ISO standard 14289-1:2014. UA is an abbreviation of Universal Accessibility. In e.g. Germany this is referenced ...
    • XML Node Selection & Referencing

      XPath provides multiple ways to reference and navigate through an XML structure. Learn how to select specific nodes using direct references, index numbers, conditions, and relationships with other nodes. Connecting preceding or following node sets to ...
    • PDF to images XML

      The workflow component, PDF to images XML (of the type, Create document) can be used for converting a single page of a PDF file into an image. A prerequisite is, that the PDF file is loaded into the payload. The component takes a PDF and renders a ...
    • PDF

      InterFormNG2 can work with PDF files in multiple ways - both as input and as output. PDF is one of the file types/payloads, that can be handled by the InterFormNG2 workflow. PDF as input A workflow can e.g. accept PDF files as the input file type. ...
    • XML

      XML is used widely in InterFormNG2 as both input and output payload in the workflow (and of course in the designer). Please notice, that some workflow components are listed as accepting only XML as input, where they actually also accept an IBM i ...