Create and manage content extraction jobs easily

To create a new Document Automation Server (DAS) Content Extraction job, go to the Dashboard.

Add New Job

Click the Add New Job button to launch the new job wizard as shown below:

Job Creation Wizard

Follow the steps below to create a new job:

  1. Click the Next button at the bottom of the wizard. This takes you to the Job Definition tab.

    Job Definition tab

    See Job Definition for explanation of the fields in this window. Choose the suitable options for the job.

  2. Click the Next button at the bottom of the wizard. This takes you to the Location Settings tab.

    Location Settings tab

    See Location Settings for explanation of the fields in this window. Choose the suitable options for the job and click Next.

  3. One of the main advantages of DAS Content Extraction is the ability to process pdf files based on the file content. A Zone Definer allows the user to select areas on the pdf page to extract text or barcode values.

    Extract Text or Barcode Values

    Initially there is one variable (%VALUE1%). If there is no variable, click the Add Item button.

    Draw on the image of the PDF on the right-hand side by starting at one corner of the area required, holding down the left mouse button, and moving it to cover the area.

    Capture Area

    If the area is not correct, repeat the above procedure.

    Once the area is selected, click the Camera icon.

    This will display the captured text.

    Captured Text

    Initially the Extraction Log will just show the same text in the Text Extracted and Refiner Returning sections. The Refiner can be used to remove extraneous text, spacing and punctuation.

    Click Done.

    Details of the area will be shown in the Column Settings area.

    Column Settings Area

    The Selected Zone is the area displayed on the PDF viewer.

    The Select option determines how the text is handled:

    Select Option

    For this example, select text in zone.

    Click the Camera icon. You will see the below text:

    Captured Text

    Set up the scheduler. We will use Manual for this test.

    Set up Scheduler

  4. Click Next to skip the Alerts for this example.

  5. Click Next to skip the Schedule for this example.

  6. The File Naming tab allows the definition of a template for the output filename.

    File Naming tab

    The Advanced Settings tab allows the user to provide properties of the output PDF file, OCR settings, and post-processing script execution. For more information, see Advanced Settings.

    Advanced Settings tab

    For this example, leave the default settings, click Next.

  7. The Finish tab will show you a summary of the job you set up:

    Finish tab

  8. To confirm that you are happy with the job, click the Preview button to see what the outputs generated will look like.

    This does not execute the Extract PDF Contents step.

    Preview

  9. Click the Create button to create the new job.

Error codes

Error codeDescription
0Job executed successfully
1Error executing job. See log and support section for more details.
2License related error. See log and support section for more details.
3Server error: Contact our support team.
4Trigger file not present