Advanced Export to CSV/XLSX

Advanced Export allows the export of areas on the PDF page to CSV or XLSX files.

For an Advanced Export job, the Select Variables tab looks like this.

The file displayed is the one selected in the Location Settings tab. To display a different file, click on the Open File button.

If there is no Kingfisher Variable, click on the Add Item button.

Select the area on the displayed file containing the information required.

Click on the camera icon to capture the text.

The extracted text contains more than the actual invoice number. This is because the area is sized to cover invoices with slightly different formats.

This is the same area on the invoice that makes up the second page of the example document.

Click Done.

The text needs to be refined.

For this example file, the invoice number has the (regular expression) format [a-z][0-9\-]+.

The first part, [a-z], matches a single alphabetic character at the start of the invoice number. The second part, [0-9\-]+, matches one or more characters, each of which is a digit or a hyphen.
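The behaviour of this pattern can be checked outside the application. The following is a minimal Python sketch, not the product's own implementation: it assumes the pattern is [a-z][0-9\-]+ and that matching ignores case (so an upper-case prefix such as "A" is accepted). The function name and the sample text are hypothetical.

```python
import re

# One letter followed by one or more digits or hyphens.
# re.IGNORECASE is an assumption so that upper-case prefixes also match.
INVOICE_PATTERN = re.compile(r"[a-z][0-9\-]+", re.IGNORECASE)


def extract_invoice_number(zone_text):
    """Return the first substring of zone_text that matches the pattern, or None."""
    match = INVOICE_PATTERN.search(zone_text)
    return match.group(0) if match else None


# Hypothetical captured text; the real zone text depends on the document.
sample = "Invoice No: A1234-56 Date"
print(extract_invoice_number(sample))
```

Because the search returns only the matching substring, the surrounding words captured by the wider area are discarded.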

Add this to the column settings by choosing Select “text in zone” and Where “text matches pattern”, then entering the pattern in the box. Tips on regular expressions are available by clicking on the “?” button.

Check the extracted text using the camera icon in the column settings.

Add another item.

Select an area that covers the Grand Total on both pages.

Click on the camera icon on Column 3 to select the area.

Check that the text contains the value you are after.

Click on Done.

The pattern for this selection is more complicated. The literal text “Grand Total” identifies the beginning of the selection. Next come one or more whitespace characters (such as spaces or tabs), followed by one or more digits, a decimal point and then two digits.
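The description above can be written as a regular expression and tried in Python. This is only a sketch under stated assumptions: the exact pattern entered in the application is not shown here, so Grand Total\s+[0-9]+\.[0-9]{2} is one plausible reading of it, with \s standing for whitespace. The function name and sample text are hypothetical.

```python
import re

# Literal "Grand Total", one or more whitespace characters,
# one or more digits, a decimal point, then exactly two digits.
GRAND_TOTAL_PATTERN = re.compile(r"Grand Total\s+[0-9]+\.[0-9]{2}")


def find_grand_total_text(zone_text):
    """Return the matched 'Grand Total <amount>' text, or None if absent."""
    match = GRAND_TOTAL_PATTERN.search(zone_text)
    return match.group(0) if match else None


# Hypothetical captured text from the selected area.
sample = "Subtotal 100.00 Tax 20.00 Grand Total 120.00"
print(find_grand_total_text(sample))
```

The match still includes the words “Grand Total”, which is why a further refinement step is needed.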

Click on the Column Settings camera icon to see the extracted text.

The next task is to refine this text further by removing everything before the “Grand Total” part.

Choose Select “all text in paragraph after value” with the value “Grand Total”, and Where “text matches pattern” with a pattern for one or more digits optionally followed by a decimal point and two digits.
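The two-stage refinement (keep only the text after “Grand Total”, then match the amount) can be sketched in Python as follows. This illustrates the logic only and is not how the application is implemented; the pattern [0-9]+(?:\.[0-9]{2})? is an assumed rendering of "one or more digits optionally followed by a decimal point and two digits", and the function name is hypothetical.

```python
import re

# One or more digits, optionally followed by a decimal point and two digits.
AMOUNT_PATTERN = re.compile(r"[0-9]+(?:\.[0-9]{2})?")


def amount_after_marker(paragraph, marker="Grand Total"):
    """Return the first amount found after marker, or None if there is none."""
    # partition() splits at the first occurrence of the marker;
    # 'found' is empty when the marker does not appear in the text.
    _, found, rest = paragraph.partition(marker)
    if not found:
        return None
    match = AMOUNT_PATTERN.search(rest)
    return match.group(0) if match else None


# Hypothetical paragraph text.
print(amount_after_marker("Subtotal 100.00 Grand Total 120.00"))
```

Discarding everything before the marker first means that earlier amounts in the same paragraph, such as a subtotal, cannot be matched by mistake.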

Click on the Column Settings camera icon to see the extracted text.

Save the job and run it.

The output file will contain: