Effortlessly convert files to PDF with ToPDF

This converts any file type to PDF, the searchability status of the output PDF depends on the source document, for example, MS Office Input file.

This step uses a virtual printer to print files to a PDF file instead of a physical printer. Due to the printer and Microsoft office automation, there are a lot of technical and security limitations. Thus, we have introduced different methods of converting files to PDF so that our users can have a robust solution to their requirements.

We have provided a configuration file to allow users to switch between these various methods.

Overview of the Different Conversions Methods

Low-Code Solutions use three different methods to convert office files to PDF files, these methods are explained in the section below.

Ways to Convert Office Files to PDF

MS Office Native Conversion

This step uses the “Save as PDF” option in MS Office to save the PDF file directly. This requires Microsoft Office 2007 or newer. Microsoft Office 2007 requires the free “Save as PDF or XPS” add-in for Office 2007 to be installed. We recommend you use the 2010 and new versions.

If you are not interested in changing properties like Image Compression, Image Downsizing, Font embedding and you are ok with the default PDF/A output from your version of office, we recommend using this approach.

MS Office Direct Print

This method uses the BCL ‘easyPDF SDK x’ printer to print out PDF files directly from Microsoft Office. This is the recommended option for producing PDF/A files and PDF files with font embedding.

MS Office Extended Print

In this method, Office automation is used to create an intermediate XPS file, which is then printed with automatic hyperlink extraction. This approach has some technical differences to the first method that allows the users to use the software in a different way. This requires Microsoft Office 2007 or newer. Microsoft Office 2007 requires the free “Save as PDF or XPS” add-in for Office 2007 to be installed.

If you are performing conversions in server environments, for example, via IIS, Windows Services, Session 0 and ASP, this method does not require an interactive user to be logged on for server conversions to succeed, thus we recommend this setting for server environments.

Comparison of the features provided by the different methods

The table below shows you the features available in the different methods.

Property Direct Print Extended Print Native Conversion
Bookmark depth
Convert Hyperlinks
Convert Bookmarks (Word)
IncludeDocumentMarkups
(Track Changes Word)
Paper Orientation
Paper Size
Margin
PDFA1b Depends on the version of Microsoft Office
PDFX1a Depends on the version of Microsoft Office
PDFX3 Depends on the version of Microsoft Office
Image Compression
Image Downsizing
Font Embedding
Print All Sheets(Excel)
Include Document Properties
Fit to Page (Excel)
MSG Files
Frame Slides (PowerPoint)
Print Color Type (PowerPoint)
Output Type (PowerPoint)
Handout Order (PowerPoint)
Print Graphics (Pub)

In-depth Look at the Configuration file

The configuration file can be found in:

“<Autobahn DX Installation directory>\distribution\pj\bin\topdf.exe.config”

Below are the contents of the configuration file, to use a method for a file type, just write the extension in the value property of the conversion Method as shown below.

Information

Do not write an extension in more than one method.

<!-- Extension Mapping -->

<add key="AutoExtension" value="" />

<add key="AutoExtensionEx" value=".jpeg.jpg" />

<add key="AutoExtensionOpenNative" value="" />

<add key="AutoExtensionNative" value="" />

<add key="WordExtension" value="" />

<add key="WordExtensionEx" value=".docx.doc.rtf" />

<add key="ExcelExtension" value="" />

<add key="ExcelExtensionEx" value=".xls.xlsx" />

<add key="GenericExtension" value=".txt.dwg" />

<add key="PowerPointExtension" value="" />

<add key="PowerPointExtensionEx" value=".ppt.pptx" />

<add key="VisioExtension" value="" />

<add key="VisioExtensionEx" value=".vsd" />

<add key="IEExtension" value=".xml" /\>

<add key="PublisherExtension" value="" />

<add key="PublisherExtensionEx" value=".pub.puz" />

<add key="IEExtendedExtension" value=".mht" />

<add key="HTMLExtension" value=".htm.html.eml" />

<add key="OutlookExtension" value=".msg" />

<add key="OpenOfficeExtension" value=".odt.swx.wpd.ods.sxc.odp.sxi.odg.sxd" />

<add key="ExcludedExtensions" value=".zip.exe.pps.ps.chm" />

<add key="PDFExtension" value=".pdf" />

Mappings Between Configuration File and the Conversion Methods

The table below maps the configuration file with the Conversion Methods discussed earlier.

Information

The Configuration options ending with “Ex” work with the “NativeOfficePDF” config option to select a suitable conversion method.

| Configuration | Conversion Method | File Types | Comment | | –––––––––––––––––––––––––––– | —————– | ———————————————————— | –––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––– | — | | AutoExtension | Direct Print | All file types | This option works with any file extension that has an application that is compatible with the BCL printer | | AutoExtensionEx | Extended Print | All file types | This option works with any file extension that has an application that is compatible with the BCL printer | | AutoExtensionNative | Native Conversion | MS Office files | | | OpenOfficeExtension | Direct Print | Open Office files | This option works with all files that can be opened in Open Office. | | AutoExtensionOpenNative | Native Conversion | Open Office files | This option works with all files that can be opened in Open Office. | | WordExtension | Direct Print | .doc, .docx, .rtf… (All files that can be opened in MS Word) | | | WordExtensionEx
NativeOfficePDF=true | Native Conversion | .doc, .docx, .rtf… (All files that can be opened in MS Word) | | | WordExtensionEx
NativeOfficePDF=false | Extended Print | .doc, .docx, .rtf… (All files that can be opened in MS Word) | | | ExcelExtension | Direct Print | .xls, .csv, .xlsx… (All files that can be opened in Excel) | | | ExcelExtensionEx
NativeOfficePDF=true | Native Conversion | .xls, .csv, .xlsx… (All files that can be opened in Excel) | | | ExcelExtensionEx
NativeOfficePDF=false | Extended Print | .xls, .csv, .xlsx… (All files that can be opened in Excel) | | | PowerPointExtension | Direct Print | .ppt, .pptx (All Power Point files) | | | PowerPointExtensionEx
NativeOfficePDF=true | Native Conversion | .ppt, .pptx (All Power Point files) | | | PowerPointExtensionEx
NativeOfficePDF=false | Extended Print | .ppt, .pptx (All Power Point files) | | | VisioExtension | Direct Print | .vsd | | | VisioExtensionEx
NativeOfficePDF=true | Native Conversion | .vsd | | | VisioExtensionEx
NativeOfficePDF=false | Extended Print | .vsd | | | IEExtension | Direct Print | .html, .xml, .mht. All IE files. | | | IEExtendedExtension | Extended Print | .html, .xml, .mht. All IE files. | | | HTMLExtension | Direct Print | .htm, .html, eml | | | PublisherExtension | Direct Print | .pub, .puz | | | PublisherExtensionEx
NativeOfficePDF=true | Native Conversion | .pub, .puz | | | PublisherExtensionEx
NativeOfficePDF=false | Extended Print | .pub, .puz | | | OutlookExtension | Direct Print | .msg files | | | PDFExtension | | PDF files | We usually convert PDF attachments if you select the option | | | GenericExtension | Direct Print | .txt, .dwg | The GenericExtension print job is normally only used as a last resort, when a file type is totally unrecognized by the ToPDF component. This defaults to use Windows Notepad for output. | | ExcludedExtensions | | others | Skips all the extensions present here. |