PDF Converter for SharePoint 7.1 — Optical Character Recognition, EML and MSG Overhaul
The version of the PDF Converter for SharePoint released earlier this year went down a storm, as it was one of the first major third-party products to ship with SharePoint 2013 and Nintex Workflow 2013 support. Today we are happy to announce yet another release of this popular product - version 7.1 - that benefits users of all supported SharePoint versions: SharePoint 2007, 2010 and 2013.
The main new features are support for OCR (Optical Character Recognition), which converts scanned documents into fully searchable and indexable PDF files, and a completely overhauled converter for the EML (email) format, which should really benefit users with mail-enabled document libraries.
For those not familiar with the product, the PDF Converter for SharePoint is a lightweight solution that allows end-users to watermark, merge, split, secure, OCR and convert common document types - including InfoPath, AutoCAD, MSG (email), MS-Office, HTML and images - to PDF as well as other formats from within SharePoint using a friendly user interface, workflows or a web service call, without the need to install any client-side software or Adobe Acrobat. It integrates at a deep level with SharePoint and leverages facilities such as the Audit log, Nintex Workflow, localisation, security and tracing. It runs on SharePoint 2007, 2010 & 2013 and is available in English, German, Dutch, French, Traditional Chinese and Japanese. For detailed information check out the product page.
Scanned Document with OCRed text selected
In addition to the changes listed above, some of the main changes and additions in the new version are as follows:
- 1901 CAD Fix: CAD Conversion - AccessViolationException
- 1931 CAD Improvement: CAD Converter does not resolve externally referenced files
- 1850 CAD Improvement: Add support for AutoCAD 2013
- 1916 Conversion Fix: TIFF to PDF Conversion uses dimensions of first page for all pages
- 1853 Conversion Fix: Post processing PDF generated from TIF as ‘Screen Optimised’ scrambles PDF
- 676 Conversion Improvement: Excel Conversion - Add support for PDF/A
- 1930 Cross-conversion Fix: Folder with Temp files cannot be deleted when converting DOC to HTML for some locales / regions
- 1879 EML New: Implement conversion of RFC2045 / RFC5322 based EML files
- 1965 HTML Fix: HTML Converter hangs on 0.5 page margin
- 1920 HTML Fix: Not all URLs are recognised by HTML Converter
- 1894 HTML Fix: URL is not decoded properly when validating security settings
- 1827 HTML Fix: HTML to PDF Conversion for some non-Roman languages loses characters
- 1840 HTML Fix: Last line is truncated when converting HTML to PDF
- 1953 HTML Improvement: Mixed fonts in same sentence are vertically offset when converting HTML to PDF
- 1940 HTML Improvement: HTML Conversion doesn’t convert unencoded quotes
- 1884 HTML Improvement: Add configurable delay to HTML to PDF conversion for pages heavy on JavaScript / DHTML (e.g. pages containing Google Maps)
- 2009 InfoPath Improvement: Fix InfoPath forms colour being lost on IE10 systems
- 1939 InfoPath Improvement: InfoPath does not export to PDF well on systems with IE10
- 2010 Merging Fix: System.NullReferenceException when saving merged file
- 2012 Merging Fix: Internal hyperlinks are broken when merging documents
- 1990 Merging Fix: Unexpected token DictionaryEnd while merging
- 1982 Merging Fix: System.IndexOutOfRangeException: Index was outside the bounds of the array. while merging PDF
- 1984 Merging Fix: Bookmark targets bottom of page
- 1968 Merging Fix: Nullreference error in PdfLoadedFormFieldCollection.GetFieldType while merging
- 1978 Merging Fix: Error in ‘PdfLoadedPageCollection.GetPage’ while merging file
- 1967 Merging Fix: Blank pages while merging
- 1943 Merging Fix: Fatal Error at 9670 while merging
- 1935 Merging Fix: Merged file is empty when merging large bitmapped PDFs
- 1895 Merging Fix: Fatal Error when merging
- 1892 Merging Fix: System.NullReferenceException when merging
- 1957 Meta-data Fix: Icon of destination file is incorrect if source file was created using the Content and structure page.
- 1944 Meta-data Fix: Content type is not copied when it contains a user/group field that is populated
- 1946 Meta-data Fix: Content Type ID is copied by field description, not internal field name
- 2007 MSG Fix: MSG - Unexpected line break using plain text conversion
- 2014 MSG Fix: MSG - Unicode / character encoding problem in HTML email
- 2006 MSG Fix: MSG - Hyperlink breaks during conversion
- 1958 MSG Fix: MSG - System.Exception: compressed-RTF CRC32 failed
- 1959 MSG Fix: MSG/EML Converter - Last line is missing from some converted emails
- 1925 MSG Fix: MSG to PDF - Plain text email carriage return handling is incorrect
- 1913 MSG Fix: MSG to PDF - RTF HTML MSG - incorrectly converted accents / diacritics
- 1914 MSG Fix: MSG to PDF - RTF HTML MSG - RTL languages not converted in correct order
- 1904 MSG Fix: MSG to PDF - Sometimes Attachment is not processed
- 1911 MSG Fix: MSG to PDF - Possible regression on in-line images
- 1912 MSG Fix: MSG to PDF - RTF HTML MSG - Azerbaijani, Maltese - some Unicode characters not converted, left as \uXXXX
- 1899 MSG Fix: MSG to PDF - German special characters are sometimes not properly converted
- 1882 MSG Fix: MSG to PDF - RTF email is missing portion of first line in body text
- 1885 MSG Fix: MSG to PDF - Handle and Memory leak when converting signed MSG files
- 1862 MSG Fix: MSG to PDF - Incorrect font
- 1863 MSG Fix: MSG to PDF - Numbered list items not rendered
- 1601 MSG Improvement: MSG to PDF - Improve line spacing in HTML to PDF Conversion
- 1660 MSG Improvement: MSG to PDF - Test / Implement remaining languages
- 1917 MSG Improvement: MSG to PDF - RTF HTML MSG - some languages causing small fonts
- 1903 MSG Improvement: MSG to PDF - Implement Best Body Algorithm from MS-OXBBODY specification
- 1881 MSG Improvement: MSG to PDF - Text opaque signed MIME messages lose formatting
- 2015 MSG New: MSG to PDF - Include email address in ‘To’ field
- 995 OCR New: OCR - Add support for OCR of PDF data to allow searchable PDFs
- 1985 Other Fix: Cannot set PDF Creator / Processor meta data for some files
- 1972 Other Fix: Loading a PDF 1.7 document into a PDFDocument resets it to PDF 1.5
- 1952 Other Fix: Certain PDFs do not permit viewerpreferences to be read
- 1906 Other Fix: Occasional Access Denied in Task Monitor on Win2K12 / InfoPath 20151
- 799 Other Improvement: Upgrade to .NET 3.5
- 2061 Pro Fix: Converting between PDF Versions on a locale that uses ‘,’ as a decimal separator sets the PDF Version to 1.1
- 1945 Pro Fix: PDF/A conversion - The DateTime represented by the string is not supported in calendar System.Globalization.GregorianCalendar.
- 1922 Pro Fix: Re-processing existing PDF/A files for PDF/A output fails
- 1909 Pro Fix: PDF/A Conversion fails when certain characters occur in the PDF Title
- 1888 Pro Fix: Improve reliability of PDF/A2b conversions
- 1849 Pro Fix: Linearization in combination with PDF/A fails
- 1979 Pro Improvement: Always post process for PDF/A when _outputFormatSpecificSettings.PostProcessFile == true
- 1843 Pro Improvement: Allow transparent content in PDF/A2b documents
- 1974 Security Fix: When security is removed from PDF files its contents still shows as encrypted
- 1966 UI Fix: SharePoint 2013 ribbon icon is broken for Site Collection scoped Feature
- 1937 UI Fix: Users with only ‘read’ permission on a doclib, but not on the main site, cannot open the conversion screen
- 1923 Watermarking Fix: Random ArgumentException when applying XML based watermarks
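As an aside for developers who run into the same class of problem as issue 2061 in their own code: a version string such as "1.7" is easy to mangle when it is parsed or formatted as a floating-point number using the machine's regional settings. The short Python sketch below shows one way to avoid that by treating the version as two integers. It is purely illustrative and is not taken from the PDF Converter itself; the function names `parse_pdf_version` and `format_pdf_version` are hypothetical.

```python
from typing import Tuple


def parse_pdf_version(text: str) -> Tuple[int, int]:
    """Parse a version such as '1.7' into (1, 7).

    The string is split on a literal '.', so the result does not depend
    on the decimal separator configured for the current locale.
    """
    major, sep, minor = text.strip().partition(".")
    valid = (
        sep == "."
        and major.isascii() and major.isdigit()
        and minor.isascii() and minor.isdigit()
    )
    if not valid:
        raise ValueError(f"not a PDF version: {text!r}")
    return int(major), int(minor)


def format_pdf_version(version: Tuple[int, int]) -> str:
    """Format (1, 7) as '1.7', always using '.' as the separator."""
    major, minor = version
    return f"{major}.{minor}"


if __name__ == "__main__":
    version = parse_pdf_version("1.7")
    print(version, format_pdf_version(version))
```

Rejecting anything that is not strictly digits, a dot, digits (for example "1,7") makes a bad input fail loudly rather than silently falling back to some other version.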
As always, feel free to contact us via Twitter, our blog or regular email, or subscribe to our newsletter.
Download your free trial here (43MB).