PDF Converter Services 7.1 — Optical Character Recognition, EML and MSG Overhaul
We are happy to announce version 7.1 of our popular Muhimbi PDF Converter Services and OCR and PDF/A Archiving PDF Converter Services. The main new features are support for OCR (Optical Character Recognition) to convert scanned documents into fully searchable and indexable PDF files, and a completely overhauled converter for the EML (email) format that should really benefit those organisations that don’t use MS-Outlook’s MSG format to store email.
A quick introduction for those not familiar with the product: The Muhimbi PDF Converter Services is an ‘on premises’ server based SDK that allows software developers to convert typical Office files to PDF format using a robust, scalable but friendly Web Services interface from Java, .NET, Ruby & PHP based solutions. It supports a large number of file types including MS-Office and ODF file formats as well as HTML, MSG (email), EML, AutoCAD and Image based files and is used by some of the largest organisations in the world for mission critical document conversions. In addition to converting documents the product ships with a sophisticated watermarking engine, PDF Splitting and Merging facilities, an OCR facility and the ability to secure PDF files. A separate SharePoint specific version is available as well.
Scanned Document with OCRed text selected
In addition to the changes listed above, some of the main changes and additions in the new version are as follows:
-
1901 CADFixCAD Conversion - AccessViolationException
-
1931 CADImprovementCAD Converter does not resolve externally referenced files
-
1850 CADImprovementAdd support for AutoCAD 2013
-
1916 ConversionFixTIFF to PDF Conversion uses dimensions of first page for all pages
-
1853 ConversionFixPost processing PDF generated from TIF as ‘Screen Optimised’ scrambles PDF
-
676 ConversionImprovementExcel Conversion - Add support for PDF/A
-
1930 Cross-conversionFixFolder with Temp files cannot be deleted when converting DOC to HTML for some locales / regions
-
1879 EMLNewImplement conversion of RFC2045 / RFC5322 based EML files
-
1965 HTMLFixHTML Converter hangs on 0.5 page margin
-
1920 HTMLFixNot all URLs are recognised by HTML Converter
-
1827 HTMLFix HTML to PDF Conversion for some non-Roman languages lose characters
-
1840 HTMLFixLast line is truncated when converting HTML to PDF
-
1953 HTMLImprovementMixed fonts in same sentence are vertically offset when converting HTML to PDF
-
1940 HTMLImprovementHTML Conversion doesn’t convert unencoded quotes
-
1884 HTMLImprovementAdd configurable delay to HTML to PDF conversion for pages heavy on JavaScript / DHTML (e.g. pages containing Google Maps)
-
2009 InfoPathImprovementFix InfoPath forms colour being lost on IE10 systems
-
1939 InfoPathImprovementInfoPath does not export to PDF well on systems with IE10
-
2010 MergingFixSystem.NullReferenceException when saving merged file
-
2012 MergingFixInternal hyperlinks are broken when merging documents
-
1990 MergingFixUnexpected token DictionaryEnd while merging
-
1982 MergingFixSystem.IndexOutOfRangeException: Index was outside the bounds of the array. while merging PDF
-
1984 MergingFixBookmark targets bottom of page
-
1968 MergingFixNullreference error in PdfLoadedFormFieldCollection.GetFieldType while merging
-
1978 MergingFixError in ‘PdfLoadedPageCollection.GetPage’ while merging file
-
1967 MergingFixBlank pages while merging
-
1943 MergingFixFatal Error at 9670 while merging
-
1935 MergingFixMerged file is empty when merging large bitmapped PDFs
-
1895 MergingFixFatal Error when merging
-
1892 MergingFixSystem.NullReferenceException when merging
-
2007 MSGFixMSG - Unexpected line break using plain text conversion
-
2014 MSGFixMSG - Unicode / character encoding problem in HTML email
-
2006 MSGFixMSG - Hyperlink breaks during conversion
-
1958 MSGFixMSG - System.Exception: compressed-RTF CRC32 failed
-
1959 MSGFixMSG/EML Converter - Last line is missing from some converted emails
-
1925 MSGFixMSG to PDF - Plain text email carriage return handling is incorrect
-
1913 MSGFixMSG to PDF - RTF HTML MSG - incorrectly converted accents / diacritics
-
1914 MSGFixMSG to PDF - RTF HTML MSG - RTL languages not converted in correct order
-
1904 MSGFixMSG to PDF - Sometimes Attachment is not processed
-
1911 MSGFixMSG to PDF - Possible regression on in-line images
-
1912 MSGFixMSG to PDF - RTF HTML MSG - Azerbaijani, Maltese - some unicode characters not converted, left as \uXXXX
-
1899 MSGFixMSG to PDF - German special characters are sometimes not properly converted
-
1882 MSGFixMSG to PDF - RTF email is missing portion of first line in body text
-
1885 MSGFixMSG to PDF - Handle and Memory leak when converting signed MSG files
-
1862 MSGFixMSG to PDF - Incorrect font
-
1863 MSGFixMSG to PDF - Numbered list items not rendered1601MSGImprovementMSG to PDF - Improve line spacing in HTML to PDF Conversion
-
1660 MSGImprovementMSG to PDF - Test / Implement remaining languages
-
1917 MSGImprovementMSG to PDF - RTF HTML MSG - some languages causing small fonts
-
1903 MSGImprovementMSG to PDF - Implement Best Body Algorithm from MS-OXBBODY specification
-
1881 MSGImprovementMSG to PDF - Text opaque signed MIME messages lose formatting
-
2015 MSGNewMSG to PDF - Include email address in ‘To’ field
-
995 OCRNewOCR - Add support for OCR of PDF data to allow searchable PDFs
-
1985 OtherFixCannot set PDF Creator / Processor meta data for some files
-
1972 OtherFixLoading a PDF 1.7 document into a PDFDocument resets it to PDF 1.5
-
1952 OtherFixCertain PDFs do not permit viewerpreferences to be read1906OtherFixOccasional Access Denied in Task Monitor on Win2K12 / InfoPath 2015
-
1799 OtherImprovementUpgrade to .net 3.5
-
2061 ProFixConverting between PDF Versions on a locale that uses ‘,’ as a decimal separator sets the PDF Version to 1.1
-
1945 ProFixPDF/A conversion - The DateTime represented by the string is not supported in calendar System.Globalization.GregorianCalendar.
-
1922 ProFixRe-processing existing PDF/A files for PDF/A output fails
-
1909 ProFixPDF/A Conversion fails when certain characters occur in the PDF Title
-
1888 ProFixImprove reliability of PDF/A2b conversions
-
1849 ProFixLinearization in combination with PDF/A fails
-
1979 ProImprovementAlways post process for PDFA when _outputFormatSpecificSettings.PostProcessFile == true
-
1843 ProImprovementAllow transparent content in PDF/A2b documents
-
1974 SecurityFix When security is removed from PDF files its contents still shows as encrypted
For more information check out the following resources:
As always, feel free to contact us using Twitter, our Blog, regular email or subscribe to our newsletter.
Download your free trial here (37MB). .