Archive
2024
TUTORIALS
API
OCR
How To
How to OCR PDFs with PSPDFKit API
Unlock the power of OCR for your PDFs with PSPDFKit API. Learn how to automate OCR in Python, PHP, JavaScript, and Java. Enhance your document workflows today!
TUTORIALS
Python
API
OCR
Extract Text from PDF in Python: A Comprehensive Guide Using PyPDF and PSPDFKit API
Learn how to extract text from PDF files in Python using PyPDF and PSPDFKit API. This guide covers everything from basic extraction to advanced techniques for encrypted PDFs.
2023
TUTORIALS
Linux
Document Engine
OCR
How to OCR PDF files on Linux using OCRmyPDF
Learn how to OCR PDF files on Linux using OCRmyPDF, an open source tool based on Tesseract, and Nutrient for advanced OCR capabilities. Step-by-step guide included.
TUTORIALS
API
Python
How To
Tesseract
OCR
Using Tesseract OCR with Python for image text extraction
Master text extraction from images using Tesseract OCR in Python. A practical guide to set up and enhance OCR accuracy with image preprocessing techniques.
TUTORIALS
JavaScript
OCR
Extract text from PDF files using PDF.js and JavaScript
Step-by-step tutorial on how to extract text from a PDF using PDF.js.
2022
TUTORIALS
Linux
Python
OCR
PDF
How To
Creating a Document Scanner with OCR in Python
How to use the OCR component in PSPDFKit Processor with Python