Archive
2025
DEVELOPMENT
Python
OCR
Tesseract
Ultimate guide to Python Tesseract
Learn how to use Tesseract OCR with Python for text recognition in images. This comprehensive guide covers installation, image preprocessing, multilingual text recognition, and advanced configuration options.
2024
TUTORIALS
API
OCR
How To
How to OCR PDFs with PSPDFKit API
Unlock the power of OCR for your PDFs with PSPDFKit API. Learn how to automate OCR in Python, PHP, JavaScript, and Java. Enhance your document workflows today!
TUTORIALS
Python
API
OCR
PDF text extraction guide with Python
Discover how to extract text from PDFs in Python using PyPDF and PSPDFKit. A complete tutorial for beginners and advanced users.
2023
TUTORIALS
Linux
Document Engine
OCR
How to OCR PDF files on Linux using OCRmyPDF
Learn how to OCR PDF files on Linux using OCRmyPDF, an open-source tool based on Tesseract, and Nutrient for advanced OCR capabilities. Step-by-step guide included.
TUTORIALS
API
Python
How To
Tesseract
OCR
Tesseract Python: Extract text from images using Tesseract OCR
Learn how to use Tesseract with Python for accurate text extraction from images. This guide covers setting up Tesseract OCR in Python, improving OCR accuracy with image preprocessing, and tuning text recognition for better results. Perfect for extracting text from scanned documents, invoices, and more!
TUTORIALS
JavaScript
OCR
Extract text from PDF files using PDF.js and JavaScript
Step-by-step tutorial on how to extract text from a PDF using PDF.js.
2022
TUTORIALS
Linux
Python
OCR
PDF
How To
Creating a Document Scanner with OCR in Python
How to use the OCR component in PSPDFKit Processor with Python