JavaScript PDF Extraction Library
Extract content and data from PDF documents using JavaScript. PSPDFKit for Web offers developers a way to quickly parse and read PDF content, retrieve forms or annotations, extract digital signatures, and more — in any web app.
Nutrient SDKs are trusted by world’s most popular applications, including Autodesk, Disney, UBS, Dropbox, IBM, and Lufthansa. Nutrient’s powerful solutions enable seamless integration of PDF editing features into your applications, delivering reliability and efficiency to world-class enterprises.
Key Capabilities
-
Parse and read — Text, forms, signatures, annotations, and more
-
Extract text — From each page for processing
-
Form fields — Extract form information and values
-
Annotations — Retrieve all annotations as JSON or XFDF
-
Digital signatures — Get all validation info present in the PDF
-
Client-side — Extract directly in the browser (no server needed)
-
Extendable — Add a search UI, annotation, forms, and more
Guides for Extraction
Extract Selected Text
How to retrieve selected text and text located within a selection range
Text Extraction
How to extract the full text content from a PDF page or document
Parse Content
How to parse page info, form fields and values, annotations, and more
Read Text
How to read, extract, and process text in a PDF
Page Extraction
How to extract PDF pages and insert them into a new document