Blog post

Extract text from scanned PDFs using OCR with Nutrient Document Web Services API on Zapier

This tutorial walks you through how to automatically apply optical character recognition (OCR) to scanned PDF files using the Nutrient Document Web Services API on Zapier. The result is a searchable PDF with selectable text, perfect for digitization and archiving workflows.

Illustration: Extract text from scanned PDFs using OCR with Nutrient Document Web Services API on Zapier

What is Zapier?

Zapier is a no-code automation platform that lets you connect apps like Google Drive, Gmail, and Slack to automate tasks without writing code. You can use it to build workflows triggered by file uploads or new entries, and take actions like converting or extracting data from documents.

What is the Nutrient API?

Nutrient Document Web Services API helps you automate everything from document conversion to redaction, annotation, OCR, and digital signing. With your free account, you get 100 credits to experiment with different tools and integrations.

You’ll be using the OCR action to turn scanned PDFs into searchable documents.

What you’ll need

  • A Zapier account (a pro plan is necessary for multi-step Zaps)

  • A Google Drive account

  • A scanned PDF file stored in Google Drive

  • A Nutrient API key — get one here

Step 1 — Trigger a new file in a Google Drive folder

  1. Select Google Drive as the trigger app.

Select Google Drive as the Zap trigger

  1. Choose the New File in Folder trigger event.

Choose ‘New File in Folder’ as the trigger

  1. Connect your Google Drive account.

  2. Select the drive and input folder (e.g. ocr-scanned-files).

Folder selected for scanned PDFs

  1. Upload a test PDF file and confirm Zapier pulls it in.

Zapier successfully pulls in PDF file from Drive

Step 2 — Apply OCR to scanned PDF action

  1. Choose Nutrient Document Web Services API as the action app.

Nutrient API selected as the action

  1. Choose the Extract Text With OCR action.

OCR action selected in Nutrient API

  1. Connect your Nutrient account by pasting your API key.

Paste API key to connect to Nutrient

You can find your key in the Nutrient dashboard.

Nutrient dashboard showing where to find your API key

  1. Fill out the fields:

    • PDF File URL — Use the file URL from step 1.

    • Output File Name — Optional (e.g. invoice [OCR Processed].pdf).

Field mapping for OCR input and output settings

Your file must be publicly accessible or uploaded through Google Drive for Zapier to access it.

  1. Test the step and confirm the OCR-processed file is returned.

OCR result returns searchable PDF

Step 3 — Upload the searchable PDF to Google Drive

  1. Add Google Drive again as the action app.

Upload PDF to Drive step added

  1. Choose the Upload File action.

  2. Select the drive and a destination folder like ocr-results.

  3. Map the OCR output from the previous step.

Map output PDF to upload destination

  1. Test the step to confirm the processed file is uploaded correctly.

Test showing OCR-processed PDF uploaded to Drive

Conclusion

You’ve just built an automated OCR pipeline using Zapier and Nutrient API. This lets you convert scanned PDFs into searchable text documents — perfect for digitizing archives, extracting data, and automating back-office tasks.

Looking to expand? Try integrating this with PDF conversion or form creation in your document automation workflow.

Author
Hulya Masharipov
Hulya Masharipov Technical Writer

Hulya is a frontend web developer and technical writer at Nutrient who enjoys creating responsive, scalable, and maintainable web experiences. She’s passionate about open source, web accessibility, cybersecurity privacy, and blockchain.

Free trial Ready to get started?
Free trial