Blog post

Extract text from PDFs with Nutrient Document Web Services API on Zapier

This tutorial walks you through creating a Zap that automatically extracts plain and structured text from PDFs using the Nutrient Document Web Services API. It’s perfect for digitizing paperwork, archiving content, or transforming scanned documents into searchable formats.

Illustration: Extract text from PDFs with Nutrient Document Web Services API on Zapier

What is Zapier?

Zapier is an automation platform that connects your favorite apps and services with no code required. You can build “Zaps” to automate repetitive tasks by setting up triggers and actions between apps like Google Drive, Slack, Gmail, and more.

In this tutorial, you’ll learn how to use Zapier to automatically extract plain and structured text from PDF documents using the Nutrient API.

What is the Nutrient API?

Nutrient Document Web Services API enables powerful document processing with simple API calls. With a free account, you get 100 credits to automate actions like:

  • Text and table extraction

  • OCR on scanned files

  • Conversion between formats

  • Document signing, watermarking, and more

What you’ll need

  • A Zapier account (a pro plan is necessary for multi-step Zaps)

  • A Google Drive account

  • A PDF file uploaded to Google Drive

  • A Nutrient API key — sign up here

Step 1 — Trigger a new file in a Google Drive folder

  1. Select Google Drive as the trigger app.

Google Drive selected as trigger app in Zapier

  1. Choose the New File in Folder trigger event.

Trigger set to ‘New File in Folder’

  1. Connect your Google Drive account.

  2. Set up the trigger:

  • In the Drive field, select your Google Drive.

  • In the Folder field, select or create a folder (e.g. pdf-to-text).

Drive and folder selected for PDF upload monitoring

  1. Test this step with a sample PDF file.

Test result showing PDF found in Google Drive folder

Step 2 — Extract text from PDF action using Nutrient API

  1. Select Nutrient Document Web Services API as your action app.

Nutrient API selected as action app

  1. Choose the Extract Text From PDF action.

Action dropdown showing ‘Extract Text from PDF’ selected

  1. Connect your Nutrient API account using your API key.

Paste API key to connect Nutrient account

You can find your key in the Nutrient dashboard.

Where to find your API key in the dashboard

  1. Configure the action:

    • PDF File URL — Use the output from step 1.

    • Output File Name — Optional — e.g. invoice-text.txt.

Fields filled with PDF URL and output file name

Make sure the file is publicly accessible or uploaded through a supported trigger like Google Drive.

  1. Run a test to ensure the extracted text is returned in structured JSON format.

Test result showing successfully extracted text

Step 3 — Upload the extracted data to Google Drive (optional)

  1. Now, add another step by selecting Google Drive as the app and Upload File as the action event. Connect your Google Drive account if needed.

Google Drive selected as output app for extracted text

  1. Choose your drive and output folder.

  2. In the File field, map the extracted JSON file or raw content.

Map extracted text as file upload to Google Drive

  1. Run a test to confirm the extracted file uploads properly.

Test confirming the extracted text was saved to Google Drive

Conclusion

With this Zap, you’ve automated the process of extracting plain and structured text from PDFs using Nutrient API. This workflow is ideal for indexing content, digitizing business records, or feeding content into CRMs and data pipelines.

You can expand this flow with steps for table extraction, OCR scanning, or automatic uploading to Notion.

Author
Hulya Masharipov
Hulya Masharipov Technical Writer

Hulya is a frontend web developer and technical writer at Nutrient who enjoys creating responsive, scalable, and maintainable web experiences. She’s passionate about open source, web accessibility, cybersecurity privacy, and blockchain.

Free trial Ready to get started?
Free trial