Skip to main content

Image Extract Text using n8n action

PDF4me Image Extract Text extracts text content from images using OCR technology through n8n automation workflows. Process images via n8n triggers, binary data, base64 strings, or public URLs to automatically recognize and extract text with multi-language support, high-accuracy character recognition, layout preservation, and structured text output for document digitization and data extraction. This solution is ideal for scanned document processing, data entry automation, business card OCR, receipt scanning, text extraction from photos, and automated OCR workflows that require accurate text recognition with multi-language support and seamless integration.

Setup

Add the PDF4me "Image Extract Text" node to your n8n workflow and configure the required parameters. For initial setup instructions, see our n8n Integration Guide.

Prerequisites:

  • PDF4me API credentials
  • n8n workflow access

Configuration:

  1. Add PDF4me node to workflow
  2. Select "Image Extract Text" action
  3. Configure input parameters (see below)
Image Extract Text

Parameters

Complete list of parameters for the Image Extract Text action. Configure these parameters to control text extraction from images.

Important: Parameters marked with an asterisk (***) are required and must be provided for the action to function correctly.

ParameterTypeDescriptionExample
Input Data TypeStringYesPDF4me image input format selection
Choose the format of your source image data input.
PDF4me supports multiple input types with different field requirements
1) Binary DataYes (One Required)1) Binary Data - Use "Input Binary Field" to reference image file from previous node
2) Base64 StringYes (One Required)2) Base64 String - Use "Base64 Image Content" field for encoded image data
3) URLYes (One Required)3) URL - Use "Image URL" field to provide a public/open permission URL to the image file to process
Input Binary FieldBinaryYes (If Binary Data)PDF4me binary image file input - Reference source image file (.jpg, .png, .gif, .bmp) from previous n8n node or file upload. PDF4me processes binary image files with automatic format detection and validation. Required when Input Data Type is "Binary Data"
Base64 Image ContentStringYes (If Base64 String)PDF4me base64 encoded image input - Provide source image data (.jpg, .png, .gif, .bmp) as base64 encoded string for secure transmission. PDF4me automatically decodes and processes the image content. Required when Input Data Type is "Base64 String"
Image URLStringYes (If URL)PDF4me public image URL input - Provide a public/open permission URL to the source image file (.jpg, .png, .gif, .bmp) to be processed for text extraction. PDF4me downloads and processes the file from the provided URL. Required when Input Data Type is "URL"
Output File NameStringYesPDF4me output filename configuration - Specify the name for the generated text extraction file with proper extension (.json, .txt). This is an input parameter that configures what the output file will be named. PDF4me ensures unique naming and validates file format compliance
AsyncBooleanNoPDF4me asynchronous processing - Enable asynchronous processing for large images or when processing time is not critical. When enabled, the operation runs in the background and returns immediately
Binary Data Output NameStringYesPDF4me output variable configuration - Define the variable name for accessing the generated text extraction data in your n8n workflow. This data can be used as input data in subsequent actions within your workflow

Supported Image Formats

FormatDescriptionFile ExtensionsMax SizeBest Use Cases
JPEGJoint Photographic Experts Group with lossy compression optimized for photographs.jpg, .jpeg50MBScanned documents, photographs with text, web content
PNGPortable Network Graphics with lossless compression and transparency support.png50MBScreenshots, graphics with text, high-quality documents
GIFGraphics Interchange Format with animation support and limited color palette.gif50MBAnimated text, simple graphics with text content
BMPBitmap image format with uncompressed pixel data.bmp50MBHigh-quality scanned documents, professional graphics

Output

Output Parameters

ParameterTypeDescriptionExample
successBooleanPDF4me text extraction status indicator - Boolean flag indicating the success or failure of the text extraction process. PDF4me returns true for successful operations and false for any errors, enabling robust error handling in automated workflowstrue
messageStringPDF4me text extraction status message - Human-readable status message providing details about the text extraction process result. Includes success confirmation or error details for troubleshootingImage text extracted successfully
fileNameStringPDF4me generated text extraction filename - The complete filename of the successfully generated text extraction file with proper extension. PDF4me ensures unique naming and validates file format compliance for seamless integration with downstream processesimage_text_extract.json
mimeTypeStringPDF4me output MIME type - MIME type of the generated text extraction file (e.g., "application/json", "text/plain"). Useful for content type validation and proper file handling in web applicationsapplication/json
fileSizeNumberPDF4me text extraction file size in bytes - The exact size of the generated text extraction file in bytes, provided by PDF4me for storage planning and file transfer monitoring0

N8N Action Response

The PDF4me Image Extract Text API returns a response that can be viewed in multiple formats. Choose the view that best fits your needs:

JSON Response Format

The raw JSON response from the API:

[
{
"success": true,
"message": "Image text extracted successfully",
"fileName": "image_text_extract.json",
"mimeType": "application/json",
"fileSize": 0
}
]

Use Cases

OCR Text Recognition and Digitization

  • Extract text from scanned documents, receipts, and forms for digital processing
  • Convert handwritten notes and printed materials to searchable text format
  • Process business cards and contact information for automated data entry

Image-Based Content Analysis

  • Extract text from screenshots and images for content analysis and indexing
  • Process social media images with text overlays for sentiment analysis
  • Analyze product packaging and labels for inventory management systems

Accessibility and Translation Services

  • Extract text from images to create accessible content for visually impaired users
  • Process multilingual documents for automated translation workflows
  • Convert image-based text to audio format for accessibility applications

Research and Data Mining

  • Extract text from research papers, articles, and academic documents
  • Process historical documents and archives for digital research projects
  • Analyze image-based surveys and feedback forms for data collection

Get Help