Skip to main content

Extract Table From PDF using n8n action

PDF4me Extract Table From PDF identifies and extracts structured table data from PDF documents through n8n automation workflows. Process PDFs via n8n triggers, binary data, base64 strings, or public URLs to automatically detect tables, preserve cell structure, maintain data relationships, extract headers and values, and output structured data in JSON or Excel format. This solution is ideal for financial report analysis, data digitization, statistical data extraction, invoice processing, form analysis, and automated table extraction workflows that require accurate table recognition with structured output and seamless integration.

Setup

Add the PDF4me "Extract Table From PDF" node to your n8n workflow and configure the required parameters. For initial setup instructions, see our n8n Integration Guide.

Prerequisites:

  • PDF4me API credentials
  • n8n workflow access

Configuration:

  1. Add PDF4me node to workflow
  2. Select "Extract Table From PDF" action
  3. Configure input parameters (see below)
Extract Table From PDF

Parameters

Complete list of parameters for the Extract Table From PDF action. Configure these parameters to control table extraction.

Important: Parameters marked with an asterisk (***) are required and must be provided for the action to function correctly.

ParameterTypeDescriptionExample
Input Data Type***StringPDF Input Format Selection
• Choose the format of your PDF data input
• PDF4me supports multiple input types
• Options: Binary Data, Base64 String, or URL
Binary Data
Input Binary FieldBinaryBinary PDF File Input (Required if Binary Data)
• Reference PDF file from previous n8n node or file upload
• PDF4me processes binary PDF files with automatic format detection
• Required when Input Data Type is "Binary Data"
{{ $binary.data }}
Base64 Document ContentStringBase64 Encoded PDF Input (Required if Base64 String)
• Provide PDF data as base64 encoded string
• PDF4me automatically decodes and processes the PDF content
• Required when Input Data Type is "Base64 String"
UEsDBBQABgAI...
File URLStringPublic PDF URL Input (Required if URL)
• Provide a public/open permission URL to the PDF file
• PDF4me downloads and processes the file from URL
• Required when Input Data Type is "URL"
https://abc.com/report.pdf
Document Name***StringSource PDF Reference
• Specify the name of the source PDF file
• For reference and tracking purposes in table extraction
• Helps with processing tracking
financial_report.pdf

Advanced Options

The following parameters are available in the Advanced Options section and are optional:

ParameterTypeDescriptionExample
Custom ProfilesStringCustom Configuration Profiles
• Set additional options using custom profiles
• JSON-like format containing predefined parameters
• Enables advanced table processing settings
• Optional for specialized requirements
{ "OutputDataFormat": "json" }

Supported Table Types

Table TypeDescriptionCommon Use Cases
Financial TablesFinancial statements, budgets, and accounting dataFinancial reports, invoices, balance sheets
Data TablesStatistical data, metrics, and analytical informationResearch reports, analytics, performance data
List TablesProduct catalogs, directories, and inventory listsProduct listings, employee directories, catalogs
Comparison TablesFeature comparisons, pricing tables, and specificationsProduct comparisons, pricing matrices, specifications
Schedule TablesTimetables, calendars, and planning documentsMeeting schedules, project timelines, calendars
Summary TablesExecutive summaries, key metrics, and overview dataDashboard data, KPI summaries, executive reports

Table Processing Features

FeatureDescriptionUse Case
Table DetectionAutomatic identification of table boundaries and structuresProcessing documents with multiple tables
Cell ExtractionPrecise extraction of individual cell values and contentData analysis and processing
Header RecognitionIdentification of table headers and column labelsStructured data organization
Row ProcessingExtraction of complete table rows with proper formattingDatabase integration and storage
Column AlignmentPreservation of column structure and alignmentMaintaining data relationships
Multi-Page TablesProcessing tables that span across multiple pagesComplex document analysis

Output

Output Parameters

ParameterTypeDescriptionExample
fileNameStringPDF4me output file name - Name of the generated JSON file containing the extracted table data. The file contains all structured table information in JSON formattable_extraction_results_1756997087699.json
mimeTypeStringPDF4me file MIME type - MIME type of the output file, always "application/json" for JSON format table dataapplication/json
fileSizeNumberPDF4me file size in bytes - Size of the generated JSON file in bytes, provided for storage planning and transfer monitoring563
successBooleanPDF4me extraction status indicator - Boolean flag indicating the success or failure of the table extraction process. PDF4me returns true for successful extractions and false for any errorstrue
messageStringPDF4me extraction status message - Human-readable status message providing details about the table extraction process result. Includes success confirmation or error details for troubleshootingTable extraction completed successfully
docNameStringPDF4me source document reference - Original filename of the PDF document that was processed for table extractiondocument.pdf

N8N Action Response

The PDF4me Extract Table From PDF API returns a response that can be viewed in multiple formats. Choose the view that best fits your needs:

JSON Response Format

The raw JSON response from the API:

[
{
"fileName": "table_extraction_results_1756997087699.json",
"mimeType": "application/json",
"fileSize": 563,
"success": true,
"message": "Table extraction completed successfully",
"docName": "document.pdf"
}
]

Use Cases

Data Processing and Analysis

  • Automated Data Import: Download and process the generated JSON file to automatically import tabular data into databases, spreadsheets, or business applications
  • Data Transformation: Process the JSON file content to transform table data into different formats for various business systems
  • Statistical Analysis: Analyze extracted tabular data from the JSON file to generate insights, trends, and statistical reports

Business Intelligence and Reporting

  • Report Automation: Automatically extract data from financial reports, invoices, and business documents, then process the JSON file for automated reporting
  • Dashboard Integration: Parse the JSON file and feed extracted table data into business intelligence dashboards and analytics platforms
  • Data Validation: Validate and verify tabular data extracted from the JSON file against existing business records

Document Processing and Management

  • Invoice Processing: Extract pricing tables, line items, and financial data from invoices, then process the generated JSON file for automated processing
  • Contract Analysis: Extract terms, conditions, and structured data from contracts and legal documents using the downloadable JSON file
  • Inventory Management: Process product catalogs, inventory lists, and supply chain data from PDF documents via the JSON output file

Enterprise Integration

  • System Integration: Download and integrate the JSON file with existing enterprise systems including ERP, CRM, and data warehouses
  • Workflow Automation: Trigger automated workflows based on JSON file processing and extracted tabular data
  • Data Synchronization: Process the JSON file and synchronize extracted data across multiple systems and platforms for consistent information management

Get Help