Skip to main content

Static Lines Stayed in the PDF. n8n Gave Them a Spreadsheet to Live In.

· 13 min read
SEO and Content Writer

Invoice tables and line items often arrive as PDFs. You need them in Excel for filters, formulas, and imports. n8n with PDF4me chains the right operations: Dropbox – Download a file pulls the PDF, Convert PDF to editable PDF using OCR recovers text from scans, Convert PDF to Excel builds the workbook, then Dropbox – Upload a file saves the .xlsx next to your process. Five nodes including the trigger. One execution. No retyping.

This guide uses authentic screenshots and descriptive image captions so you can match credentials, binary field names, and quality settings in your own workflow.

In a nutshell: When clicking Execute workflowDropbox – Download a file (/blog data/convert from pdf/sample_pdf.pdf, binary field data) → PDF4me – Convert PDF to editable PDF using OCR (Find Search; binary data; QualityType Draft; OCR Only When Needed true; Language English; output editable_pdf_output.pdf; binary output data) → PDF4me – Convert PDF to Excel (Convert; binary from OCR; QualityType High; Merge All Sheets; OCR When Needed; output PDF_to_EXCEL_output.xlsx; binary data) → Dropbox – Upload a file (/blog data/convert from pdf/output/PDF_To_Excel_Output.xlsx, binary data). Result: an Excel file in Dropbox with extracted rows and columns.

What You'll Get!

Input: A PDF in Dropbox (e.g. sample_pdf.pdf under /blog data/convert from pdf/) containing tabular content such as invoice lines. Output: An Excel workbook (e.g. PDF_To_Excel_Output.xlsx or PDF_to_EXCEL_output.xlsx depending on naming) stored under /blog data/convert from pdf/output/, with headers like ID, Name, Product, Quantity, Price, Total ready for analysis.


What You Need?

  • n8nn8n (cloud or self-hosted). Create a new workflow.
  • PDF4me credentialsGet your API key. Add a PDF4me account in n8n. See Connect PDF4me to n8n.
  • Dropbox — Dropbox OAuth2 API credential. Nodes: download by path and upload with binary.
  • Source PDF — e.g. /blog data/convert from pdf/sample_pdf.pdf.
  • Output folder — e.g. /blog data/convert from pdf/output/ for the generated .xlsx.

Sample source content (what the PDF represents)

The PDF may encode a small table like this test dataset—useful for validating column mapping after conversion.

Sample invoice data table PDF to Excel test with columns ID Name Product Quantity Price Total and five product rows

Example structured data: line items with quantities and totals, ideal for PDF-to-Excel regression tests.


The Workflow at a Glance (5 Nodes!)

  1. Trigger — When clicking Execute workflow (or schedule / webhook).
  2. Dropbox – Download a file — File Path: /blog data/convert from pdf/sample_pdf.pdf. Put output binary in field data.
  3. PDF4me (Find Search) — Convert PDF to editable PDF using OCR — Binary Property data; QualityType Draft; OCR Only When Needed true; Language English; Output File Name editable_pdf_output.pdf; Binary Data Output Name data.
  4. PDF4me (Convert) — Convert PDF to Excel — Binary from step 3; QualityType High; Language English; Merge All Sheets and OCR When Needed as needed; Output File Name PDF_to_EXCEL_output.xlsx; Binary Data Output Name data.
  5. Dropbox – Upload a file — File Path: /blog data/convert from pdf/output/PDF_To_Excel_Output.xlsx. Input Binary Field: data.

Complete workflow overview

n8n workflow canvas: manual trigger, Dropbox Download a file, PDF4me Convert PDF to editable PDF using OCR, PDF4me Convert PDF to Excel, Dropbox Upload a file connected in sequence

End-to-end chain: ingest PDF, OCR, convert to Excel, upload .xlsx. Each node shows one item when run successfully.


Step 1: Download a file (Dropbox)

Workflow so far: Trigger and Dropbox download only.

  1. Add When clicking Execute workflow (or your preferred trigger).
  2. Add DropboxDownload (file by path).
  3. Credential — Dropbox OAuth2 API.
  4. Resource — File. Operation — Download.
  5. File Path/blog data/convert from pdf/sample_pdf.pdf.
  6. Put Output File in Fielddata so PDF4me nodes read binary from the same property name consistently.
  7. Execute step and confirm output metadata (file name, size, MIME type application/pdf).

Dropbox – Download a file: Parameters

n8n Dropbox Download a file node Parameters File Path blog data convert from pdf sample_pdf.pdf Put Output File in Field data Credential Dropbox OAuth2 API OUTPUT shows sample_pdf.pdf binary metadata

Binary output in data feeds the OCR node; path matches your Dropbox folder layout.


Step 2: Convert PDF to editable PDF using OCR (PDF4me)

Workflow so far: Trigger → Download → OCR.

  1. Add PDF4me and choose Resource: Find Search (or the grouping that exposes OCR in your n8n version).
  2. OperationConvert PDF to editable PDF using OCR.
  3. Input Data TypeBinary Data.
  4. PDF Binary Fielddata (from Download).
  5. Quality TypeDraft for a fast pass; use High when scans are noisy (see docs for API cost per quality).
  6. OCR Only When Neededtrue to skip OCR when the PDF already has a text layer (efficiency).
  7. LanguageEnglish (set explicitly when detection is unreliable).
  8. Output File Name — e.g. editable_pdf_output.pdf.
  9. Binary Data Output Namedata so the next node keeps a single binary pipeline.
  10. Execute step and confirm output size reflects an OCR’d text layer (often larger than the source scan).

PDF4me – Convert PDF to editable PDF using OCR

n8n PDF4me Convert PDF to editable PDF using OCR Parameters Binary Data data field QualityType Draft OCR Only When Needed True Language English Output File Name editable_pdf_output.pdf Binary Data Output Name data INPUT OUTPUT panels

OCR produces a searchable PDF buffer used as input to Convert PDF to Excel—important for scanned invoices.


Step 3: Convert PDF to Excel (PDF4me)

Workflow so far: Trigger → Download → OCR → Convert to Excel.

  1. Add PDF4meResource: ConvertConvert PDF to Excel.
  2. Input Data TypeBinary Data.
  3. Binary Propertydata from the OCR node output (not the original download).
  4. Input File Name — e.g. New.pdf or a name matching your intermediate file; keep extension consistent with what PDF4me expects.
  5. Quality TypeHigh for stronger table recovery on complex or scanned PDFs.
  6. LanguageEnglish.
  7. Merge All Sheetson if you want one consolidated sheet when the PDF spans multiple pages or tables.
  8. OCR When Neededon so conversion can still apply OCR if required for parts of the document.
  9. Output File Name — e.g. PDF_to_EXCEL_output.xlsx.
  10. Binary Data Output Namedata.
  11. Execute step and verify OUTPUT shows application/vnd.openxmlformats-officedocument.spreadsheetml.sheet and a sensible file size.

PDF4me – Convert PDF to Excel

n8n PDF4me Convert PDF to Excel node INPUT from Convert PDF to editable PDF QualityType High Language English Merge All Sheets OCR When Needed on Output File Name PDF_to_EXCEL_output.xlsx Binary Property data OUTPUT xlsx file metadata

Mapping OCR output—not the raw download—keeps table detection aligned with searchable text.


Step 4: Upload a file (Dropbox)

Workflow so far: Full pipeline; upload publishes the spreadsheet.

  1. Add DropboxUpload (or Upload a file).
  2. Credential — Same Dropbox OAuth2 API.
  3. Resource — File. Operation — Upload.
  4. File Path — Full destination path including file name, e.g. /blog data/convert from pdf/output/PDF_To_Excel_Output.xlsx (match casing to your Dropbox folder).
  5. Binary FileOn.
  6. Input Binary Fielddata from Convert PDF to Excel.
  7. Execute step and confirm OUTPUT returns Dropbox metadata (path_display, id, is_downloadable).

Dropbox – Upload a file: Parameters

n8n Dropbox Upload a file File Path blog data convert from pdf output PDF_To_Excel_Output.xlsx Binary File on Input Binary Field data from Convert PDF to Excel OUTPUT Dropbox file metadata

Upload completes the loop: processed data lands in a dedicated output folder for finance or ops.


Output: Excel Online!

Open the uploaded file in Excel on the web or desktop. You should see columns such as ID, Name, Product, Quantity, Price, and Total aligned with your source table.

Excel Online spreadsheet PDF_to_Excel Saved with invoice table columns ID Name Product Quantity Price Total and five data rows

Final grid: editable rows for pivot tables, imports, and review—without manual copy from the PDF.


Use Cases

Operations and finance: Turn recurring vendor PDFs into spreadsheets for reconciliation, three-way match, or ERP staging tables.

Scanned PDFs: Run Convert PDF to editable PDF using OCR before Convert PDF to Excel so text and tables are recoverable from image-based documents.

Same logic elsewhere: Compare with PDF to Excel with OCR in Power Automate if your stack is Microsoft-first.


Quick Reference

StepNodePurposeKey settings
1Dropbox – Download a fileLoad PDFPath .../sample_pdf.pdf; binary field data
2PDF4me – OCR editable PDFSearchable PDFBinary data; Draft; OCR only when needed; English
3PDF4me – Convert PDF to ExcelXLSXBinary from step 2; High; merge sheets; PDF_to_EXCEL_output.xlsx
4Dropbox – Upload a fileSave outputFull path under .../output/; binary data

For full parameter lists, see Convert PDF to Editable PDF Using OCR — n8n and Convert PDF to Excel — n8n.


Troubleshooting

Empty or shifted columns

Point Convert PDF to Excel at the OCR output, not the first download. Raise Quality Type to High and confirm Language matches the document.

Binary field errors between nodes

Keep a consistent Binary Property name (e.g. data) end to end, or update each node after renames. Re-run Execute step per node when debugging.

API or credential errors

See PDF4me Troubleshooting for API key, credits, and connectivity.


What's Next?

  1. Production trigger — Replace the manual trigger with Cron, Webhook, or Dropbox event patterns so new PDFs convert without opening n8n.
  2. Naming — Build the upload path from the source file name plus .xlsx to avoid overwriting prior runs.
  3. Validation — Add a Spreadsheet File read or HTTP step to post rows to a database or queue.
  4. Governance — Log file IDs from Dropbox OUTPUT for audit trails.