Extract Text and Images in Power Automate
PDF4me Extract Text and Images action retrieves all textual content and embedded images from PDF documents in Power Automate for comprehensive content extraction. This versatile feature offers independent control over text and image extraction, returning full text as string and images as array of objects, enabling content reuse, data analysis, image processing, and multi-format content distribution across Microsoft 365 workflows.
Authenticating Your API Request
To access the PDF4me Web API through Power Automate, every request must include proper authentication credentials. Authentication ensures secure communication and validates your identity as an authorized user, enabling seamless integration between your Power Automate flows and PDF4me's powerful content extraction services.

Key Features
- Text Extraction: Retrieve all text content from PDF documents
- Image Extraction: Extract all embedded images as separate files
- Independent Control: Choose to extract text only, images only, or both
- Array Output: Receive images as array for iteration and processing
- Batch Processing: Extract content from multiple PDFs in workflows
Parameters
Complete list of parameters for the Extract Text and Images action. Configure these parameters to control content extraction.
Important: Parameters marked with an asterisk (***) are required and must be provided for the action to function correctly.
| Parameter | Type | Description | Example |
|---|---|---|---|
| File Content*** | Binary | Source PDF File Content • Map PDF file from previous action output • Supports PDFs from SharePoint, OneDrive, email • Can be dynamically retrieved from flow variables • Must be valid PDF document | [File Content from Get File] |
| Name*** | String | PDF Document Name • Source PDF file name with .pdf extension • Used for processing identification • Must include proper file extension • Supports dynamic naming from flows | Document.pdf |
| Extract Images*** | Boolean | Image Extraction Control • True - Extract all embedded images • False - Skip image extraction • Set to false if images not required • Reduces processing when only text needed | true |
| Extract Text*** | Boolean | Text Extraction Control • True - Extract all text content • False - Skip text extraction • Set to false if text not required • Reduces processing when only images needed | true |
Output
The PDF4me Extract Text and Images action returns comprehensive output data for seamless Power Automate flow integration:
- Table
- JSON
- Flow Integration
Table View
Response data in a structured table format:
| Parameter | Type | Description |
|---|---|---|
| Texts | String | Complete text content extracted from PDF |
| Images | Array | List of extracted images returned as array of objects |
JSON Response Format
{
"Texts": "Extracted text content from entire PDF document...",
"Images": [
{
"ImageData": "[Base64 Image Content]",
"ImageName": "image_1.png"
},
{
"ImageData": "[Base64 Image Content]",
"ImageName": "image_2.png"
}
]
}
Power Automate Flow Usage
Use extracted content in subsequent actions:
- Text Analysis: Analyze extracted text with AI Builder
- Image Processing: Process extracted images individually
- Database Storage: Store text and images in databases
- Content Reuse: Repurpose extracted content in new documents
- Search Indexing: Index extracted text for searchability
- Archive Separately: Store text and images in organized structure
Workflow Examples
The PDF4me Extract Text and Images action in Power Automate provides comprehensive workflow templates designed for real-world business scenarios:
- Content Archival
- Content Analysis
- Content Migration
- Translation Preparation
Automated Content Separation and Archival
Transform your content management with automated extraction and storage:
Complete Workflow Steps:
- Trigger: Document uploaded to processing folder
- Get Document: Retrieve PDF from SharePoint
- Extract Content: Extract both text and images
- Save Text: Store extracted text in text file in archive
- Apply to Each Image: Iterate through extracted images
- Save Images: Store each image individually in images folder
- Create Index: Generate metadata with text and image references
- Update Database: Log extraction details and locations
Business Benefits:
- Separates content from 200+ PDFs monthly
- Enables independent text and image management
- Reduces storage redundancy by 60%
- Facilitates content reuse and repurposing
Automated PDF Content Analysis Workflow
Streamline content analysis with automated text and image extraction:
Complete Workflow Steps:
- Trigger: Research paper PDF uploaded for analysis
- Get PDF: Retrieve PDF from research library
- Extract Text: Retrieve all text content
- Analyze Text: Use AI Builder for sentiment and key phrase analysis
- Extract Images: Get all embedded images and charts
- Process Images: Use AI Vision to analyze images
- Compile Report: Create analysis report with findings
- Email Researchers: Send analysis summary
Business Benefits:
- Analyzes 50+ research documents monthly
- Automates comprehensive content analysis
- Reduces analysis time from 4 hours to 20 minutes
- Enables AI-powered insights from PDF content
Automated Content Migration to SharePoint
Optimize content migration with automated extraction and import:
Complete Workflow Steps:
- Trigger: Legacy PDF uploaded for migration
- Get PDF: Retrieve PDF from migration source
- Extract Text: Get all text content for indexing
- Extract Images: Retrieve all embedded images
- Create SharePoint Page: Build modern SharePoint page
- Add Text Content: Import extracted text to page
- Upload Images: Add images to SharePoint assets
- Link Content: Reference images in page content
Business Benefits:
- Migrates 100+ legacy PDFs monthly to modern format
- Enables content search and discovery in SharePoint
- Reduces manual migration time by 85%
- Maintains content accessibility and usability
Automated Content Extraction for Translation
Enhance translation workflows with automated content extraction:
Complete Workflow Steps:
- Trigger: Document marked for translation
- Get Document: Retrieve PDF requiring translation
- Extract Text: Get all text content
- Extract Images: Retrieve embedded images separately
- Send to Translator: Submit text to Azure Translator API
- Translate Text: Convert text to target language
- Recreate Document: Build new PDF with translated text and original images
- Email Stakeholders: Send translated document
Business Benefits:
- Translates 40+ documents monthly
- Preserves original images while translating text
- Reduces translation preparation time by 75%
- Enables rapid multi-language document creation
Industry Use Cases & Applications
- Publishing & Media
- Legal & Professional Services
- Marketing & Communications
- Education & Training
Publishing & Media Use Cases
- Content Extraction: Extract text and images for content reuse
- Digital Asset Management: Separate and catalog document assets
- Content Migration: Extract content for platform migration
- Archive Management: Organize content separately for better access
Legal & Professional Services Use Cases
- Document Review: Extract text for legal review and analysis
- Evidence Collection: Extract images and text from case documents
- Discovery Processing: Separate content for e-discovery
- Content Analysis: Extract for compliance and review
Marketing & Communications Use Cases
- Content Repurposing: Extract content for marketing reuse
- Asset Collection: Gather images for marketing libraries
- Campaign Materials: Extract content for campaign creation
- Brand Asset Management: Collect logos and branding images
Education & Training Use Cases
- Course Content Extraction: Extract text and images from course materials
- Learning Object Creation: Separate content for LMS upload
- Content Migration: Extract for platform transitions
- Archive Management: Organize educational content assets