Extract Metadata in Power Automate

The PDF4me Extract Metadata action extracts metadata and properties from Word documents in Power Automate. It supports built-in document properties, custom properties, document statistics, author information, creation dates, and revision tracking, with culture-specific date and time formatting. Typical uses include document management, compliance tracking, and content analysis workflows across Microsoft 365.

Authenticating Your API Request

To access the PDF4me Web API through Power Automate, every request must include proper authentication credentials. Authentication ensures secure communication and validates your identity as an authorized user, enabling seamless integration between your Power Automate flows and PDF4me's powerful Word metadata extraction services.

Key Features

  • Comprehensive Metadata Extraction: Retrieve all built-in and custom document properties
  • Document Statistics: Get real-time page count, word count, character count, and paragraph count
  • Author Information: Extract author, manager, company, and contact details
  • Date/Time Properties: Retrieve creation, modification, and printing timestamps
  • Custom Properties: Access all custom document properties with automatic prefixing
  • Culture Support: Format dates and times according to specified locale
  • Read-Only Operation: Extract metadata without modifying the original document
  • Structured Output: Return organized metadata in JSON format for easy processing

Parameters

Complete list of parameters for the Extract Metadata action. Configure these parameters to control metadata extraction behavior.

Important: Parameters marked with three asterisks (***) are required. The optional Culture Name parameter controls how dates and times in the metadata are formatted.

Parameter | Type | Description | Example

File Content*** | Base64 | Source Word File Content | [File Content from Get File]
  • Map the Word file from a previous action's output
  • Supports Word files from SharePoint, Dropbox, OneDrive, and email attachments
  • Can be retrieved dynamically from flow variables
  • A Base64 string can also be entered directly
  • Must be a valid Word document (.docx or .doc)
  • The document is only read, never modified, during metadata extraction

File Name*** | String | Word Document Name | document.docx
  • Specify the Word file name with its extension (.docx or .doc)
  • Must include the proper file extension
  • Supports dynamic naming from flow variables
  • Used for document identification, processing, and logging

Culture Name | String | Document Culture/Locale | en-US
  • Culture code for date/time formatting (e.g., "en-US", "de-DE", "fr-FR")
  • Default: InvariantCulture (consistent formatting)
  • Affects the date/time display format in the metadata
  • Use valid culture codes for international documents
  • Invalid culture codes fall back to InvariantCulture
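
The three inputs above can be sketched in Python to show how they fit together. This is a minimal illustration, not the connector's actual request format: the dictionary keys `file_content`, `file_name`, and `culture_name` and the helper name are hypothetical. It only demonstrates the Base64 encoding of the file, the extension check, and the optional culture code.

```python
import base64

ALLOWED_EXTENSIONS = (".docx", ".doc")


def build_extract_metadata_inputs(file_bytes, file_name, culture_name=None):
    """Assemble the three action inputs described in the parameter table.

    The dictionary keys are hypothetical labels for illustration, not the
    connector's real field names.
    """
    if not file_bytes:
        raise ValueError("File Content is required")
    if not file_name.lower().endswith(ALLOWED_EXTENSIONS):
        raise ValueError("File Name must end in .docx or .doc")

    inputs = {
        # File Content is passed as a Base64 string.
        "file_content": base64.b64encode(file_bytes).decode("ascii"),
        "file_name": file_name,
    }
    # Culture Name is optional; when it is omitted the action is documented
    # to use InvariantCulture, so the key is simply left out here.
    if culture_name:
        inputs["culture_name"] = culture_name
    return inputs
```

Inside a flow, the File Content value normally comes from a previous action such as Get File, so this encoding step is only needed when the file bytes are prepared outside Power Automate.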

Output

The PDF4me Extract Metadata action returns comprehensive output data for seamless Power Automate flow integration:

Table View

Response data in a structured table format:

Parameter | Type | Description
metadata | Object | Comprehensive metadata dictionary containing all document properties
Success | Boolean | true if operation successful, false if failed
Error Message | String | Error description (null if successful)
Errors | Array | List of detailed error information (empty array if successful)
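
A downstream step would typically check Success before reading metadata. The sketch below assumes the response has already been parsed into a Python dictionary and that its keys match the names in the table above exactly (`metadata`, `Success`, `Error Message`, `Errors`). The real JSON key names and casing may differ, and the helper name is hypothetical.

```python
def read_metadata(response):
    """Return the metadata dictionary, or raise with the reported error.

    Key names are taken from the output table and are an assumption about
    the actual JSON payload.
    """
    if response.get("Success"):
        return response.get("metadata") or {}

    message = response.get("Error Message") or "Unknown error"
    details = response.get("Errors") or []
    raise RuntimeError(f"{message} ({len(details)} detail entries)")
```

Raising on failure keeps the error description together with the number of entries in Errors, so a calling step can log both before deciding whether to retry.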

Metadata Properties

Built-in Document Properties

Property Name | Type | Description | Example
Author | String | Document author name | "John Doe"
Title | String | Document title | "Project Proposal"
Subject | String | Document subject | "Q1 2024 Planning"
Keywords | String | Document keywords (comma-separated) | "planning, budget, strategy"
Comments | String | Document comments | "Draft version for review"
Category | String | Document category | "Business"
Company | String | Company name | "Acme Corporation"
Manager | String | Manager name | "Jane Smith"
Created | String | Document creation date/time (formatted) | "1/15/2024 10:30:00 AM"
LastModified | String | Last saved date/time (formatted) | "1/20/2024 2:45:00 PM"
LastPrinted | String | Last printed date/time (formatted) | "1/18/2024 9:15:00 AM"
RevisionNumber | Integer | Document revision number | 3
TotalEditingTime | Integer | Total editing time in minutes | 120

Document Statistics

Property Name | Type | Description | Example
Pages | Integer | Total number of pages | 8
Words | Integer | Total word count | 2150
Characters | Integer | Total character count | 12800
Paragraphs | Integer | Total paragraph count | 45

Supported Culture Examples

Culture Code | Description | Date Format Example
en-US | English (United States) | "1/15/2024 10:30:00 AM"
en-GB | English (United Kingdom) | "15/01/2024 10:30:00"
de-DE | German (Germany) | "15.01.2024 10:30:00"
fr-FR | French (France) | "15/01/2024 10:30:00"
es-ES | Spanish (Spain) | "15/01/2024 10:30:00"
ja-JP | Japanese (Japan) | "2024/01/15 10:30:00"
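
Because Created, LastModified, and LastPrinted are returned as formatted strings, a consumer that needs real date values has to parse them according to the culture that was requested. The following is a rough sketch, assuming the action's output matches the example formats in the table above exactly. The pattern map and helper name are hypothetical, and only the six listed cultures are covered.

```python
from datetime import datetime

# strptime patterns inferred from the example column of the culture table.
CULTURE_PATTERNS = {
    "en-US": "%m/%d/%Y %I:%M:%S %p",
    "en-GB": "%d/%m/%Y %H:%M:%S",
    "de-DE": "%d.%m.%Y %H:%M:%S",
    "fr-FR": "%d/%m/%Y %H:%M:%S",
    "es-ES": "%d/%m/%Y %H:%M:%S",
    "ja-JP": "%Y/%m/%d %H:%M:%S",
}


def parse_metadata_date(value, culture_name):
    """Parse a formatted metadata date string for one of the listed cultures.

    Raises KeyError for a culture that is not in the map and ValueError
    when the string does not match the expected pattern.
    """
    return datetime.strptime(value, CULTURE_PATTERNS[culture_name])
```

Using the same Culture Name for every call in a flow keeps the pattern lookup trivial, which is one practical reason to set it explicitly rather than rely on the default.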

Common Error Messages

Understanding and troubleshooting errors helps ensure smooth metadata extraction workflows:

Error Message | Cause | Solution
"Request is empty" | Request object is null | Provide valid request object
"Document is empty" | Document object is null | Provide valid Word document
"Error extracting metadata" | General exception during metadata extraction | Check document format and parameters. Review exception details in Errors array
"Error loading document from bytes" | Invalid or corrupted Word document | Verify the input document is a valid Word file (.docx)

Workflow Examples

The PDF4me Extract Metadata action in Power Automate provides comprehensive workflow templates designed for real-world business scenarios:

Automated Document Classification Workflow

Streamline your document management with automated metadata-based classification:

Complete Workflow Steps:

  1. Trigger: Document uploaded to SharePoint document library
  2. Get Document: Retrieve Word document from upload location
  3. Extract Metadata: Extract all document properties and statistics
  4. Set Culture: Set Culture Name = "en-US" on the Extract Metadata action for consistent date formatting
  5. Classify by Author: Route documents based on Author property
  6. Classify by Category: Route documents based on Category property
  7. Update SharePoint: Move document to appropriate folder based on classification
  8. Log Classification: Record classification results in document management system
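
The routing decision in steps 5 and 6 can be sketched as a small lookup. This is an illustrative sketch only: in a real flow the logic would be built from Condition or Switch actions, and the function name, rule dictionaries, and folder paths below are hypothetical. The Author rule is checked before the Category rule to mirror the step order above.

```python
def choose_folder(metadata, author_routes, category_routes, default_folder):
    """Pick a destination folder from the Author and Category properties."""
    author = (metadata.get("Author") or "").strip()
    category = (metadata.get("Category") or "").strip()

    # Step 5: route by author first.
    if author in author_routes:
        return author_routes[author]
    # Step 6: fall back to the document category.
    if category in category_routes:
        return category_routes[category]
    # Documents with no matching rule go to a review location.
    return default_folder
```

For example, with `author_routes = {"Jane Smith": "/Management"}` and `category_routes = {"Business": "/Business"}`, a document whose Author is "John Doe" and whose Category is "Business" would be routed to "/Business".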

Business Benefits:

  • Classifies 500+ documents monthly automatically
  • Routes documents to appropriate departments based on metadata
  • Reduces manual document sorting time by 90%
  • Ensures consistent document organization

Industry Use Cases & Applications

Legal & Professional Services Use Cases

  • Document Classification: Classify legal documents by case type, client, or practice area
  • Compliance Tracking: Monitor document creation dates and revision history for regulatory compliance
  • Client Document Management: Track document properties for client file organization
  • Audit Trail Maintenance: Log document metadata for legal audit requirements
