
Extract Metadata from Word using Zapier

PDF4me Extract Metadata pulls the title, author, subject, keywords, creation and modification dates, page count, word count, and custom properties from Word documents and returns them as structured JSON. It is suited to document cataloging, compliance tracking, CMS population, and metadata-based routing, all without opening the files.

Authenticating Your API Request

To access the PDF4me Web API through Zapier, every request must include valid authentication credentials. Authentication secures the connection and identifies you as an authorized user, which lets your Zapier workflows call PDF4me's Word metadata extraction service.

Configure the Extract Metadata action: map File and File Name from a previous step. Culture Name is optional.

Configuration at a Glance

Example configuration from the Zapier interface

File and File Name are mapped from a previous step (Drive, Dropbox, or a trigger), and Culture Name is optional. The action returns a Metadata object with Title, Author, Subject, Keywords, CreatedDate, ModifiedDate, PageCount, and WordCount.

File: Map from step

File Name: document.docx

:::tip Map File from previous steps
Use the + button next to File and File Name to map from earlier Zap steps (e.g., Google Drive, Dropbox, or a trigger). The Word file must provide full content, not "Exists but not shown" references.
:::

:::warning File: (Exists but not shown)
If you see "File: (Exists but not shown)" in the File field and get errors, select the option that provides the full file content instead. See Zapier & Power Automate Tips for details.
:::

Key Features

  • Built-in Properties: Extract title, author, subject, keywords, company
  • Date Information: Creation date, modification date, last print date
  • Document Statistics: Page count, word count, character count
  • Custom Properties: Extract user-defined custom document properties
  • JSON Output: Structured JSON format for easy integration

Parameters

Complete list of parameters for the Extract Metadata action. Parameter names match the Zapier configuration UI.

Quick setup checklist
  1. Map File and File Name from previous step

Important: Parameters marked with an asterisk (*) are required and must be provided for the action to function correctly.

| Parameter | Type | Description | Example |
|---|---|---|---|
| File * | File | Word document, mapped from a previous step (Drive, Dropbox, or a trigger) | [4. File from Step 4] |
| File Name | String | Word file name, including the .docx or .doc extension | document.docx |
| Culture Name | String | Locale (optional), e.g. en-US | en-US |
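As a rough illustration of the parameter rules above, the following Python sketch checks a file name before it is passed to the action, for example in a Code by Zapier step placed earlier in the Zap. The helper name `validate_inputs` and the returned dictionary shape are hypothetical and not part of PDF4me or Zapier; only the `.docx`/`.doc` requirement and the optional Culture Name come from the table.

```python
from pathlib import PurePath
from typing import Optional

# Extensions the parameter table lists for File Name.
WORD_EXTENSIONS = {".docx", ".doc"}


def validate_inputs(file_name: str, culture_name: Optional[str] = None) -> dict:
    """Hypothetical helper: build the text inputs for the action.

    Raises ValueError when the file name lacks a Word extension.
    """
    suffix = PurePath(file_name).suffix.lower()
    if suffix not in WORD_EXTENSIONS:
        raise ValueError(f"File Name must end in .docx or .doc, got {file_name!r}")

    inputs = {"File Name": file_name}
    # Culture Name is optional, so include it only when a value is given.
    if culture_name:
        inputs["Culture Name"] = culture_name
    return inputs
```

Failing early on a wrong extension keeps a non-Word file from reaching the extraction step at all.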

Output

The PDF4me Extract Metadata action returns a Metadata object (not a file) containing document properties. Map individual metadata fields to your next Zap steps for cataloging, compliance, or CMS integration.

Key output fields (map these to your next step)

Metadata (Object)

JSON object with Title, Author, Subject, Keywords, CreatedDate, ModifiedDate, PageCount, WordCount, and custom properties. Map nested fields (e.g., Metadata.Title, Metadata.Author) to next steps.

| Parameter | Type | Description |
|---|---|---|
| Job Id | String | Unique identifier for the Zapier job execution |
| Metadata | Object | JSON object with document properties |
| Metadata.Title | String | Document title |
| Metadata.Author | String | Document author |
| Metadata.Subject | String | Document subject |
| Metadata.Keywords | String | Document keywords |
| Metadata.CreatedDate | String | Creation date (ISO 8601) |
| Metadata.ModifiedDate | String | Last modification date |
| Metadata.PageCount | Number | Total page count |
| Metadata.WordCount | Number | Total word count |
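To make the output shape concrete, here is a minimal Python sketch that reads a Metadata object and pulls out the fields listed above. The sample payload and its values are invented for illustration, and the exact JSON the action returns may differ. The sketch also assumes that Keywords is a comma-separated string and that CreatedDate has no trailing `Z`, so that `datetime.fromisoformat` accepts it on older Python versions.

```python
import json
from datetime import datetime

# Invented sample shaped like the documented output fields.
SAMPLE_OUTPUT = """
{
  "Metadata": {
    "Title": "Quarterly Report",
    "Author": "Jane Doe",
    "Subject": "Finance",
    "Keywords": "finance, quarterly, report",
    "CreatedDate": "2024-03-15T09:30:00",
    "ModifiedDate": "2024-03-20T14:05:00",
    "PageCount": 12,
    "WordCount": 3450
  }
}
"""


def summarize(raw: str) -> dict:
    """Flatten the Metadata object into values ready for a later step."""
    meta = json.loads(raw).get("Metadata", {})
    created = meta.get("CreatedDate")
    return {
        "title": meta.get("Title", ""),
        "author": meta.get("Author", ""),
        # Assumption: keywords arrive as one comma-separated string.
        "keywords": [k.strip() for k in meta.get("Keywords", "").split(",") if k.strip()],
        "created": datetime.fromisoformat(created) if created else None,
        "pages": int(meta.get("PageCount", 0)),
        "words": int(meta.get("WordCount", 0)),
    }


summary = summarize(SAMPLE_OUTPUT)
```

Using `.get` with defaults means a document with empty properties yields blank values instead of an error.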

Scenario Examples

Automated Document Catalog Indexing Workflow

Complete Scenario Steps:

  1. Trigger: New document uploaded to library
  2. Get Document: Retrieve Word file
  3. Extract Metadata: Get all document properties
  4. Parse JSON: Extract title, author, keywords
  5. Create Index Entry: Insert metadata to catalog database
  6. Tag Document: Apply tags based on keywords
  7. Update Search Index: Add document to search system
  8. Email Cataloging Team: Send indexing confirmation
  9. Archive Original: Store in indexed archive
  10. Log Event: Record catalog entry creation
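Steps 4 to 6 of this scenario (parse, create an index entry, tag by keyword) could be sketched in Python roughly as follows. Everything here is illustrative: `TAG_RULES`, `build_index_entry`, and the entry layout are hypothetical, and the sketch assumes keywords are separated by commas or semicolons.

```python
# Hypothetical mapping from a keyword to a catalog tag.
TAG_RULES = {
    "invoice": "finance",
    "contract": "legal",
    "policy": "compliance",
}


def build_index_entry(file_name: str, metadata: dict) -> dict:
    """Turn extracted metadata into a catalog record with derived tags."""
    raw_keywords = metadata.get("Keywords", "").replace(";", ",")
    keywords = [k.strip().lower() for k in raw_keywords.split(",") if k.strip()]

    # A set removes duplicate tags; sorting keeps the output stable.
    tags = sorted({TAG_RULES[k] for k in keywords if k in TAG_RULES})

    return {
        "file_name": file_name,
        # Fall back to the file name when the document has no title set.
        "title": metadata.get("Title") or file_name,
        "author": metadata.get("Author", ""),
        "keywords": keywords,
        "tags": tags,
    }


entry = build_index_entry(
    "msa.docx",
    {"Title": "", "Author": "Jane Doe", "Keywords": "Contract; Policy, renewal"},
)
```

The resulting `entry` dictionary is what a later step would insert into the catalog database or search index.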

Business Benefits:

  • Catalogs 500+ documents monthly automatically
  • Metadata extraction enables searchability
  • Automated indexing eliminates manual data entry
  • Reduces cataloging time from 10 minutes to 30 seconds

Industry Use Cases & Applications

  • Document Audit Trails: Extract modification dates and authors for compliance audit evidence
  • Version Control Tracking: Monitor document versions using creation and modification timestamps
  • Author Verification: Validate document authors match authorized personnel lists
  • Retention Policy Enforcement: Check creation dates against retention schedules for archival decisions
  • Controlled Document Management: Track document properties for ISO/quality management compliance
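Two of these use cases, author verification and retention checks, reduce to simple comparisons on the extracted fields. The Python sketch below shows one possible shape, assuming a hypothetical allow-list of authors, a hypothetical seven-year retention period, and a CreatedDate string without a timezone suffix.

```python
from datetime import datetime, timedelta

# Hypothetical policy values; replace with your organization's own.
AUTHORIZED_AUTHORS = {"jane doe", "john smith"}
RETENTION_PERIOD = timedelta(days=7 * 365)


def review(metadata: dict, now: datetime) -> dict:
    """Flag whether the author is on the allow-list and the document is past retention."""
    author = metadata.get("Author", "").strip().lower()
    # Assumption: CreatedDate is ISO 8601 without a timezone, matching a naive `now`.
    created = datetime.fromisoformat(metadata["CreatedDate"])
    return {
        "author_authorized": author in AUTHORIZED_AUTHORS,
        "past_retention": now - created > RETENTION_PERIOD,
    }


result = review(
    {"Author": "Jane Doe", "CreatedDate": "2015-01-01T00:00:00"},
    now=datetime(2024, 1, 1),
)
```

Passing `now` as an argument, instead of reading the clock inside the function, makes the check repeatable when audited later.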

Get Help