What Is XMP Metadata in a PDF?
XMP (Extensible Metadata Platform) is an XML-based metadata standard developed by Adobe and adopted as ISO 16684. It was created because the original PDF info dictionary — which stores fields like title, author, and creation date — is a rigid, limited structure with no support for custom fields or namespaces.
XMP solves this by embedding a full XML document inside the PDF file as a separate stream object. This XML packet can contain all the standard info dictionary fields plus an unlimited range of additional data: rights management information, copyright licensing terms, IPTC news metadata, Dublin Core fields, Adobe-specific workflow data, and any custom namespace fields defined by the creating software.
How XMP Differs From the Standard PDF Info Dictionary
The PDF info dictionary is a flat key-value structure with a fixed set of predefined fields. You can view these with the PDF Metadata Viewer. XMP is fundamentally different in three ways:
- Structure — XMP is a full XML document using RDF (Resource Description Framework), allowing nested objects, arrays, and typed values that the flat info dictionary cannot represent.
- Extensibility — Any software can define its own XMP namespace and embed arbitrary metadata. Adobe Photoshop, InDesign, Lightroom, and Acrobat all write proprietary XMP fields that are invisible to basic PDF viewers.
- Richness — XMP can express concepts the info dictionary has no fields for: Creative Commons licensing terms, IPTC copyright notices, image editing history, document permissions, colour management data, and much more.
Some PDFs have both — the info dictionary and an XMP stream — and they may contradict each other if a document was edited by tools that updated one but not the other.
What XMP Can Reveal
For privacy and compliance purposes, XMP metadata can expose information that goes far beyond the visible title and author fields:
- Document history — Adobe tools write a full editing history into XMP, recording every save operation with a timestamp and software version.
- Original authoring identity — Even if the info dictionary author field is blank, XMP may contain the creator's identity in Dublin Core or custom fields.
- Licensing and rights — Creative Commons, Getty, or custom licensing terms embedded by stock libraries and publishing workflows.
- Workflow data — Prepress software embeds colour profiles, trapping instructions, and press-ready settings in XMP.
- Software fingerprinting — The exact version of every tool that touched the document may be recorded in XMP, even if that information is absent from the info dictionary.
Which PDFs Contain XMP?
XMP is most commonly found in PDFs created or processed by Adobe software — Acrobat, Photoshop, InDesign, and Illustrator all write XMP automatically. Documents exported from professional publishing workflows, stock photo agencies, legal document management systems, and corporate document templates frequently contain rich XMP metadata.
PDFs created by browser print-to-PDF, basic converters, or minimal PDF libraries (like pdf-lib) typically do not include XMP. The XMP stream is optional in the PDF specification — its presence depends entirely on the creating tool.
How to Use the PDF XMP Viewer
- Drop your PDF — drag and drop onto the upload area, or click to browse. The file is read entirely in your browser.
- Extraction — the tool scans the raw PDF binary for the XMP packet markers (
<?xpacket begin=…<?xpacket end=) and extracts the embedded XML. - Review the XML — the XMP content is displayed as formatted XML in the viewer. Look for namespaces like
dc:(Dublin Core),xmp:(basic XMP),pdf:(PDF-specific),xmpRights:(rights management), and custom Adobe namespaces. - Export — download the XMP as a standalone
.xmlfile or copy it to your clipboard for use in scripts or other tools.
Privacy and Security
Your PDF is never uploaded to any server. The entire extraction process runs in your browser — the raw PDF bytes are searched for XMP markers using JavaScript, and the resulting XML is displayed locally. Closing the tab discards everything. There are no accounts, no analytics on file contents, and no data retention.
Related Tools
Other NoFileUpload tools for PDF metadata:
- PDF Metadata Viewer — view the standard info dictionary fields (title, author, dates, software) in a structured table.
- PDF Metadata Remover — strip all info dictionary metadata from a PDF before sharing it.
- PDF Metadata Exporter — export all info dictionary fields as JSON or CSV for use in scripts and workflows.