Why Export PDF Metadata as JSON or CSV?
Viewing PDF metadata in a browser is useful for a quick check, but many workflows require the data in a machine-readable format. Developers integrating document metadata into pipelines need structured JSON they can parse directly. Compliance teams auditing document provenance need CSV files they can open in spreadsheets and filter. Researchers cataloging document collections need exportable records.
The PDF Metadata Exporter extracts all available info dictionary fields from your PDF — title, author, subject, keywords, creator software, producer library, timestamps, page count, page dimensions, and PDF version — and exports them as a single JSON or CSV file. Everything runs in your browser; no file is uploaded to any server.
JSON Export — For Developers and Scripts
The JSON export produces a structured object with three top-level keys: filename(the original file name), exportedAt (the export timestamp in ISO 8601 format), and metadata (a flat key-value object containing all extracted fields). This format can be parsed directly with JSON.parse() in any language.
Common uses for the JSON export include feeding document metadata into document management APIs, building automated audit trails, populating database records for document cataloging systems, and verifying metadata programmatically before publishing documents.
CSV Export — For Spreadsheets and Reporting
The CSV export produces a three-column table — Category, Property, Value — that opens directly in Excel, Google Sheets, or Numbers. Each metadata field becomes a row, making it easy to sort, filter, and build reports across multiple documents by exporting each and merging the sheets.
CSV exports are especially useful for compliance reporting, document review workflows, editorial metadata audits before publication, and any context where non-technical stakeholders need to review document properties.
What Metadata Is Extracted?
The exporter reads the standard PDF info dictionary, which is the primary metadata container in most PDF files. Fields extracted include:
- Title — the document title as set by the authoring application
- Author — the person or organisation listed as the document author
- Subject — a brief summary or topic description
- Keywords — search and categorization terms
- Creator — the application that originally created the document (e.g. Microsoft Word, InDesign)
- Producer — the PDF library that generated the final PDF (e.g. Adobe Acrobat, pdf-lib)
- Creation Date — when the document was first created
- Modification Date — when the document was last modified
- Page Count — total number of pages
- Page Size — dimensions of the first page in points and millimetres
- PDF Version — the PDF specification version (e.g. 1.7, 2.0)
- File Size — the file size of the original PDF
How to Use the PDF Metadata Exporter
- Drop your PDF — drag and drop onto the upload area, or click to browse. Any standard PDF file is supported.
- Review the preview — all extracted fields are displayed in a table before you export, so you can confirm what will be included.
- Export or copy — click Download JSON for a structured file, Download CSV for a spreadsheet-ready table, or Copy JSON to paste directly into your code or terminal.
Privacy and Security
This tool processes your PDF entirely in your browser using JavaScript and the pdf-lib library. Your file is never transmitted to any server. The exported JSON and CSV files are generated locally and downloaded directly to your device. There are no accounts, no cloud storage, and no data retention. Closing the tab discards everything.
Related Tools
Other NoFileUpload tools for working with PDF metadata:
- PDF Metadata Viewer — browse all metadata fields in a categorized table without exporting.
- PDF Metadata Remover — strip all metadata from a PDF before sharing it.
- PDF XMP Viewer — view the raw XMP metadata stream embedded in your PDF, which can contain fields beyond the standard info dictionary.