What Is PDF/A? The Definitive Guide to PDF Archival Formats
A comprehensive guide to PDF/A — the ISO-standardized format for long-term document preservation used by governments, legal systems, and archives worldwide.
What Is PDF/A?
PDF/A is a specialized subset of the PDF format (ISO 32000) designed specifically for long-term digital preservation of electronic documents. Standardized as ISO 19005 by the International Organization for Standardization, PDF/A ensures that documents can be reliably reproduced and rendered exactly the same way, decades or even centuries into the future.
The "A" in PDF/A stands for "Archive." Unlike regular PDF — which can reference external resources, depend on system fonts, or include dynamic content like JavaScript — PDF/A mandates that everything needed to render the document is self-contained within the file.
PDF/A was first published in 2005 (ISO 19005-1) through a collaboration between AIIM (the Association for Intelligent Information Management) and NPES (the Association for Suppliers of Printing, Publishing, and Converting Technologies). Since then, it has become the international standard for document archiving, adopted by government agencies, courts, financial institutions, and cultural heritage organizations worldwide.
Why PDF/A Matters
Digital preservation is one of the most significant challenges of the information age. Technologies change, software becomes obsolete, and file formats evolve. A document created in a proprietary format today may be unreadable in 20 years if the software that created it no longer exists.
PDF/A solves this problem by requiring:
1. Font embedding: All fonts used in the document must be embedded, ensuring text renders correctly regardless of which fonts are installed on the viewer's system.
2. No external dependencies: PDF/A documents cannot reference external content — no linked images, no external fonts, no external color profiles.
3. No dynamic content: JavaScript, audio, video, and other dynamic elements are prohibited because their interpretation may change over time.
4. Device-independent color: Colors must be specified using device-independent color spaces (like ICC profiles) to ensure consistent color reproduction across different displays and printers.
5. Standardized metadata: XMP (Extensible Metadata Platform) metadata is required, providing machine-readable information about the document's creation, authorship, and purpose.
According to the Library of Congress, PDF/A is one of the preferred formats for long-term preservation of digital content, alongside TIFF for images and WAVE for audio.
PDF/A Conformance Levels
PDF/A defines multiple parts and conformance levels, each with increasing capabilities:
PDF/A-1 (ISO 19005-1:2005): Based on PDF 1.4. The original and most restrictive version. • Level A (Accessible): Requires tagged content (document structure markup), Unicode character mapping, and alternative text for images. Ensures the document is fully accessible for assistive technologies. • Level B (Basic): Requires visual appearance preservation but not tagged content. Ensures the document looks the same but does not guarantee structural accessibility.
PDF/A-2 (ISO 19005-2:2011): Based on PDF 1.7. Adds support for: • JPEG 2000 image compression (more efficient than JPEG) • Transparency effects • Layers (optional content groups) • PDF/A file attachments • Digital signatures conforming to PAdES (PDF Advanced Electronic Signatures)
PDF/A-3 (ISO 19005-3:2012): Same as PDF/A-2 but allows embedding any file format as an attachment. This enables hybrid documents — a PDF/A-3 file can contain the source spreadsheet, XML data, or original design files alongside the archived visual representation.
PDF/A-4 (ISO 19005-4:2020): Based on PDF 2.0. The latest version, simplifying conformance levels to just two: Level F (file-level conformance) and Level E (engineering document support with 3D content).
Which level to use? PDF/A-2b is the most commonly used level for general archiving — it provides visual fidelity with modern PDF features (transparency, JPEG 2000) without the strict accessibility requirements of Level A.
Who Requires PDF/A?
PDF/A compliance is mandated or strongly recommended across numerous industries and jurisdictions:
Government and courts: • The U.S. Federal Courts require PDF/A for electronic filing through the CM/ECF system • The European Union mandates PDF/A for official documents in the EU Official Journal • Germany's TR-ESOR (Technical Guidelines for Evidence Preservation) requires PDF/A for legally binding document storage
Financial services: • Banks and insurance companies use PDF/A for regulatory compliance, ensuring audit trails and financial records remain accessible for mandated retention periods (often 7–10 years)
Healthcare: • Medical records systems increasingly use PDF/A to meet long-term retention requirements under regulations like HIPAA
Cultural heritage: • The Library of Congress, British Library, and national archives worldwide use PDF/A-2b for digitized collections • Museums and galleries archive exhibition catalogs, research papers, and provenance documentation in PDF/A
Corporate compliance: • ISO 9001 and ISO 27001 certified organizations often require PDF/A for quality management and information security documentation
How to Validate PDF/A Compliance
Creating a PDF is not the same as creating a valid PDF/A document. Validation tools analyze a PDF against the ISO 19005 requirements and report any violations.
veraPDF (open-source): Developed with funding from the European Commission's PREFORMA project, veraPDF is the industry-reference validator for PDF/A compliance. It validates all parts and conformance levels of ISO 19005.
Adobe Acrobat Pro: The Preflight tool in Adobe Acrobat Pro can validate and convert PDFs to PDF/A. It provides detailed reports of any compliance issues.
Common validation failures: • Missing font embedding: Fonts referenced but not embedded in the file • Unsupported color spaces: Device-dependent colors (RGB without ICC profiles) • Transparency issues: (PDF/A-1 does not support transparency) • JavaScript or multimedia content: Dynamic elements prohibited in PDF/A • Missing XMP metadata: Required metadata streams not present
AuraPDF's PDF Health Checker analyzes documents for common compliance issues, including font embedding status, metadata completeness, and structural integrity — providing actionable recommendations for improving PDF quality.
PDF/A vs Regular PDF: Key Differences
| Feature | Regular PDF | PDF/A |
|---|---|---|
| Font embedding | Optional | Required |
| External references | Allowed | Prohibited |
| JavaScript | Allowed | Prohibited |
| Encryption | Allowed | Prohibited |
| Audio/Video | Allowed | Prohibited |
| Transparency | Allowed | Depends on version |
| XMP Metadata | Optional | Required |
| Color spaces | Any | Device-independent required |
| File attachments | Any | Restricted (except PDF/A-3) |
Key takeaway: PDF/A trades flexibility for long-term reliability. If you need a document to be perfectly reproducible 50 years from now, PDF/A is the right choice. For everyday document sharing and collaboration, regular PDF is more practical.
Note that PDF/A prohibits encryption — because encryption depends on algorithms that may become obsolete or compromised over time. For archival documents requiring both security and longevity, cryptographic signatures (which verify integrity without preventing access) are preferred over encryption.
Frequently Asked Questions
Can I open PDF/A files with a regular PDF reader?
Can I convert a regular PDF to PDF/A?
Is PDF/A the same as a scanned PDF?
Why can't PDF/A files be encrypted?
Try These Tools
Read Next
Written by the AuraPDF Team
The AuraPDF team builds free, secure PDF tools used by thousands of people worldwide. Our guides combine hands-on expertise with technical depth to help you work with PDFs more effectively.
Learn more about us