Skip to main content
AAuraPDF
Guide11 min read

What Is PDF/A? The Definitive Guide to PDF Archival Formats

A comprehensive guide to PDF/A — the ISO-standardized format for long-term document preservation used by governments, legal systems, and archives worldwide.

AuraPDF TeamMarch 28, 2026

What Is PDF/A?

PDF/A is a specialized subset of the PDF format (ISO 32000) designed specifically for long-term digital preservation of electronic documents. Standardized as ISO 19005 by the International Organization for Standardization, PDF/A ensures that documents can be reliably reproduced and rendered exactly the same way, decades or even centuries into the future.

The "A" in PDF/A stands for "Archive." Unlike regular PDF — which can reference external resources, depend on system fonts, or include dynamic content like JavaScript — PDF/A mandates that everything needed to render the document is self-contained within the file.

PDF/A was first published in 2005 (ISO 19005-1) through a collaboration between AIIM (the Association for Intelligent Information Management) and NPES (the Association for Suppliers of Printing, Publishing, and Converting Technologies). Since then, it has become the international standard for document archiving, adopted by government agencies, courts, financial institutions, and cultural heritage organizations worldwide.

Why PDF/A Matters

Digital preservation is one of the most significant challenges of the information age. Technologies change, software becomes obsolete, and file formats evolve. A document created in a proprietary format today may be unreadable in 20 years if the software that created it no longer exists.

PDF/A solves this problem by requiring:

1. Font embedding: All fonts used in the document must be embedded, ensuring text renders correctly regardless of which fonts are installed on the viewer's system.

2. No external dependencies: PDF/A documents cannot reference external content — no linked images, no external fonts, no external color profiles.

3. No dynamic content: JavaScript, audio, video, and other dynamic elements are prohibited because their interpretation may change over time.

4. Device-independent color: Colors must be specified using device-independent color spaces (like ICC profiles) to ensure consistent color reproduction across different displays and printers.

5. Standardized metadata: XMP (Extensible Metadata Platform) metadata is required, providing machine-readable information about the document's creation, authorship, and purpose.

According to the Library of Congress, PDF/A is one of the preferred formats for long-term preservation of digital content, alongside TIFF for images and WAVE for audio.

PDF/A Conformance Levels

PDF/A defines multiple parts and conformance levels, each with increasing capabilities:

PDF/A-1 (ISO 19005-1:2005): Based on PDF 1.4. The original and most restrictive version. • Level A (Accessible): Requires tagged content (document structure markup), Unicode character mapping, and alternative text for images. Ensures the document is fully accessible for assistive technologies. • Level B (Basic): Requires visual appearance preservation but not tagged content. Ensures the document looks the same but does not guarantee structural accessibility.

PDF/A-2 (ISO 19005-2:2011): Based on PDF 1.7. Adds support for: • JPEG 2000 image compression (more efficient than JPEG) • Transparency effects • Layers (optional content groups) • PDF/A file attachments • Digital signatures conforming to PAdES (PDF Advanced Electronic Signatures)

PDF/A-3 (ISO 19005-3:2012): Same as PDF/A-2 but allows embedding any file format as an attachment. This enables hybrid documents — a PDF/A-3 file can contain the source spreadsheet, XML data, or original design files alongside the archived visual representation.

PDF/A-4 (ISO 19005-4:2020): Based on PDF 2.0. The latest version, simplifying conformance levels to just two: Level F (file-level conformance) and Level E (engineering document support with 3D content).

Which level to use? PDF/A-2b is the most commonly used level for general archiving — it provides visual fidelity with modern PDF features (transparency, JPEG 2000) without the strict accessibility requirements of Level A.

Who Requires PDF/A?

PDF/A compliance is mandated or strongly recommended across numerous industries and jurisdictions:

Government and courts: • The U.S. Federal Courts require PDF/A for electronic filing through the CM/ECF system • The European Union mandates PDF/A for official documents in the EU Official Journal • Germany's TR-ESOR (Technical Guidelines for Evidence Preservation) requires PDF/A for legally binding document storage

Financial services: • Banks and insurance companies use PDF/A for regulatory compliance, ensuring audit trails and financial records remain accessible for mandated retention periods (often 7–10 years)

Healthcare: • Medical records systems increasingly use PDF/A to meet long-term retention requirements under regulations like HIPAA

Cultural heritage: • The Library of Congress, British Library, and national archives worldwide use PDF/A-2b for digitized collections • Museums and galleries archive exhibition catalogs, research papers, and provenance documentation in PDF/A

Corporate compliance: • ISO 9001 and ISO 27001 certified organizations often require PDF/A for quality management and information security documentation

How to Validate PDF/A Compliance

Creating a PDF is not the same as creating a valid PDF/A document. Validation tools analyze a PDF against the ISO 19005 requirements and report any violations.

veraPDF (open-source): Developed with funding from the European Commission's PREFORMA project, veraPDF is the industry-reference validator for PDF/A compliance. It validates all parts and conformance levels of ISO 19005.

Adobe Acrobat Pro: The Preflight tool in Adobe Acrobat Pro can validate and convert PDFs to PDF/A. It provides detailed reports of any compliance issues.

Common validation failures:Missing font embedding: Fonts referenced but not embedded in the file • Unsupported color spaces: Device-dependent colors (RGB without ICC profiles) • Transparency issues: (PDF/A-1 does not support transparency) • JavaScript or multimedia content: Dynamic elements prohibited in PDF/A • Missing XMP metadata: Required metadata streams not present

AuraPDF's PDF Health Checker analyzes documents for common compliance issues, including font embedding status, metadata completeness, and structural integrity — providing actionable recommendations for improving PDF quality.

PDF/A vs Regular PDF: Key Differences

FeatureRegular PDFPDF/A
Font embeddingOptionalRequired
External referencesAllowedProhibited
JavaScriptAllowedProhibited
EncryptionAllowedProhibited
Audio/VideoAllowedProhibited
TransparencyAllowedDepends on version
XMP MetadataOptionalRequired
Color spacesAnyDevice-independent required
File attachmentsAnyRestricted (except PDF/A-3)

Key takeaway: PDF/A trades flexibility for long-term reliability. If you need a document to be perfectly reproducible 50 years from now, PDF/A is the right choice. For everyday document sharing and collaboration, regular PDF is more practical.

Note that PDF/A prohibits encryption — because encryption depends on algorithms that may become obsolete or compromised over time. For archival documents requiring both security and longevity, cryptographic signatures (which verify integrity without preventing access) are preferred over encryption.

Frequently Asked Questions

Can I open PDF/A files with a regular PDF reader?
Yes. PDF/A is a subset of regular PDF, so any standard PDF viewer (Adobe Acrobat, Chrome, Firefox, Preview) can open PDF/A documents. They look and behave like regular PDFs.
Can I convert a regular PDF to PDF/A?
Yes, but the conversion may fail if the source PDF uses features prohibited by PDF/A (JavaScript, external fonts, encryption). Adobe Acrobat Pro and open-source tools like Ghostscript can perform this conversion.
Is PDF/A the same as a scanned PDF?
No. A scanned PDF is simply an image of each page. PDF/A is a compliance standard. A scanned PDF can be PDF/A compliant if it meets all the requirements (embedded fonts for any OCR text, device-independent colors, XMP metadata).
Why can't PDF/A files be encrypted?
PDF/A is designed for long-term preservation. Encryption algorithms may become obsolete or be broken over time, potentially making the document inaccessible. By prohibiting encryption, PDF/A ensures documents remain readable indefinitely.

Try These Tools

Read Next

A

Written by the AuraPDF Team

The AuraPDF team builds free, secure PDF tools used by thousands of people worldwide. Our guides combine hands-on expertise with technical depth to help you work with PDFs more effectively.

Learn more about us