What Is PDF/A?
PDF/A is a subset of the PDF specification designed for long-term digital preservation. It was published as ISO standard 19005 in 2005, with the explicit goal of ensuring that a document remains readable and reproducible decades from now, regardless of the software environment used to open it.
The core constraint of PDF/A is self-containment. Every resource the document needs to render correctly must be embedded inside the file itself. This means all fonts are fully embedded — not just subsetted. Color spaces must be fully described using ICC profiles so colors render consistently on any display. JavaScript is prohibited because behavior may change with different runtime versions. External links and references to external resources are not allowed, since those resources may not exist in the future. Encryption is also forbidden, because an encrypted document becomes unreadable if the decryption key is lost.
The result is a file that is slightly larger than an equivalent standard PDF, but one that is fully self-describing and immune to the software dependency problems that make older digital archives unreliable.
PDF/A-1, PDF/A-2, PDF/A-3: The Variants
PDF/A-1, published in 2005, is the strictest variant. It prohibits transparency effects, layers, and attachments. It is based on PDF 1.4 and remains the most widely accepted format for court submissions and regulatory filings in jurisdictions that specify PDF/A without indicating a version.
PDF/A-2, published in 2011, relaxes some constraints while preserving the archival guarantees. It is based on PDF 1.7 and adds support for JPEG 2000 image compression (which can significantly reduce file size compared to JPEG), layers (optional content groups), and transparency. It also allows PDF/A-compliant files to be embedded as attachments, enabling compound document structures.
PDF/A-3 extends this by permitting any file type as an attachment — not just other PDF/A files. This makes it the basis of several national e-invoicing standards: the ZUGFeRD format (Germany) and Factur-X (France) both embed an XML invoice data file inside a PDF/A-3 document, so the same file serves both human readers and machine processing systems. If you receive or generate electronic invoices in these formats, you are already working with PDF/A-3.
When to Use PDF/A
The clearest use cases are legal and administrative archives. Courts in many jurisdictions — particularly in EU member states — require PDF/A for electronically filed documents. The reasoning is straightforward: a court must be able to read an exhibit in 20 years, and standard PDFs with unembedded fonts or external dependencies may not render correctly in future software environments.
Public sector documents in Europe are frequently subject to explicit regulatory requirements specifying PDF/A. French administrative circulars, German public procurement documents, and EU institutional records are commonly produced in PDF/A-2 or PDF/A-3.
Long-duration contracts and university theses represent the same logic applied to the private sector. A 10-year commercial contract or a doctoral thesis deposited in an institutional repository needs to be readable for its entire retention period. Converting to PDF/A at the point of archiving insulates the document from software obsolescence.
PDF/A Limitations
PDF/A imposes real constraints that make it unsuitable for some document types. Embedded video and audio are not permitted — if your standard PDF contains a video clip, that content will be removed or rejected during PDF/A conversion. Dynamic forms with JavaScript validation are also prohibited, which rules out many interactive PDF forms used in data collection.
Encryption is entirely excluded. This means PDF/A cannot be used for documents that require password protection or digital rights management. If a document must be both archived long-term and protected against unauthorized reading, you'll need a separate strategy — typically storing the encrypted version alongside a PDF/A version in a secure archive.
File size is slightly larger on average because full font embedding and ICC color profiles add overhead. For documents with many unique typefaces, this overhead can be noticeable. In practice, for most business documents the size difference is under 10–15% compared to an equivalent standard PDF.