Where can I get an example PDF/UA file to test?

Download any of the three at the top of this article — an invoice, a financial report, and a CV exported from ReflowPDF. Each validates as PDF/UA-1 in veraPDF with zero failures. Open them in veraPDF (verapdf.org), the PAC checker, or Acrobat's accessibility checker and verify it yourself.

What is a Tagged PDF / PDF/UA?

A Tagged PDF carries an invisible structure tree describing what the content IS — headings, paragraphs, lists, tables, figures, links — plus the document language. PDF/UA (ISO 14289-1) is the standard for doing it correctly. Without tags, a screen reader sees only positioned glyphs and cannot read the document in a meaningful order.

Do other HTML-to-PDF tools produce accessible PDFs?

Most (dompdf, wkhtmltopdf) produce no tags at all — there is nothing for a screen reader to read. A few commercial engines (Prince, Antenna House, PDFreactor) do tag, typically at four-figure licensing. The common fallback is exporting from Word/InDesign and remediating by hand in Acrobat. ReflowPDF tags on export, from the browser.

Is ReflowPDF output certified PDF/UA?

There is no automatic per-document certificate. PDF/UA conformance is a producer claim plus, for full conformance, human review. Our output validates clean against veraPDF (the reference validator) on every built-in template. The parts a machine cannot judge — whether your alt text is meaningful, whether the reading order makes sense — remain the author's responsibility, as they are with every authoring tool.

What do I do in the editor to get an accessible export?

Write alt text on images (the editor pre-fills a sensible default and offers a 'decorative' toggle), use headings, and set the document language and title in Page Settings. The engine turns those into the correct PDF/UA tags automatically — structure tree, table header scopes, list structure, marked content, language, the works.

Why do I need an accessible PDF?

It makes documents usable by people who rely on screen readers — and it's increasingly a legal requirement: the EU Accessibility Act (in force June 2025), EN 301 549 for EU public-sector procurement, and US Section 508 all expect accessible PDFs. Inaccessible documents have driven ADA complaints in finance, education, and healthcare.

We Made an HTML-to-PDF Engine That Passes veraPDF (PDF/UA-1) — Here Are the Files, Check Them Yourself

Don’t take our word for it. Download these three PDFs and run them through any PDF/UA validator:

**Invoice (PDF/UA-1)** — a one-page billing document
**Financial report (PDF/UA-1)** — multi-page, table-heavy
**CV / résumé (PDF/UA-1)** — headings, lists, contact details

Drop any of them into veraPDF (the reference validator from the PDF Association), the PAC checker, or Acrobat’s accessibility checker. You’ll get the same result we do: PDF/UA-1, zero failures. The report alone passes 21,554 individual checks with none failing.

These came straight out of ReflowPDF’s HTML-to-PDF engine. No Acrobat. No manual remediation. That’s the part that’s rare — so here’s how it works and what it took.

What “accessible PDF” actually means

To a screen reader, an ordinary PDF is a bag of glyphs scattered at x/y coordinates with no idea which ones form a heading, which form a table, or what order to read them in. The document is perfectly legible to a sighted reader and completely opaque to everyone else.

A Tagged PDF fixes that by carrying a second, invisible layer: a structure tree — Document → H1 → P → Table → TR → TH/TD → … — that says what each piece of content is. PDF/UA (ISO 14289-1, “Universal Accessibility”) is the standard that defines how to build that tree correctly. It’s the difference between a document assistive technology can navigate and a wall of noise.

Why most HTML-to-PDF tools can’t do it

Building that tree requires knowing the structure. Most engines (dompdf, wkhtmltopdf) flatten everything to drawing operators and discard the structure — there’s nothing left to tag. The engines that tag well (Prince, Antenna House, PDFreactor) are premium commercial software starting in the thousands. The mainstream alternative is “export from Word, remediate by hand in Acrobat” — slow and easy to get wrong.

ReflowPDF is different for an architectural reason: the editor already works on the document’s semantic tree. A heading is a heading and a table is a table because that’s the model you edit — not a guess from font sizes. On export, we walk that same tree and emit the matching PDF/UA tags. Accessibility falls out of the architecture instead of being bolted on.

Concretely, every export now carries:

A real structure tree (/StructTreeRoot) with /MarkInfo /Marked true.
Marked content tying every glyph and image to the tree by MCID; decoration (backgrounds, borders, rules) marked /Artifact so readers skip it.
Headings normalized to start at H1 and never skip a level.
Tables as Table → THead/TBody → TR → TH/TD, with /Scope on every header cell.
Lists as L → LI → { Lbl, LBody }.
Figures with /Alt; links with /Contents wired into the tree.
Document language (/Lang) and title (/Info /Title + XMP dc:title), plus the pdfuaid identifier.

What it actually took (the unglamorous part)

We didn’t assume it worked — we ran every one of our 50 built-in templates through veraPDF. 16 of them failed. The validator is strict, and it caught real bugs a lighter checker had waved through:

Artifacts marked with the wrong operator. We were emitting /Artifact BDC where the spec requires /Artifact BMC (no property list). One missing operand meant the validator read every background and border as untagged content — 38 failures on one document.
Tables broken across page breaks. When a long table split onto a second page, the continuation rows lost their parent and ended up hanging off the document root instead of the table.
Headings that skip levels. Templates that jumped from an h1 to an h3, or started at h2 with no h1 at all — invalid heading hierarchy.
Images with no alt text becoming figures with no description.

Each is now handled automatically, for any document — heading levels are normalized, table sections are re-nested even across pages, undescribed images become artifacts, and so on. We re-ran the full set: 50 of 50 templates validate clean. The three files at the top of this article are three of them.

Where automation stops (the honest part)

No tool — not ReflowPDF, not Prince, not Acrobat — can make a document fully conformant on its own, because PDF/UA deliberately requires human judgment for things software can’t check:

Is the alt text meaningful (“Q1 revenue chart, up 12%”) or useless (“image1”)?
Is the reading order logical?
Are headings used semantically, not just because the text looked big?

ReflowPDF does the machine half completely and guides the human half — alt text with a sensible default and a “decorative” toggle, language and title in settings, headings you control. But the words in the alt field are yours, and for high-stakes regulated documents a human accessibility review is still the right final step. That’s true of every authoring tool; we just say it out loud.

So the honest claim is: ReflowPDF exports Tagged PDF that validates clean as PDF/UA-1 in veraPDF — accessible by construction, from the browser, with no manual remediation. Download the files above and check it.

Why it matters now

Accessibility stopped being optional. The EU Accessibility Act took effect in June 2025; EN 301 549 governs EU public-sector procurement; US Section 508 binds federal agencies and contractors; and inaccessible PDFs have driven ADA complaints in finance, education, and healthcare. If you publish into any of those worlds, untagged PDFs are a liability — and “export from Word, fix in Acrobat” doesn’t scale.

Editing a PDF structurally was always the point of ReflowPDF. Tagging it for accessibility turned out to be the same capability, pointed at a different problem.

Don’t take our word for it. Download these three PDFs and run them through any PDF/UA validator:

**Invoice (PDF/UA-1)** — a one-page billing document
**Financial report (PDF/UA-1)** — multi-page, table-heavy
**CV / résumé (PDF/UA-1)** — headings, lists, contact details

These came straight out of ReflowPDF’s HTML-to-PDF engine. No Acrobat. No manual remediation. That’s the part that’s rare — so here’s how it works and what it took.

What “accessible PDF” actually means

Why most HTML-to-PDF tools can’t do it

Concretely, every export now carries:

A real structure tree (/StructTreeRoot) with /MarkInfo /Marked true.
Marked content tying every glyph and image to the tree by MCID; decoration (backgrounds, borders, rules) marked /Artifact so readers skip it.
Headings normalized to start at H1 and never skip a level.
Tables as Table → THead/TBody → TR → TH/TD, with /Scope on every header cell.
Lists as L → LI → { Lbl, LBody }.
Figures with /Alt; links with /Contents wired into the tree.
Document language (/Lang) and title (/Info /Title + XMP dc:title), plus the pdfuaid identifier.

What it actually took (the unglamorous part)

Artifacts marked with the wrong operator. We were emitting /Artifact BDC where the spec requires /Artifact BMC (no property list). One missing operand meant the validator read every background and border as untagged content — 38 failures on one document.
Tables broken across page breaks. When a long table split onto a second page, the continuation rows lost their parent and ended up hanging off the document root instead of the table.
Headings that skip levels. Templates that jumped from an h1 to an h3, or started at h2 with no h1 at all — invalid heading hierarchy.
Images with no alt text becoming figures with no description.

Where automation stops (the honest part)

No tool — not ReflowPDF, not Prince, not Acrobat — can make a document fully conformant on its own, because PDF/UA deliberately requires human judgment for things software can’t check:

Is the alt text meaningful (“Q1 revenue chart, up 12%”) or useless (“image1”)?
Is the reading order logical?
Are headings used semantically, not just because the text looked big?

Why it matters now

Editing a PDF structurally was always the point of ReflowPDF. Tagging it for accessibility turned out to be the same capability, pointed at a different problem.

We Made an HTML-to-PDF Engine That Passes veraPDF (PDF/UA-1) — Here Are the Files, Check Them Yourself

What “accessible PDF” actually means

Why most HTML-to-PDF tools can’t do it

What it actually took (the unglamorous part)

Where automation stops (the honest part)

Why it matters now

Frequently asked questions

Keep reading

PDF/A-3a from HTML: Archival + Accessible PDFs That Pass veraPDF (Download & Check)

How to Edit Tables in a PDF Without Destroying the Layout

Overlay Editing vs Structural Editing: Why Your PDF Editor Breaks Documents

We Made an HTML-to-PDF Engine That Passes veraPDF (PDF/UA-1) — Here Are the Files, Check Them Yourself

What “accessible PDF” actually means

Why most HTML-to-PDF tools can’t do it

What it actually took (the unglamorous part)

Where automation stops (the honest part)

Why it matters now

Frequently asked questions

Keep reading

PDF/A-3a from HTML: Archival + Accessible PDFs That Pass veraPDF (Download & Check)

How to Edit Tables in a PDF Without Destroying the Layout

Overlay Editing vs Structural Editing: Why Your PDF Editor Breaks Documents