ai_qc/backend/document_mode
nickviljoen 2aeff24136 Wire veraPDF into axa_pdf_accessibility for PAC-equivalent PDF/UA-1 validation
AXA's accessibility QC team uses axes4 PAC (PDF/UA-1 / Matterhorn Protocol)
as their compliance gate, but our existing 9-criterion deterministic check
runs surface-level only and would pass documents PAC fails. Wired up the
existing _run_verapdf() stub so veraPDF — the open-source Matterhorn
implementation — runs as a subprocess and drives the score when available.

Verified locally: veraPDF on EAA_v1.pdf reports the exact same Content (86)
and Metadata (1) failure counts as PAC's report on the same document family,
confirming protocol parity.

Falls back cleanly to the deterministic layer when veraPDF isn't installed,
so deploys are safe before the binary lands on dev/prod servers.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-10 10:36:03 +02:00
..
data Add AXA document-mode QC pipeline (Phases 1, 3, 4, 5) 2026-05-01 18:38:14 +02:00
__init__.py Add AXA document-mode QC pipeline (Phases 1, 3, 4, 5) 2026-05-01 18:38:14 +02:00
accessibility_checks.py Wire veraPDF into axa_pdf_accessibility for PAC-equivalent PDF/UA-1 validation 2026-05-10 10:36:03 +02:00
checks.py Add AXA document-mode QC pipeline (Phases 1, 3, 4, 5) 2026-05-01 18:38:14 +02:00
diff_engine.py Add AXA document-mode QC pipeline (Phases 1, 3, 4, 5) 2026-05-01 18:38:14 +02:00
diff_report_writer.py Add AXA document-mode QC pipeline (Phases 1, 3, 4, 5) 2026-05-01 18:38:14 +02:00
dispatcher.py Add Boots Production Pack profile (multi-page document mode) 2026-05-05 12:47:13 +02:00
ingest.py Add Boots Production Pack profile (multi-page document mode) 2026-05-05 12:47:13 +02:00
page_classifier.py Add Boots Production Pack profile (multi-page document mode) 2026-05-05 12:47:13 +02:00
print_preflight_checks.py Add AXA document-mode QC pipeline (Phases 1, 3, 4, 5) 2026-05-01 18:38:14 +02:00
result_writer.py Wire veraPDF into axa_pdf_accessibility for PAC-equivalent PDF/UA-1 validation 2026-05-10 10:36:03 +02:00