ai_qc

Author	SHA1	Message	Date
nickviljoen	29ee941037	refactor(formatting_diff): narrow scope to bold + italic only. First real-data test against the AXA car-insurance PDFs surfaced a noise problem: the new document is a brand refresh, and every page flips font (PublicoBanner-Bold→PublicoHeadline-Bold) and colour (#893f4a→#2e3092). At medium-per-finding that crashed the diff score to 0.0 and drowned the bold-regression signal AXA actually flagged. Drop the font, size, and colour comparators. Keep bold + italic, the attributes the vision-LLM consistently misses on dense layouts. The LLM already narrates colour-scheme rebrands and font swaps in its Modified / Style-changes blocks; running both layers on the same visual change just double-counts it. Tests inverted from "X change is flagged" to "X change is NOT flagged" to lock the scope decision in. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-19 12:37:19 +02:00
nickviljoen	640bbe4671	docs(axa): note deterministic formatting layer added to axa_pdf_diff. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-19 10:23:15 +02:00
nickviljoen	9746ba249b	docs: refresh CLAUDE_AXA.md status + add AI-usage breakdown. Updates the AXA client doc to reflect the 2026-05-10 state: the status line now reads 2026-05-10 and covers Phase 6 (veraPDF), the profile split, and the dev deploy; a new "AI usage across AXA tools" section supports client-facing communication (8 of 9 tools deterministic, only axa_pdf_diff uses AI); open items are expanded to include the pending source-PDF request and the prod-deployment hold. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 11:54:24 +02:00
nickviljoen	a1cfc75309	Merge remote-tracking branch 'origin/develop' into feature/axa-accessibility-profile-split. Conflicts: CLAUDE_AXA.md	2026-05-10 11:20:09 +02:00
nickviljoen	a46ba9fc71	Split AXA accessibility check into its own profile. Removed axa_pdf_accessibility from axa_policy_document (was 8 checks, now 7) and created a new axa_accessibility profile that contains only that check. Marked the new profile strict_grade: true so a single PDF/UA-1 rule failure forces an unmistakable Fail badge on the report, mirroring how axes4 PAC is used in practice (single-purpose, binary verdict). Lets users run accessibility-only QC without sitting through the rest of the policy-document checks, and removes weight from the policy-document score that the accessibility check wasn't really earning (its 0/10 verdict was dragging the overall grade in a way that obscured the content checks). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 11:15:46 +02:00
nickviljoen	2aeff24136	Wire veraPDF into axa_pdf_accessibility for PAC-equivalent PDF/UA-1 validation. AXA's accessibility QC team uses axes4 PAC (PDF/UA-1 / Matterhorn Protocol) as their compliance gate, but our existing 9-criterion deterministic check is surface-level only and would pass documents PAC fails. Wired up the existing _run_verapdf() stub so veraPDF, the open-source Matterhorn implementation, runs as a subprocess and drives the score when available. Verified locally: veraPDF on EAA_v1.pdf reports the exact same Content (86) and Metadata (1) failure counts as PAC's report on the same document family, confirming protocol parity. Falls back cleanly to the deterministic layer when veraPDF isn't installed, so deploys are safe before the binary lands on dev/prod servers. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 10:36:03 +02:00
nickviljoen	59a0b2408c	Restructure CLAUDE.md docs: slim project-wide root, complete per-client coverage. Splits the monolithic CLAUDE.md (962 lines) into a slim project-wide root (211 lines) plus per-client files. Auto-loaded context drops ~88% per session. Changes: CLAUDE.md slimmed to project-wide essentials (architecture, auth, deployment, branch strategy, deploy scripts, prod troubleshooting, pre-session checklist), with an explicit session-start convention pointing to CLAUDE_<CLIENT>.md for client-specific work and the client roster table updated to all 10 clients with profile counts; new CLAUDE_AXA.md (document-mode pipeline + axa_policy_document profiles); new CLAUDE_DIAGEO.md (key_visual + packaging profiles, check inventories); new CLAUDE_UNILEVER.md (profiles + zero-score logic for face/new visibility); new CLAUDE_HONDA.md, CLAUDE_RANK.md, CLAUDE_GENERAL.md as stubs (clients use generic profiles only; kept for completeness and future expansion); backend/CLAUDE.md stale 932-line duplicate replaced with an 18-line redirect to root + backend-specific quick pointers. Per-client files (CLAUDE_LOREAL.md, CLAUDE_AMAZON.md, CLAUDE_BOOTS.md, CLAUDE_DOW_JONES.md) unchanged: they already had the right content. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-06 12:29:16 +02:00

7 commits
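
Note on 29ee941037: the scope decision in that commit (compare bold + italic, ignore font, size, and colour) can be illustrated with a minimal Python sketch. This is not the repository's code. The `Span` type, the `formatting_findings` function, the text-keyed matching, and the sample text "Excess applies" are all hypothetical; only the font names and colour values come from the commit message.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """Hypothetical text span with the formatting attributes a PDF extractor might report."""
    text: str
    bold: bool = False
    italic: bool = False
    font: str = ""
    colour: str = ""


# Only these attributes are compared; font, size, and colour are deliberately left out.
COMPARED_ATTRS = ("bold", "italic")


def formatting_findings(old_spans, new_spans):
    """Return (text, attribute, before, after) for every bold/italic change.

    Spans are matched by identical text, which is an assumption made to keep
    the sketch short; a real diff would need a proper alignment step.
    """
    new_by_text = {span.text: span for span in new_spans}
    findings = []
    for old in old_spans:
        new = new_by_text.get(old.text)
        if new is None:
            continue  # added or removed text is out of scope for this layer
        for attr in COMPARED_ATTRS:
            before, after = getattr(old, attr), getattr(new, attr)
            if before != after:
                findings.append((old.text, attr, before, after))
    return findings


old = [Span("Excess applies", bold=True, font="PublicoBanner-Bold", colour="#893f4a")]
new = [Span("Excess applies", bold=False, font="PublicoHeadline-Bold", colour="#2e3092")]
print(formatting_findings(old, new))
```

With this shape, a page that changes only font and colour yields no findings, while a lost bold is reported once, which matches the "X change is NOT flagged" tests the commit describes.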