ai_qc

Author	SHA1	Message	Date
nickviljoen	014a9cb8ff	feat(hp): promote HP client + add hp_copy_review profile HP is no longer a placeholder. The client gets a new hp_copy_review profile (single weighted check, client-specific visibility) as its default, plus the generic static_general and video_general profiles it already had visibility into.	2026-05-17 21:08:18 +02:00
nickviljoen	bf89466d06	feat(settings): default-profile UI per client (admin-only) for Box webhook flow Adds a "Default Profile" sub-tab to the Settings modal. Lists the current client's profiles as radio buttons, shows which is the active default and whether it's a runtime override or the static value from client_config.py. Admins click a different profile + Set to override; clear-override button reverts to the static value. Storage layer: backend/client_defaults.json (gitignored, per-server), following the same pattern as user_access.json. Resolution order in client_config.get_default_profile(): override → static default_profile field → None. The Box webhook handler is the sole consumer that needs profile selection without a logged-in user; it now reads via get_default_profile() so overrides take effect. Why a separate JSON, not rewriting client_config.py: a buggy override write can never break server boot — worst case the override is ignored and the static value applies. Cleaner separation between "static config you check in" and "runtime overrides admins make". Backend: - client_config.get_default_profile(client_id) — resolver - client_config.set_default_profile(client_id, profile_id) — validates + writes (rejects profiles not in client's profile list) - client_config.clear_default_profile_override(client_id) - GET /api/clients/<id>/default_profile (any auth'd user) - PUT /api/clients/<id>/default_profile (admin-only, _require_admin) - DELETE /api/clients/<id>/default_profile (admin-only) - Box webhook handler in api_server.py now uses get_default_profile() Frontend: - New "Default Profile" tab button + tab content in Settings modal - showTab hook loads settings when tab activates - loadDefaultProfileSettings / saveDefaultProfile / clearDefaultProfileOverride functions - DOM-construction (createElement + textContent) used throughout — no innerHTML with interpolated values, so user-controllable strings (client_id, profile_id) can never cause XSS Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-17 13:50:20 +02:00
nickviljoen	4a9ddee87f	feat(clients): wire LOREAL Box folders for webhook-driven QC First client to use the Phase 4 unattended-QC pipeline. Adds three optional fields to the loreal entry in client_config.py: - box_folder_id=381501258415 (AI-QC > INCOMING > AI QC LOREAL IN) - box_reports_folder_id=382076841334 (AI-QC > REPORTS > AI QC LOREAL REPORTS) - default_profile=loreal_static When a file lands in the INCOMING folder, /api/box/webhook will pick it up, run loreal_static (strict-grade), and upload the HTML report to the REPORTS folder. Other clients remain unaffected. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 09:50:40 +02:00
nickviljoen	8f995d557b	feat(box-jwt): JWT service-account client + webhook ingestion endpoint Adds machine-to-machine Box integration alongside the existing per-user OAuth scaffolding. The new JWT client (backend/box_jwt_client.py) is the auth/file/webhook surface used for unattended workflows: load the Custom App JSON config, sign a JWT assertion, exchange for a 60-minute service-account access token (cached + refreshed automatically), and expose file download/upload + V2 webhook CRUD + HMAC signature verification. Wires a new POST /api/box/webhook endpoint (NOT @auth.require_auth — it authenticates each delivery via Box's HMAC signature headers) that: 1. Verifies the signature against env-configured signing keys (BOX_WEBHOOK_PRIMARY_KEY / BOX_WEBHOOK_SECONDARY_KEY). 2. Dedups deliveries by box-delivery-id with a bounded in-memory cache. 3. Maps the source folder to a client via a new get_client_by_box_folder() helper on client_config. 4. Spawns a background thread that downloads the file, runs the same technical pre-flight + LLM check pipeline as the user-uploaded path, writes the HTML report to output/<client>/, uploads the report back to the client's box_reports_folder_id, and logs the run with a synthetic 'box_webhook' user. Webhook runs skip media-plan / localization / OCR context — those are user-UI concepts without a meaningful source in unattended runs. The existing /api/start_analysis path is unchanged. client_config.py gains three optional per-client fields used by the new flow when present: `box_folder_id`, `box_reports_folder_id`, and `default_profile`. Existing client entries keep working without them. .gitignore now excludes backend/config/box_jwt_config.json so the JWT config (with its embedded private key + passphrase) never lands in git. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-14 22:51:34 +02:00
Nick Viljoen	5d1eab493c	Merged in feature/add-demo-clients (pull request #4 ) Feature/add demo clients	2026-05-14 19:34:38 +00:00
nickviljoen	93dc030e0c	feat(clients): add Google, HP, Ferrero as demo placeholders Three new clients in demo/eval phase. Each uses Honda-style minimal setup (static_general + video_general only) until real scope and test assets arrive. Descriptions are placeholders to be replaced once scope is confirmed. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-14 21:30:18 +02:00
nickviljoen	d1826d83f1	chore(dow-jones): remove client_config entry Drops the 'dow_jones' block from CLIENT_PROFILES. After this, the client picker no longer renders Dow Jones; the four archived profiles are unreachable from user flows. Nine clients remain. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-14 21:12:47 +02:00
nickviljoen	a46ba9fc71	Split AXA accessibility check into its own profile Removed axa_pdf_accessibility from axa_policy_document (was 8 checks, now 7) and created a new axa_accessibility profile that contains only that check. Marked the new profile strict_grade: true so a single PDF/UA-1 rule failure forces an unmistakable Fail badge on the report — mirrors how axes4 PAC is used in practice (single-purpose, binary verdict). Lets users run accessibility-only QC without sitting through the rest of the policy-document checks, and removes weight from the policy-document score that the accessibility check wasn't really earning (its 0/10 verdict was dragging the overall grade in a way that obscured the content checks). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 11:15:46 +02:00
nickviljoen	50d0063b37	Add Boots Production Pack profile (multi-page document mode) New profile boots_ppack for QCing multi-page Boots production packs (PowerPoint-exported PDFs, 4-18 pages each). Built on top of AXA's document-mode infrastructure — branched off feature/axa-document-mode because it reuses the dispatcher, ingest, and result writer. New checks: - boots_logo_compliance — three-path scoring (master wordmark / partner lock-up / no branding) so OLIVER x BOOTS-style footer lock-ups aren't scored against master wordmark rules. Conservative without a formal Boots logo guideline. - boots_colour_palette — verifies CMYK/RGB/Hex spec values on creative- guidance pages against canonical Boots Blue / Health Primary Blue / Offer Red, plus visual sanity-check on artwork pages. Existing checks tuned: - boots_brand_name_accuracy: closed-world list semantics. Brands not on the approved list now go to names_not_on_list (manual review) instead of failing — the list is sourced from the original 7 docs and is known incomplete (Remington, Imodium, Maybelline etc. are legitimate Boots- stocked brands not on it). - boots_tandc_wording: explicit font-weight caveat — Boots Sharp Regular vs Light isn't reliably distinguishable by vision LLM at small sizes. Surfaced via font_weight_caveat field + needs_manual_check value. Page classifier (document_mode/page_classifier.py): Heuristic tags each page as cover / checklist / palette / notes / artwork. Validated on all 10 sample packs. Strict-grade exemption (Profile.strict_grade flag): Only artwork-classified pages count towards Pass/Fail. Cover, checklist, palette, and notes pages are still QC'd and reported as Informational but cannot trigger a Fail. Banner shows exactly which artwork-page checks fell below 6. Result writer extended: - Per-page table with score + page_type pill for any page_each-scope check (auto-applied as fallback) - Strict-grade banner (red on violation, green when clean) - Page_type pills throughout the per-page strip Smoke-test result (Remington 4-page pack, 2026-05-05): Overall 70.75/100, strict-grade Fail. After two iterations of prompt tuning, all three remaining strict-grade violations are real catches: orphan asterisk in T&Cs, "they may not be stocked" wording deviation, missing "Charges may apply". brand_name_accuracy 7.0 (was 3.0 before list fix), logo_compliance 9.5 (was 1.5 before lock-up path fix). Local-only — not pushed to dev or merged to develop until after Boots show-and-tell. Same posture as feature/axa-document-mode. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 12:47:13 +02:00
nickviljoen	90563b8cf2	Add AXA document-mode QC pipeline (Phases 1, 3, 4, 5) Multi-page PDF QC for AXA Ireland policy documents. Runs as a third mode alongside static + video, gated on profile.mode. New code isolated under backend/document_mode/ with new endpoints under /api/document/*. Phase 1 — Spine + 6 deterministic doc-scope checks ($0, runs in seconds): - Scope-aware dispatcher (document/targeted/page_sample/page_pair/page_each) - axa_font_inventory, axa_phone_inventory, axa_bold_words_definitions, axa_page_numbering, axa_print_code, axa_omg_versioning - Bootstrap bold-words dictionary extracted from Example 1 General Definitions Phase 3 — Old-vs-new diff (~$0.50/run, 3-5 min): - Page alignment via difflib SequenceMatcher (windowed fuzzy match) - Vision-LLM page-pair diff via Gemini 2.5 Pro (8 concurrent) - Two-slot upload UX, axa_policy_document_diff profile, mode=document_diff Phase 4 — PDF accessibility (PyMuPDF, $0): - 9 PDF/UA-1 aligned criteria (tagged structure, /MarkInfo, title, /Lang, encryption, font embedding, PDF version, XMP UA-conformance, alt-text) - _run_verapdf() stub for optional Java-based veraPDF integration later Phase 5 — Print preflight (PyMuPDF, $0): - 7 criteria (page geometry, bleed, image colour spaces, image DPI, transparency, PDF/X conformance, spot colours) Profile additions: - axa_policy_document — 8 deterministic checks, $0 cost - axa_policy_document_diff — 1 page-pair LLM check, ~$0.50/run API additions: - POST /api/document/start_analysis (single PDF) - POST /api/document/start_diff (old + new PDFs) Frontend additions: - Third profile.mode value (document_diff) in applyProfileMode() - Two-slot upload UX with PDF-only file pickers - checkFormValidity() branches by mode for the analyse-button gate Smoke-tested locally against Example 1 (Home Insurance V8, 86pp) and Example 2 (Landlord V1 vs V10, 68→74pp) with real findings caught including bold-words gaps, missing PDF/UA flag, transparency on press, V1→V10 bold-formatting fixes. Plan + integration map + gotchas in backend/AXA_DOCUMENT_MODE_PLAN.md. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 18:38:14 +02:00
nickviljoen	b32e8f0c8b	Add wsj podcast profile to Dow Jones client, File naming check added to all profiles	2026-04-29 18:09:58 +02:00
nickviljoen	fdd591b371	Add AXA and Rank clients Both get the static_general + video_general profile bundle, matching Honda's setup. Total clients goes from 8 to 10. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 14:01:06 +02:00
nickviljoen	6592c38b0a	Add per-user client access control and admin management Default-deny access model with admin grant/revoke via new User Access tab. /api/clients filters by user grants; client-scoped endpoints enforce access server-side. Admin role and client grants persist in user_access.json with audit trail in usage logs. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-22 12:33:09 +02:00
nickviljoen	20259dcad0	Add Honda client, video QC, session refresh, Amazon check tuning - Add Honda client with static_general and video_general profiles - Add video QC capability using Gemini native video analysis (4 checks: visual_quality, brand_consistency, text_legibility, pacing_flow) - Add video_general profile assigned to all 8 clients - Extend session lifetime with MSAL silent token refresh (proactive every 45min + reactive on expiry), switch cache to localStorage - Re-enable OCR layout measurements for Amazon checks - Add scope boundary notes to all 6 Amazon checks to prevent cross- check penalization (locale errors isolated to logo_country only) - Relax margins left-alignment tolerance from 1% to 4% to account for logo lockup internal padding - Update brand guidelines DB with Amazon localization matrix and processed Dove PDF summary Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 14:53:52 +02:00
nickviljoen	6f3528b54f	Add Boots client QC profile with 5 compliance checks and split CLAUDE.md client docs New boots_static profile (5 checks, 2.0 weight each) for retail promotional artwork compliance: caveat rules, brand name accuracy (~170 names), offer mechanics, T&C wording, and currency/locale. Strict grading override (any check <6 = Fail). Guidelines embedded from 7 thematic guidance documents. Also splits client-specific documentation out of CLAUDE.md into separate CLAUDE_LOREAL.md, CLAUDE_AMAZON.md, CLAUDE_BOOTS.md, and CLAUDE_DOW_JONES.md files to reduce main file size. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 09:25:58 +02:00
nickviljoen	5a57a8a064	Add Dow Jones client with 3 sub-brand QC profiles (17 new checks) New client with embedded brand guidelines for Dow Jones Corporate, MarketWatch, and Wall Street Journal sub-brands. Guidelines sourced from live.standards.site scrapes and baked into check prompts. - dow_jones_static: 5 checks (logo, color, typography, square motif, photography) - marketwatch_static: 6 checks (logo, color, typography, image treatment, layout, art direction) - wsj_static: 6 checks (logo, color tiers, typography, imagery, capitalization, layout) - System now has 7 clients, 12 profiles, 65 QC checks Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 18:33:14 +02:00
nickviljoen	8a82df4438	Add client-scoped reporting dashboard and admin panel Move reporting from settings modal into a dedicated tab within each client's main view with date range filtering, usage stats, and cost tracking. Add admin panel with platform-wide user activity overview, accessible only to configured admin users. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-26 12:34:20 +02:00
nickviljoen	24df09aa9f	Add Amazon and Boots clients with Amazon ASD 2025 QC profile Add Boots client with static_general profile and Amazon client with 6 new brand-specific QC checks based on ASD 2025 design guidelines: amazon_required_elements, amazon_logo_country, amazon_typography, amazon_headline_layout, amazon_margins, and amazon_box_placement. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-26 11:52:03 +02:00
nickviljoen	cd8345e193	Fix profile visibility filtering to respect client assignments Issue: After implementing visibility control, all profiles were showing for all clients because existing profiles don't have visibility fields. Fix: Updated get_profiles_with_visibility() to fall back to CLIENT_PROFILES mapping when visibility settings are not present in profile JSON files. Behavior: - Profiles with visibility field: Use new visibility system - Profiles without visibility field: Use CLIENT_PROFILES mapping (backward compatible) This ensures: - L'Oreal sees: loreal_static, static_general - Diageo sees: diageo_key_visual, diageo_packaging, static_general - Unilever sees: unilever_key_visual, unilever_packaging, static_general - General sees: static_general, inclusive_accessibility Tested: All clients now correctly see only their assigned profiles Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-02 13:27:47 +02:00
nickviljoen	8bc1256e82	Add usage tracking reports, profile versioning, and token tracking Implements three major feature enhancements: 1. Usage Tracking Reports - Command-line tool (generate_usage_report.py) for comprehensive usage reports - Supports text, JSON, and CSV output formats - Filters by date range, client, and user - Aggregates statistics by client, user, profile, and date - Automated report generation via cron jobs 2. Profile Auto-Versioning & Visibility Control - Automatic version control: edits create new versions (v2, v3, etc.) - Original profiles preserved for rollback capability - Profile visibility control (all clients vs client-specific) - Client-profile relationship management with dynamic updates - Audit trail with timestamps and user tracking 3. Actual Token Usage Tracking - Captures real token counts from OpenAI and Gemini APIs - Precise cost calculations instead of estimates (99% accuracy) - Per-check and per-provider token breakdowns - Pricing validation tool (validate_pricing.py) - Token usage optimization recommendations Key Files Added: - backend/generate_usage_report.py - Usage report generator - backend/validate_pricing.py - Pricing validation tool - backend/USAGE_REPORTS.md - Usage reports documentation - backend/PROFILE_MANAGEMENT.md - Profile versioning guide - backend/TOKEN_TRACKING_ENHANCEMENT.md - Token tracking guide - backend/PRICING_GUIDE.md - Pricing validation guide - backend/NEW_FEATURES_QUICKSTART.md - Quick start guide - IMPLEMENTATION_SUMMARY.md - Complete implementation overview Key Files Modified: - backend/api_server.py - Profile versioning, token passthrough - backend/client_config.py - Visibility-aware profile filtering - backend/llm_config.py - Token usage extraction from APIs - backend/usage_tracker.py - Actual token tracking and cost calculation - CLAUDE.md - Updated documentation with new features Benefits: - Accurate cost tracking with real token usage - Safe profile editing with version history - Flexible profile visibility for multi-tenant setup - Comprehensive usage analytics for optimization - Better budget forecasting and client billing Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-02 13:22:33 +02:00
nickviljoen	08a9901d70	Create L'Oréal Static profile and update client configurations Created new L'Oréal Static profile (loreal_static.json): - Focused 3-check profile for L'Oréal digital static assets - Checks: language_consistency, text_readability_general, background_contrast_general - Weighting: 3.34, 3.33, 3.33 (total 10.0 for 100-point scale) - All checks use Gemini LLM Updated client_config.py: - L'Oréal: Added 'loreal_static' profile, kept 'static_general', removed 'general_check' - Diageo: Added 'static_general', removed 'general_check' - Unilever: Added 'static_general', removed 'general_check' - General: Added 'static_general', removed 'general_check' Result: - Static General (10 checks) now available to ALL clients - L'Oréal Static (3 checks) exclusive to L'Oréal - General Check profile removed from all clients (deprecated) Profile distribution by client: - Diageo: diageo_key_visual, diageo_packaging, static_general - Unilever: unilever_key_visual, unilever_packaging, static_general - L'Oréal: loreal_static, static_general - General: static_general, inclusive_accessibility Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-02 11:59:47 +02:00
nickviljoen	16741a96d6	Add L'Oréal Static General profile with multi-file queue and enhanced reporting ## New Features ### L'Oréal Static General Profile - Created new profile with 3 checks optimized for digital marketing assets - Even weighting (33.3% each) for 100-point scoring scale - Removed print-specific requirements (3m viewing distance) - Focus on marketing text vs product packaging distinction ### Multi-File Queue System (web_ui.html) - Added file queue functionality for batch processing - Users can now upload and process multiple files simultaneously - Queue displays file status (pending, analyzing, complete, error) - Individual file removal and queue clearing options - Progress tracking for batch operations ### New General QC Checks 1. background_contrast_general - Optimized for digital assets (no distance requirements) - Checks logo, product, and marketing text contrast - Detects overlapping and blending issues - Provides element-by-element breakdown 2. text_readability_general - Focus on marketing text only (excludes product packaging) - Checks for overlapping elements - Digital readability optimization - Specific issue identification 3. language_consistency (enhanced) - Better distinction between marketing and packaging text - Detailed language detection and reporting - Lists specific text analyzed ### Usage Tracking System - Added usage_tracker.py for analysis logging - Tracks user activity, profile usage, and costs - Daily log files in JSONL format - Cost estimation per LLM provider ## Bug Fixes ### Authentication & User Management - Fixed Flask 'g' import missing issue - Fixed user info access in background threads - Pass user_info to threads instead of accessing g.user - Improved error handling for usage logging ### HTML Report Generation - Fixed missing analysis details in reports - Now extracts and displays all JSON fields properly - Shows comprehensive breakdowns: - Analysis details - Elements checked (logo, product, text) - Marketing text found - Issues identified - Specific recommendations - No more blank "Pass/Fail" results ### Scoring System - Fixed usage_tracker to handle dict of check results (not list) - Better handling of model_used field variations - Skip non-dict check results gracefully ## Configuration Changes ### Model Versions (llm_config.py) - Fixed invalid GPT-4.1 model ID to gpt-4o - Added Gemini 3 Pro beta model option - AVAILABLE_MODELS dict for UI selection - Model version override support ### Profile Updates - Static General: 3 checks, total weight 10.0 - Each check: text_readability_general (3.33), background_contrast_general (3.33), language_consistency (3.34) - Maximum score: 100 points ## Technical Improvements - Enhanced prompt engineering for consistent LLM outputs - Mandatory detailed explanations in all checks - Structured JSON responses with comprehensive fields - Better error messages and fallback handling - Client configuration support (client_config.py) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-02 10:58:39 +02:00

22 commits