video-accessibility

Author	SHA1	Message	Date
Vadym Samoilenko	76bee82119	fix(pipeline): fix 5 QA tickets — caption alignment, glossary, source_has_ad render, filler words, NL error surfacing - caption_aligner: lower match ratio 0.5→0.35, widen search window 60→150, add time-based cursor fallback on miss - gemini.py: explicit 'MUST use glossary terms' requirement in translate_vtt prompt; source_has_ad prompt now instructs not to include AD narration in captions - ingest_and_ai: load glossary for source language and pass to extract_accessibility - render_accessible_video: handle source_has_ad=True via caption-embed path (ffmpeg subtitle inject, no AD pipeline) - translate_and_synthesize: track failed languages, write translation_errors to DB, add exc_info to error log - vtt.py: expand _FILLER_PATTERNS to nl/pt/pl/uk/ru, widen EN/ES/FR/DE/IT lists - gemini_ingestion.md: strengthen line:0% placement rule, expand disfluency examples per language Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 18:36:59 +01:00
Vadym Samoilenko	c8a610b3f7	fix(vtt): auto-fix overlapping cues from AI-generated output Gemini occasionally produces captions where a cue's start_time is earlier than the previous cue's end_time. Add VTTEditor.fix_overlapping_cues() that trims each cue's end_time to 1ms before the next cue's start, applied to both captions and AD VTT immediately after AI generation. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 13:23:08 +01:00
Vadym Samoilenko	290d5e32e6	fix: 7 caption/AD quality bugs + retranslation error handling Bug fixes: - Bug 1a: source_has_ad flag prevents AI generating AD over existing professional AD; JobBrief/Job models, gemini service prompt conditional, NewBrief UI checkbox - Bug 1b: disable native textTracks on video element to prevent double captions - Bug 2: caption ALL audible speech including off-screen narrators (prompt fix) - Bug 3: DCMP §6.01 disfluency removal for EN/ES/FR/DE/IT (prompt + post-pass) - Bug 4: VTT cue settings (line:0%, position:) preserved through parser round-trip - Bug 5: Whisper word-level timestamp alignment via new caption_aligner service - Bug 6: assert_cue_alignment used .start/.end; renamed to .start_time/.end_time - New migration: backfill source_has_ad=False on existing jobs and job_briefs Also fix retranslation error handling to preserve existing GCS URIs on failure so video_native captions remain accessible if retranslation fails. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-07 15:38:20 +01:00
Vadym Samoilenko	fddf803b74	feat(translation): enforce EN-first pipeline with cue-preserving translations All translations now derive strictly from the approved English master VTT, eliminating the cue-count and timestamp drift reported by linguists (e.g. PL AD = 11 cues vs EN AD = 17 cues). Key changes: - Remove video_native translation mode entirely; all languages go through translate_vtt() which guarantees 1:1 cue alignment with EN master - Transcreation languages now use translate_vtt(style="transcreate") — same cue-preserving contract, culturally-adapted instructions - Post-translation cue alignment validator added (VTTEditor.assert_cue_alignment) - After ingestion, job moves to PENDING_QC (EN-only) instead of TRANSLATING; translation pipeline dispatches automatically when EN QC is approved - New POST /jobs/{id}/retranslate-language endpoint for PM/admin to fix legacy video_native jobs on demand - Frontend: origin badge (EN-aligned / transcreated / video-native warning), EN-first gate banner on target-language cards, Re-translate from EN button with confirm modal, removed translation mode selector from NewJob Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-06 12:11:35 +01:00
Vadym Samoilenko	5fd370c093	test: fix all unit tests — 168 passing, 0 failures - conftest.py: set required env vars before app import to prevent Settings() crash - gcs.py: lazy bucket init checks _bucket instead of _client; add @bucket.setter - vtt.py: fix float precision in _format_timestamp; include empty-text cues in parser - security.py: guard verify_password against empty hash (passlib UnknownHashError) - tts.py: _parse_timestamp raises ValueError("Invalid timestamp format: …") - emailer.py: HTML-escape job_title in _render_completion_template (XSS fix) - test_emailer.py: rewrite for Mailgun-based service (replaced SendGrid) - test_gcs.py: fix UploadFile constructor, MIME type, remove executor.submit mock - test_gemini.py: patch module-level client instead of non-existent genai.upload_file; translate_vtt tests use numbered-list mock responses matching new implementation - test_tts.py: fix aiohttp async CM mock pattern; fix error message match - test_models.py: update JobCreate to use source_is_english instead of language - test_security.py: set jwt_access_ttl_min in token test - test_cross_tenant_isolation.py: add patch to imports Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 14:02:04 +01:00
Vadym Samoilenko	a3b300b76a	docs: add canonical documentation + audit cleanup - AGENTS.md: canonical project entry point (Quick Nav, pipeline, constraints) - docs/: complete docs tree — architecture, API spec, DB schema, infra, runbook, requirements, tech stack, principles, reference ADRs, guides, tasks backlog, testing strategy - tests/README.md: test commands, structure, known gaps - README.md / CLAUDE.md / DEPLOYMENT.md: updated with canonical doc links - .archive/: backup of pre-documentation-pipeline originals - backend/uv.lock: uv dependency lockfile - Delete committed __pycache__ .pyc files (should have been gitignored) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 14:22:51 +01:00
Vadym Samoilenko	fa351e4d25	feat: per-client glossary — hybrid exact/vector retrieval + AI injection Adds full glossary system so Gemini uses client-approved terminology when generating subtitles and translations (critical for 3M brand names and product codes across 16 target locales). Backend: - lib/locales.py: BCP-47 locale registry, normalises xlsx fr_fr → fr-FR - models/glossary.py: Glossary / GlossaryVersion / GlossaryTerm + enums - services/glossary_service.py: xlsx parse (openpyxl), ingest to Mongo, hybrid retrieval (Aho-Corasick exact + Atlas Vector Search), prompt block - services/embedding_service.py: Gemini text-embedding-004, batch 100, retry - tasks/embed_glossary.py: Celery background task for async embedding - api/v1/routes_glossaries.py: CRUD endpoints under /clients/{id}/glossaries - gemini.py: _build_glossary_block(), {GLOSSARY} injection in all 4 call sites - tts.py / gemini_tts.py: pass full locale codes (no split("-")[0] truncation) - tasks/translate_and_synthesize.py: glossary lookup + injection per language - prompts: {GLOSSARY} placeholder in ingestion, targeted, transcreation prompts - pyproject.toml: +openpyxl, +pyahocorasick Frontend: - routes/admin/glossaries/: GlossaryList, GlossaryUpload, GlossaryDetail - App.tsx: 3 new routes under /admin/clients/:clientId/glossaries - ClientDetail.tsx: Glossaries card with count + quick links - types/api.ts: Glossary, GlossaryVersion, GlossaryDetail, GlossaryTerm types - lib/api.ts: 7 new API methods (upload, list, detail, terms, versions, activate, archive) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 13:03:38 +01:00
Vadym Samoilenko	6f963ff7c4	feat: DCMP compliance, descriptive transcript, new languages, QA bug fixes - Rewrote VTT translation to two-step (text-only → Gemini → apply to original timestamps) preventing caption timing desync - Added polling fallback for all processing states and Safari visibilitychange WebSocket reconnect - Added 11 new TTS languages (cs, da, fi, hu, no, sk, sv, es-419, pt-BR, fr-CA) - Updated caption/AD prompts to DCMP Captioning Key & Description Key standards (line splitting, ♪ music notation, italic tags, caption positioning, ethics guidelines) - Added descriptive transcript generation (WCAG 2.1 §1.2.1) combining captions + AD into plain text - Fixed amix normalize=0 to prevent audio loss in rendered videos - Fixed AD re-timing double-count when source_ms is None - Fixed cue block numbering to be 1-based in VttEditor and Timeline Preview Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 11:50:43 +00:00
michael	af2562096a	initial commit	2025-08-24 16:28:33 -05:00

9 commits