video-accessibility

Author	SHA1	Message	Date
Vadym Samoilenko	3a2bbc9ca0	fix(membership): correct \$unwind option preserveNullAndEmpty → preserveNullAndEmptyArrays MongoDB 7.0 rejects the invalid key with code 28811, causing 500 on GET /organizations/{id}/members. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 18:58:07 +01:00
Vadym Samoilenko	6588feedc7	fix(membership): use 'or ""' to guard against null email/full_name u.get("key", "") returns None when key exists with null value in MongoDB, causing Pydantic ValidationError on MemberDetail.email/full_name: str → 500. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 18:41:31 +01:00
Vadym Samoilenko	a3cfe2ff8c	fix(renderer): skip ffprobe Phase 3.5 — use pre-computed freeze duration Cloud Run-generated freeze segments caused FFprobe to return code 1 with empty stderr when dispatched to the Celery ffmpeg queue, crashing the render for every language. The freeze segments are created to an exact pre-computed duration (ad_duration + silence_before + silence_after), so probing is unnecessary — assign that value directly instead. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 17:29:33 +01:00
Vadym Samoilenko	2f4925353a	feat(pause-insert): adaptive buffer, forward-snap, timeline drag + share link fix Backend (Phase A): - A1: Adaptive silence buffer — natural_gap_ms persisted per cue; renderer computes per-cue silence_before/silence_after instead of fixed 500ms; per-cue silence files - A2: Forward-preferred snap — snap_pause_point prefers boundaries up to 4s ahead over boundaries within 1.5s behind, reducing mid-scene cuts - A3: Min-gap validation — pause points with < 200ms gap trigger forward search to the next acceptable gap - natural_gap_ms added to PausePointData model and api.ts type - New config fields: whisper_snap_forward_window, whisper_snap_backward_window, ad_silence_buffer_default, ad_silence_buffer_min_after, ad_min_acceptable_gap - Tests: test_whisper_snap.py (13 tests), test_video_renderer_buffers.py Frontend (Phase B): - B1: Drag pause-point markers — pointer state machine with 3px move threshold, clamp to min/max bounds, click-without-move still opens PausePointEditor - B2: Drag freeze blocks — orange blocks translate with linked pause point - B3: Time tooltip visible during drag, hidden on release - Tests: TimelinePreview.drag.test.tsx (10 tests) Fixes: - Share link pointed to ai-sandbox.oliver.solutions — added app_url to Settings with correct optical-dev.oliver.solutions default; share_url now configurable via APP_URL env var - Removed all ai-sandbox.oliver.solutions references from docker-compose, apache config, docs, and scripts Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 16:09:09 +01:00
Vadym Samoilenko	8dee0b6ff5	fix(membership): guard against missing/invalid role_in_org in membership docs list_org_members and _membership_from_doc used bracket access on role_in_org which raises KeyError if the field is absent (old docs or direct DB inserts). Also handles ValueError if the stored value doesn't match a valid OrgRole. Falls back to OrgRole.MEMBER in both cases. Fixes 500 on GET /organizations/{org_id}/members. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 14:35:09 +01:00
Vadym Samoilenko	b427ee9f49	fix(authz): MT-3/6/7/8 org isolation + P1 English-first QC enforcement Multi-tenancy isolation (P0): - MT-3: Add get_job_or_403 (org membership check) to all 19+ job action endpoints - MT-6: Same gate added to all review_notes (5) and vtt_versions (4) handlers - MT-7: WebSocket /ws/jobs/{job_id} closes with 4403 on org mismatch; /ws/jobs passes accessible_org_ids to ConnectionManager; server-side keepalive at 20 s (asyncio.wait_for timeout) prevents proxy idle drops - MT-8: list_users scoped to org memberships for non-platform-admins WebSocket fixes (Mod Comms 2026-03-18 incident): - Frontend heartbeat lowered 30 000 → 20 000 ms (was at Apache timeout edge) - Terminal close codes 4001/4003/4004/4403 no longer trigger reconnect loop - Silently discard server "keepalive" frames alongside existing "pong" English-first QC (P1): - _assert_can_approve blocks target language approval until source is APPROVED - PRODUCTION/ADMIN roles bypass the gate - Source VTT edits reset stale APPROVED/PENDING_REVIEW/IN_REVIEW target states Tests (all passing): - backend/tests/unit/test_language_qc_english_first.py (15 cases) - backend/tests/unit/test_routes_jobs_org_isolation.py (12 cases) - backend/tests/unit/test_review_notes_org_isolation.py (16 parametrized cases) - backend/tests/unit/test_vtt_versions_org_isolation.py (16 parametrized cases) - backend/tests/unit/test_websocket_org_isolation.py (11 cases) - backend/tests/unit/test_admin_users_org_filter.py (7 cases) - frontend: useJobStatusWebSocket.terminal.test.ts (9 cases) - frontend: useJobStatusWebSocket.heartbeat.test.ts (9 cases) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 11:43:10 +01:00
Vadym Samoilenko	3bed598025	fix(glossary+jobs): add debug logging for glossary failures and fix AllJobs filter stale state - glossary_service: add step-by-step debug/warning logs at each early-return point so the exact failure reason is visible in worker logs (project not found, no active version, etc.) - glossary_service: guard against source_term_lower=None in ahocorasick automaton build - glossary_service: guard against target_locale=None in _get_translation - glossary_service: add full traceback to the outer exception catch for easier debugging - JobsList: fix statusFilter stale state — useEffect now always syncs with URL params, clearing the filter when no ?status= param is present (previously the filter was never cleared, so navigating from /jobs?status=X to /jobs kept the old filter) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 21:25:41 +01:00
Vadym Samoilenko	3fb8dce3ee	feat(ai): upgrade Gemini models to 3.1-pro-preview and 3.1-pro-tts-preview Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 21:00:32 +01:00
Vadym Samoilenko	08a8a0d636	fix(tts): convert lameenc bytearray to bytes before GCS upload lameenc.encode() returns bytearray, but google-cloud-storage's _to_bytes() only accepts bytes/str — causing TypeError on every upload_from_string() call. Cast to bytes() before returning. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 19:35:28 +01:00
Vadym Samoilenko	a53cf960ae	fix(tts): replace pydub MP3 export with lameenc (pure Python, no system ffmpeg) Gemini TTS _pcm_to_mp3 used pydub.AudioSegment.export(format='mp3') which requires a system ffmpeg binary. Worker containers don't have ffmpeg installed (video ops run on Cloud Run). Switch to lameenc which is pure Python and encodes PCM→MP3 without any system binary. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 18:24:15 +01:00
Vadym Samoilenko	31199f8705	chore: push all session changes — backend hardening, tests, apache config, deploy scripts Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 15:52:14 +01:00
Vadym Samoilenko	5fd370c093	test: fix all unit tests — 168 passing, 0 failures - conftest.py: set required env vars before app import to prevent Settings() crash - gcs.py: lazy bucket init checks _bucket instead of _client; add @bucket.setter - vtt.py: fix float precision in _format_timestamp; include empty-text cues in parser - security.py: guard verify_password against empty hash (passlib UnknownHashError) - tts.py: _parse_timestamp raises ValueError("Invalid timestamp format: …") - emailer.py: HTML-escape job_title in _render_completion_template (XSS fix) - test_emailer.py: rewrite for Mailgun-based service (replaced SendGrid) - test_gcs.py: fix UploadFile constructor, MIME type, remove executor.submit mock - test_gemini.py: patch module-level client instead of non-existent genai.upload_file; translate_vtt tests use numbered-list mock responses matching new implementation - test_tts.py: fix aiohttp async CM mock pattern; fix error message match - test_models.py: update JobCreate to use source_is_english instead of language - test_security.py: set jwt_access_ttl_min in token test - test_cross_tenant_isolation.py: add patch to imports Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 14:02:04 +01:00
Vadym Samoilenko	d5e63129dd	feat(upload): PR-3 GCS resumable chunked upload for large videos Files >100 MB bypass the load balancer via browser→GCS direct upload: - POST /jobs/upload/init — creates GCS resumable session, returns job_id + session URI - POST /jobs/upload/complete — verifies GCS object, creates job, dispatches ingestion - Frontend sends 8 MB chunks with Content-Range directly to GCS session URI - infra/gcs-cors.json + deploy-dev.sh ensure_gcs_cors() enable browser CORS on bucket Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 11:35:13 +01:00
Vadym Samoilenko	3f557724d3	feat(api): L-18 blocked-on-source, PR-10 promote-to-qc, R-12 reviewed_cues reset - POST /{job_id}/actions/blocked_on_source (L-18): linguist/reviewer flags a source video issue; moves job to QC_FEEDBACK and records blocked_on_source_reason/at/by - POST /{job_id}/actions/promote_to_qc (PR-10): production/admin manually bypasses AI processing for edge-case failures; adds audit history entry - Reset reviewed_cues to 0 on submit_for_review (R-12) so reviewer must re-acknowledge all cues after each linguist resubmit - Add assert_job_in_user_org + get_user_org_ids to core/dependencies.py (used by the new endpoints and the cross-tenant isolation test suite) - Remove unused ingest_and_ai_task / translate_and_synthesize_task imports Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 10:38:39 +01:00
Vadym Samoilenko	ff372c7322	fix(security): close MT-17/18/19, restore cross-tenant tests, quick wins Blocks 1–5 of stabilization plan: SECURITY - validation.py: restore settings.upload_max_video_bytes (T-14 regression fix) and JSON object key validation that was incorrectly removed - MT-18: add accessible_org_ids filter to list_for_reviewer/list_for_linguist so reviewers/linguists only see jobs from their own org in QC queue - MT-17: add Membership.team_ids[], write to it on invitation acceptance and direct team add/remove; migration backfills from Team.member_user_ids - MT-19: validate all target_team_ids belong to invitation's org_id at creation TESTS - Restore test_cross_tenant_isolation.py (was deleted, only .pyc remained) - Extend with MT-18 reviewer org isolation tests QUICK WINS - W-8: remove time.sleep(1) + dead debug block from POST /jobs (task was undefined — would have caused NameError → HTTP 500 on every job creation) - T-13: warn at startup when REDIS_URL configured but connection failed - T-16: skip language_qc lifespan migration when count=0 (no DB scan on startup) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-30 10:32:23 +01:00
Vadym Samoilenko	b3ace22009	feat(infra): move heavy workers to Cloud Run Jobs Heavy pipeline tasks (ingest, translate, render, rerender) now dispatch to a Cloud Run Job (va-worker) instead of local Celery workers. optical-dev runs only api + lightweight worker (notify/embed) within its 2-CPU budget. - backend/app/tasks/runner.py — Cloud Run Job entrypoint - backend/app/services/cloud_run_dispatch.py — replaces .delay() for heavy tasks - backend/Dockerfile.cloudrun — Cloud Run worker image (ffmpeg included) - docker-compose.optical-dev.yml — 2-CPU safe overrides, disables heavy workers - cloudbuild.yaml — builds va-worker image and updates Cloud Run Job - deploy-dev.sh — uses 3-file compose, builds only api+worker locally - routes_jobs, routes_admin_production, ingest_and_ai, translate_and_synthesize — all dispatch sites updated to use cloud_run_dispatch.dispatch() USE_CELERY_FALLBACK=true in .env.local to use Celery locally during dev. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 21:47:10 +01:00
Vadym Samoilenko	54fcf47887	feat(mt-14): gcs_prefix on Job, gcs_path helper, rewrite path sites - gcs_path(job, *parts) helper in gcs.py: uses job.gcs_prefix if set, falls back to job._id (legacy) — backward-compatible for all old jobs - create_job: sets gcs_prefix=orgs/{org_id}/jobs/{job_id} when organization_id is known; legacy jobs without org get null prefix - Rewrote hardcoded f"{job_id}/{lang}/..." paths in: - ingest_and_ai.py (4 upload sites) - translate_and_synthesize.py (9 sites via bulk regex) - render_accessible_video.py (3 sites: segments, video, captions) - rerender_accessible_video.py (3 sites) - tools/migrate_gcs_org_prefix.py: idempotent operator script — preflight checks, copy→verify(count+md5)→mongo update→delete, ThreadPoolExecutor(4), resume file, dry-run + rollback modes Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 20:45:12 +01:00
Vadym Samoilenko	312af2d7fb	feat(mt-11): cross-org assignment guard in language_qc Prevent PM in org A from assigning linguist/reviewer from org B. Added _assert_user_in_job_org() helper that resolves job org_id (with project fallback) and checks db.memberships for the assignee. Also added assert_user_in_org() and get_job_or_403() to core/authz.py for use in upcoming MT-13 and MT-15 commits. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 20:22:46 +01:00
Vadym Samoilenko	08fcb4daa4	feat(pr6): WS real-time updates, per-cue AD playback, upload guard W-4: team assignment (linguist/reviewer) stored on job at creation, auto-assigned to all language QC states on first GET /language-qc (lazy init via auto_assign_defaults) L-3 WS: broadcast_to_job when reviewer opens VTT for editing; QCDetail shows "User X is editing [lang]" banner (auto-clears 5s) R-5: comment broadcast via broadcast_to_job on add_comment(); QCDetail invalidates comments query on language_qc_comment WS event L-15: QCDetail subscribes to language_qc_assigned WS event → refetches lang-qc data and shows toast R-7: VttEditor gets onCuePlay prop; AD editor in QCDetail wires handleAdCuePlay → switches to accessible video mode, seeks & plays T-15: beforeunload warning in NewJob while upload is in progress Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 19:42:57 +01:00
Vadym Samoilenko	13db347d65	feat(pr3+pr4): deadline field, job clone, reject categories, reviewed-cues gate PM-1 (deadline): - Job model: add deadline field (job-level PM deadline) - POST /jobs: accept deadline as ISO date form param - JobsList: deadline column with overdue highlight (red + warning icon) - NewJob: date picker for deadline field - useMultiUpload: pass deadline to batch job creation PM-2 (clone job): - POST /jobs/{id}/clone: creates config copy in 'created' state, no reupload - useCloneJob hook, Clone button in JobsList actions - navigate to cloned job on success R-4 (reject categories): - LanguageQCState: add reject_category field - reject_language service: accept optional category (timing/mistranslation/terminology/profanity/length/other) - RejectLanguageRequest: add category field - QCDetail reject modal: category pill-selector before free-text notes R-2 (reviewed-cues tracking): - LanguageQCState: add reviewed_cues (int) + total_cues (nullable) - POST /jobs/{id}/languages/{lang}/mark-cue-reviewed endpoint - QCDetail: progress bar + approve gated at 80% for reviewer (admin bypasses) - markCueReviewed API client method Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 18:39:05 +01:00
Vadym Samoilenko	c7a6f13b10	feat(workflow): PR-2 workflow blockers — PM/Production dashboards, two-stage QC, role routing Changes: - Dashboard: add project_manager case (final review / QC counts / new job widgets) and production case (AI pipeline / failures widgets) - Sidebar: add project_manager to Final Review and Audit Log nav items; live badge counts for QC Queue (pending_qc) and Final Review (pending_final_review) - App.tsx: add project_manager to Final Review and Audit Log RoleGates (W-10, PM-18) - Login: role-based redirect after login — linguist/reviewer → /qc/queue, others → / - language_qc._assert_can_approve: enforce two-stage QC; remove linguist self-approve fallback; require reviewer assignment + submitted_for_review_at (W-6) - routes_jobs.complete_job: allow project_manager to complete jobs (W-9) - notify.py: re-enable email notifications (W-7) - Fix 400 on cue save: treat empty-string audio_description_vtt/captions_vtt as absent both in backend (truthy check) and frontend (\|\| undefined) — root cause was adVtt initialising to '' when job has no AD track Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 18:18:24 +01:00
Vadym Samoilenko	a168af1aa7	feat: two-stage QC (linguist→reviewer), project picker, comments, email notifications, deadlines - Two-stage QC workflow: linguist edits + submits → reviewer approves/rejects per language. New statuses: in_progress, pending_review, in_review. New service functions: submit_for_review, open_review, assign_reviewer, reassign_reviewer, add_comment. Linguist and reviewer deadlines. - Reject now resets language to in_progress so linguist can iterate without full re-assignment. - QC comment threads per language (append-only), visible to all assignees. - Email notifications via Mailgun on: assignment, submit-for-review, comment, approve, reject. Best-effort (failures do not roll back QC actions). asyncio.gather for parallel fan-out. - New audit actions: LANGUAGE_QC_REVIEWER_ASSIGN/REASSIGN, LANGUAGE_QC_SUBMIT, LANGUAGE_QC_OPEN_REVIEW, LANGUAGE_QC_COMMENT. - Inline project picker in NewJob: "＋ Create new project…" option with name, default languages, default linguist, default reviewer. Pre-fills languages on the new job. - Project model extended with default_languages, default_linguist_id, default_reviewer_id. - RBAC: CLIENT org-members can now create projects (backend guard relaxed). - LinguistQueue: role toggle "As linguist / As reviewer" + new status tabs. - QCDetail: two-slot assignment cards (linguist + reviewer), deadline display, role-aware action buttons, comments panel with optimistic insert and 15s refetch. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 16:59:40 +01:00
Vadym Samoilenko	bfb3a18d65	fix: switch embedding model to gemini-embedding-001 text-embedding-004 and text-multilingual-embedding-002 are not available through this API key. gemini-embedding-001 (768-dim, multilingual) is. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 16:02:12 +01:00
Vadym Samoilenko	be0bffe459	fix: get_terms_page avoids GlossaryTerm validation on partial projection Projected docs only have _id/source_term/translations; validating against GlossaryTerm (which requires glossary_id, version_id, source_term_lower) caused 500 on the terms endpoint. Return plain dicts instead. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 15:57:12 +01:00
Vadym Samoilenko	e7917cde10	fix: use 'is None' check for Motor collection to avoid NotImplementedError PyMongo Collection raises NotImplementedError on bool(), so 'if not self.collection' crashes on every audit log write. Changed to 'if self.collection is None'. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 15:23:44 +01:00
Vadym Samoilenko	a3b300b76a	docs: add canonical documentation + audit cleanup - AGENTS.md: canonical project entry point (Quick Nav, pipeline, constraints) - docs/: complete docs tree — architecture, API spec, DB schema, infra, runbook, requirements, tech stack, principles, reference ADRs, guides, tasks backlog, testing strategy - tests/README.md: test commands, structure, known gaps - README.md / CLAUDE.md / DEPLOYMENT.md: updated with canonical doc links - .archive/: backup of pre-documentation-pipeline originals - backend/uv.lock: uv dependency lockfile - Delete committed __pycache__ .pyc files (should have been gitignored) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 14:22:51 +01:00
Vadym Samoilenko	4c6624c3d4	fix: code health sweep — M-01 through M-07 M-01 authz.py: move cache_key above try block to avoid NameError when first Redis call returns None M-02 main.py: re-enable validation middleware (was TEMPORARILY DISABLED) M-03 routes_auth.py / main.py: replace print() debug lines with structured logger calls; logger now module-level in routes_auth.py M-04 gcs.py: asyncio.get_event_loop() → get_running_loop() (deprecation) M-05 translate_and_synthesize.py: bind loop vars in closure defaults to fix B023 ruff warnings (transcreate/translate_captions/etc.) M-06 rate_limiting.py: only trust X-Forwarded-For when X-Forwarded-Proto is https; use rightmost entry (proxy-appended) not leftmost M-07 validation.py: extend MongoDB operator blocklist to cover $expr, $function, $accumulator, $nin, $gte, $lte, $jsonSchema, $mod Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 14:18:02 +01:00
Vadym Samoilenko	87ae6571fe	perf: use DI connection pool for auth routes, async httpx for MS SSO (H-01, H-02) - login and microsoft_login routes now use Depends(get_database) instead of creating a per-request MongoClient — removes connection-pool churn under load - MicrosoftAuthService._get_openid_config/_get_jwks/validate_token are now async, using httpx.AsyncClient instead of blocking requests.get — removes ~400ms event-loop block per Microsoft login - Removed unused AsyncIOMotorClient import from routes_auth.py Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 14:13:50 +01:00
Vadym Samoilenko	103b409f78	fix: handle role as str or Enum in audit_logger user.role stored as plain string in MongoDB — calling .value on it caused AttributeError on every login, blocking all auth. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 14:09:46 +01:00
Vadym Samoilenko	fa351e4d25	feat: per-client glossary — hybrid exact/vector retrieval + AI injection Adds full glossary system so Gemini uses client-approved terminology when generating subtitles and translations (critical for 3M brand names and product codes across 16 target locales). Backend: - lib/locales.py: BCP-47 locale registry, normalises xlsx fr_fr → fr-FR - models/glossary.py: Glossary / GlossaryVersion / GlossaryTerm + enums - services/glossary_service.py: xlsx parse (openpyxl), ingest to Mongo, hybrid retrieval (Aho-Corasick exact + Atlas Vector Search), prompt block - services/embedding_service.py: Gemini text-embedding-004, batch 100, retry - tasks/embed_glossary.py: Celery background task for async embedding - api/v1/routes_glossaries.py: CRUD endpoints under /clients/{id}/glossaries - gemini.py: _build_glossary_block(), {GLOSSARY} injection in all 4 call sites - tts.py / gemini_tts.py: pass full locale codes (no split("-")[0] truncation) - tasks/translate_and_synthesize.py: glossary lookup + injection per language - prompts: {GLOSSARY} placeholder in ingestion, targeted, transcreation prompts - pyproject.toml: +openpyxl, +pyahocorasick Frontend: - routes/admin/glossaries/: GlossaryList, GlossaryUpload, GlossaryDetail - App.tsx: 3 new routes under /admin/clients/:clientId/glossaries - ClientDetail.tsx: Glossaries card with count + quick links - types/api.ts: Glossary, GlossaryVersion, GlossaryDetail, GlossaryTerm types - lib/api.ts: 7 new API methods (upload, list, detail, terms, versions, activate, archive) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 13:03:38 +01:00
Vadym Samoilenko	05f25a1141	feat: per-language QC workflow with linguist assignment - Job.language_qc dict tracks per-language status (pending/in_review/approved/rejected) with full event history; qc_assignments denormalized array enables efficient queue queries - language_qc service handles assign/reassign/approve/reject/reopen with atomic DB updates, audit logging, and auto-advancement to pending_final_review when all languages approved - Linguists can only edit VTT and trigger re-renders for their assigned language (403 guard) - return_to_qc resets all language statuses while preserving assignments - routes_language_qc.py: 7 new endpoints; /me/language-qc-queue for linguist queue - Startup migration idempotently seeds language_qc for all existing jobs - Frontend: LanguageQCState types, API methods, LinguistQueue page, QCDetail redesigned with per-language status badges, assignment dropdown, inline approve/reject buttons, progress bar, and reject modal; My QC Queue sidebar link Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 12:09:40 +01:00
Vadym Samoilenko	bab30e1508	feat: VTT version control — snapshots, diff, restore Backend: - VttVersion model (vtt_version.py): immutable snapshot per job/lang/kind/version - vtt_versioning service: create_version (atomic counter + GCS snapshot), list_versions, get_version, restore_version, diff_versions (difflib line-level) - routes_vtt_versions.py: GET /versions, GET /versions/{v}, GET /versions/diff, POST /versions/{v}/restore (PRODUCTION/ADMIN only, overwrites live file + audit log) - Hook create_version into update_job_vtt_content before each live-file overwrite - Mongo indexes: unique (job_id, lang, kind, version) + (job_id, created_at) Frontend: - VttVersionSummary / VttVersionFull / VttDiffResponse types - api.ts: listVttVersions, getVttVersion, diffVttVersions, restoreVttVersion - VersionsTab.tsx: lang/kind switcher, version list with A/B compare buttons, inline diff viewer (color-coded +/−), content viewer, restore with confirm dialog - JobDetail.tsx: new "VTT Versions" tab wired to VersionsTab Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 11:46:21 +01:00
Vadym Samoilenko	1563714454	feat(saas): Phase 3 — membership-based authz + Mailgun + job.organization_id authz.py (new): - MembershipContext — per-request membership dict for the current user - get_membership_context FastAPI dependency - require_org_role(min_role) — dependency factory keyed off org_id path param - require_platform_admin() - OrgScopedQuery — adds organization_id filter; platform admin passes through - bump_user_membership_cache — invalidates Redis key on membership writes dependencies.py: - get_accessible_project_ids now queries memberships collection first; legacy pm_client_ids / team.member_user_ids fallback retained until migration runs (four job-route access checks at lines 608/1054/1181/1538 are fixed via this function) routes_clients.py: - _assert_pm_or_admin and _assert_client_access are now async and query memberships - All 10 call sites updated with await + db arg emailer.py: - Switched from SendGrid to Mailgun REST API via httpx (already in requirements) - _send() is now fully async; same public method signatures preserved - send_completion_email uses _send() config.py: - Added mailgun_api_key, mailgun_domain, mailgun_from settings - sendgrid_api_key kept with empty default for backward compat migration_2026-04-28-000003: - Backfills job.organization_id from project.client_id - Creates (organization_id, status, created_at) sparse index on jobs routes_organizations.py / routes_invitations.py: - Call bump_user_membership_cache after every membership write Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 16:56:42 +01:00
Vadym Samoilenko	00fb1aacc6	feat(saas): Phase 2 — invitation flow, email templates, MS SSO zero-membership Backend: - models/invitation.py — Invitation model + create/accept/preview schemas - routes_invitations.py — org-scoped POST/GET/DELETE + public preview/accept endpoints Single-use token via find_one_and_update; sha256(token) stored in DB, plaintext in email URL - emailer.py — _send() helper; send_invitation_email, send_welcome_email, send_password_reset_email send_completion_email refactored to use _send() - migration_2026-04-28-000002 — creates invitations collection with TTL index (30d audit trail) - routes_auth.py — new MS SSO users provisioned with zero memberships instead of role=PRODUCTION; they land on "no access" page until an admin invites them - main.py — registers invitations_org_router and invitations_router Frontend: - routes/AcceptInvite.tsx — public page at /accept-invite?token=... Four states: new user (name+password), existing user (confirm), MS user, already-member - App.tsx — /accept-invite route outside RequireAuth - types/api.ts — Invitation, InvitationCreate, InvitationPreview, InvitationAcceptRequest/Response - lib/api.ts — listInvitations, createInvitation, revokeInvitation, previewInvitation, acceptInvitation - hooks/useClients.ts — useInvitations, useCreateInvitation, useRevokeInvitation Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 16:52:08 +01:00
Vadym Samoilenko	6f1be645ce	feat(saas): Phase 0+1 — Organization/Membership entities and dev branch Introduces the multi-tenant SaaS foundation alongside the existing client/team/project model (zero-downtime shim period): Backend: - app/models/organization.py — Organization + OrgRole enum (OWNER/ADMIN/MANAGER/MEMBER/VIEWER) - app/models/membership.py — Membership model with MemberDetail for enriched responses - app/services/membership_service.py — upsert/remove/list/has_org_role helpers - app/api/v1/routes_organizations.py — /organizations CRUD + /members sub-resource + /me/memberships - main.py — registers organizations router - migrations: create memberships collection (unique index) + backfill from pm_client_ids/team members Frontend: - types/api.ts — Organization, OrgRole, Membership, OrganizationCreateRequest types; Client marked @deprecated - hooks/useClients.ts — useOrganizations, useOrganization, useOrgMembers, useAddOrgMember, useUpdateOrgMember, useRemoveOrgMember, useMyMemberships - lib/api.ts — listOrganizations, getOrganization, createOrganization, updateOrganization, listOrgMembers, addOrgMember, updateOrgMember, removeOrgMember, getMyMemberships Reads fall back to the clients collection during transition; all writes go to organizations. Existing /clients endpoints and hooks are untouched. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 16:46:24 +01:00
Michael Clervi	f7d4624fc7	fix: correct cost tracker API field names and endpoint path - preflight: use allow (not allowed), deny_reason, project_external_id - record: units dict {token_input/token_output/char} instead of flat fields - record: use /usage/record endpoint path Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 13:42:29 +00:00
Vadym Samoilenko	ea21cace96	feat: replace SDK with direct HTTP integration to centralized cost tracker - New services/cost_tracker.py: sync httpx preflight()/record() + async wrappers; BudgetExceeded exception; no-op when COST_TRACKER_BASE_URL is empty - Preflight budget check added before ingestion (Gemini), per-language translation (video-native + traditional), and per-language TTS dispatch - _record_gemini_usage and _record_tts_cost now call cost_tracker directly; removes broken asyncio.get_event_loop() hack from sync Celery worker - Fix: _cost_ctx now threaded into extract_accessibility_targeted (video-native path) - Fix: user_id/cost_project_id now propagated through dispatch_language_tts → synthesize_cue_task.s() and the rerender_accessible_video.py re-render path - Remove oliver-cost-tracker SDK dependency (was commented-out/never installed) - Drop cost_tracker_outbox_path setting and get_cost_tracker() factory - Update COST_TRACKER_BASE_URL default to optical-dev.oliver.solutions in .env.prod.example, docker-compose.yml, and all Cloud Run service yamls - Cloud Run yamls use Secret Manager ref (cost-tracker-api-key) for the API key Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 13:36:15 +01:00
Vadym Samoilenko	ae2c474061	feat: integrate oliver-cost-tracker SDK into video-accessibility Add AI cost tracking to all Gemini and TTS call sites: - config.py: add COST_TRACKER_* env vars (base_url, api_key, source_app, outbox_path, enabled) - dependencies.py: add get_cost_tracker() factory (lru_cache, graceful degradation if SDK not installed) - models/job.py: add cost_tracker_project_id field for cost attribution - services/gemini.py: - add import time, _record_gemini_usage() helper (reads usage_metadata) - add _cost_ctx kwarg to extract_accessibility, extract_accessibility_targeted, transcreate_content, translate_vtt, rewrite_tts_cue - record usage after every generate_content call via asyncio.create_task() - tasks/ingest_and_ai.py: pass _cost_ctx (user_id, job_id, project_id) to extract_accessibility - tasks/translate_and_synthesize.py: build _cost_ctx from job_doc and pass to transcreate_content + translate_vtt calls - tasks/tts_synthesis.py: add user_id + cost_project_id kwargs, add _record_tts_cost() helper (records len(text) chars to cost tracker) - pyproject.toml: document SDK install instructions (comment) - .env.prod.example: add COST_TRACKER_* vars Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 11:30:46 +01:00
Vadym Samoilenko	8356dbdbfe	fix: add charset=utf-8 to VTT content-type to prevent ♪ encoding issues Without charset specification, browsers/tools interpret text/vtt as Latin-1, causing UTF-8 multi-byte characters like ♪ (U+266A) to render as garbled text (â™ª). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 14:17:16 +00:00
Vadym Samoilenko	6f963ff7c4	feat: DCMP compliance, descriptive transcript, new languages, QA bug fixes - Rewrote VTT translation to two-step (text-only → Gemini → apply to original timestamps) preventing caption timing desync - Added polling fallback for all processing states and Safari visibilitychange WebSocket reconnect - Added 11 new TTS languages (cs, da, fi, hu, no, sk, sv, es-419, pt-BR, fr-CA) - Updated caption/AD prompts to DCMP Captioning Key & Description Key standards (line splitting, ♪ music notation, italic tags, caption positioning, ethics guidelines) - Added descriptive transcript generation (WCAG 2.1 §1.2.1) combining captions + AD into plain text - Fixed amix normalize=0 to prevent audio loss in rendered videos - Fixed AD re-timing double-count when source_ms is None - Fixed cue block numbering to be 1-based in VttEditor and Timeline Preview Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 11:50:43 +00:00
Vadym Samoilenko	f4ddcce066	fix: resolve QA-reported bugs — MP3/VTT desync, crashes, notifications, and more BUG-1 & BUG-2 — Wrong audio plays after re-render / MP3 doesn't match text Root cause: audio files were named by index (cue_0.mp3, cue_1.mp3). When a cue was inserted or deleted, all following indices shifted but old MP3 files kept their original names, so re-render would play the wrong audio for the wrong cue. Fix: renamed files to cue_N_CONTENTHASH.mp3 and introduced an ad_cue_manifest stored in the job document that maps each cue index to its correct GCS URI. Re-render now reads from the manifest instead of guessing by filename. Also: editing AD cue text in the VTT editor now automatically queues TTS regeneration for changed cues — no more silent mismatches. BUG-3 — App crash / state desync when uploading VTT or clearing TTS queue Fixed handleVttFileUpload to only update local editor state after the server confirms the save — previously local state was updated first, so a network error left the UI showing content that wasn't actually saved. Fixed handleClearRegenerationQueue to only remove items from local state if the server removal succeeded — previously all items were cleared regardless. BUG-4 — AI generates different audio descriptions every time Added GenerateContentConfig(temperature=0.2, top_p=0.8, top_k=40) to the Gemini API call so output is more consistent across runs. BUG-5 — On-screen text inconsistently described Strengthened the AI prompt rule from a vague suggestion to a mandatory requirement with an explicit format: "Text on screen reads: [exact text]". Applied to both gemini_ingestion.md and gemini_ingestion_targeted.md. BUG-6 — No notification when re-render finishes Added rendering_qc toast notification and a dismissible green banner that appears in QCDetail when re-render transitions to pending_qc. The banner auto-dismisses after 10 seconds. Also increased WebSocket reconnect attempts from 5 to 15 and capped backoff at 60s to prevent falling back to manual refresh. BUG-7 — Timeline preview looks accurate but isn't after edits Added isStale prop to TimelinePreview. The timeline now shows an amber tint and "Preview may be outdated" label whenever there are unsaved pause point changes, pending TTS regenerations, or a new VTT has been uploaded. BUG-8 — ElevenLabs API errors break TTS with no fallback Added try/except fallback chain in _synthesize_single_cue: if the configured provider fails, it automatically retries with google, then gemini. BUG-9 — Concurrent re-render requests cause race conditions Made the PENDING_QC → RENDERING_QC status transition conditional (only succeeds if the job is still in PENDING_QC). Returns HTTP 409 if a re-render is already in progress. The completion transition back to PENDING_QC is also conditional so a cancelled/overridden render doesn't corrupt job state. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-24 13:23:55 +00:00
Vadym Samoilenko	c413fcb747	feat: add SDH (Subtitles for Deaf and Hard of Hearing) caption output SDH captions extend standard VTT with speaker identification labels, sound effects [PHONE RINGS], music notation ♪, and off-screen indicators. - Add sdh_vtt flag to RequestedOutputs model and frontend form - Add sdh_captions_vtt_gcs field to LangOutput model - Inject SDH generation instructions into both Gemini prompts via {SDH_FIELD} and {SDH_GUIDELINES} placeholders when requested - Upload sdh_captions.vtt to GCS in ingest task - Pass SDH through video_native translation (Gemini generates it directly) and traditional translation (translate source SDH VTT via Gemini) - Expose sdh_captions_vtt in downloads endpoint and bulk zip export Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 15:02:18 +00:00
Vadym Samoilenko	2e8a8dc287	feat: add brand context, ethics guidelines, and improved AD prompt rules - Add brand_context field (job model, API, frontend form) so clients can list brand names present in their video; Gemini uses these names instead of generic descriptors (e.g. "Sellotape" not "sticky tape") - Add ethical guidelines section to both Gemini prompts covering person-first language, consistent race/gender description only when plot-relevant, no guessing at unconfirmed identity - Revamp audio description rules: priority ordering (essential → high-priority → time-permitting), pre-teaching placement, no cinematic jargon, succinct style replacing the former "20% longer" instruction - Thread brand_context through full stack: routes → job doc → ingest task → translate task → both Gemini prompt templates Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 14:46:09 +00:00
Vadym Samoilenko	222826baa7	fix: propagate ElevenLabs voice fetch errors to frontend - elevenlabs_voices.py: re-raise exception on first fetch failure (empty cache) instead of silently returning empty list - routes_tts.py: catch get_voices() exception and return available=False with the error detail; add optional error field to ProviderVoicesResponse - VoiceSelector: show actual API error message when available=false Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-03 14:27:45 +00:00
Vadym Samoilenko	1e177a6d5c	feat: add ElevenLabs voice selection to frontend and backend Add dynamic ElevenLabs voice catalog with provider toggle in the UI, allowing users to browse ElevenLabs voices, configure stability and similarity boost settings, and preview/synthesize with ElevenLabs TTS. Backend: - New elevenlabs_voices.py service with 1-hour cached API fetching - TTS routes support ?provider= query param for voices and options - Preview endpoint routes to ElevenLabs or Gemini based on provider - stability/similarity_boost params flow through TTS synthesis pipeline - TTSPreferences model extended with ElevenLabs-specific fields - Deprecated hardcoded elevenlabs_voices config (now fetched dynamically) Frontend: - Provider toggle (Gemini/ElevenLabs) in VoiceSelector - ElevenLabsSettingsPanel with stability and similarity boost sliders - VoicePreviewButton supports provider-specific preview parameters - API client passes provider param to voices, options, and preview endpoints - New VoiceInfo, ProviderVoicesResponse, ProviderOptionsResponse types Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-03 13:58:56 +00:00
michael	030f1b67ee	fix: enforce AD cue pause_point monotonicity to preserve cue order Whisper's snap_pause_point() finds the nearest sentence boundary independently per cue, which can move a later cue's pause_point before an earlier cue's. The renderer then sorts by pause_point, producing non-sequential cue indices in the timeline. Add a forward monotonicity pass (clamp each pause_point >= previous) at three layers for defense-in-depth: - whisper_service: Phase 3 after consolidation - video_renderer: before temporal sort in _render_pause_insert_method - rerender_accessible_video: in _build_placements_with_adjustments Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 08:15:06 -06:00
michael	30483b3ec1	fix: preserve cue order when consolidated AD cues share same pause point Add ad_cue_index as secondary sort key when sorting placements, ensuring that consolidated cues maintain their original VTT order (cue 0 before cue 1). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-12 10:15:12 -06:00
michael	a6cd4cde07	fix: store source video coordinates in pause points for correct re-rendering The re-render task was using pause point coordinates from the accessible video timeline (which includes freeze frame durations) instead of the original source video coordinates. This caused pause points to exceed the source video duration and get clamped incorrectly. Changes: - Add source_ms field to PausePointData model to store source video cut point - Update video_renderer.py to populate source_ms when building pause points - Update rerender_accessible_video.py to use source_ms for placement calculations - Apply user adjustments as relative offsets (delta-based adjustment) - Update API responses and TypeScript types to include source_ms - Add backward compatibility fallback for jobs without source_ms Note: Existing jobs need to be re-processed from initial render to populate the new source_ms field. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-11 10:48:41 -06:00
michael	d965d1467a	fix: use rendered video coordinates for pause point positions Pause points were being stored with source video timestamps instead of rendered video timeline coordinates. This caused misalignment between the pause point markers and freeze frame segments in the timeline UI. Now pause points are calculated from the freeze frame segment start positions in the rendered timeline, ensuring they align correctly with the AD audio segments. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-11 09:37:31 -06:00
michael	aa6777d2c2	feat: add QC accessible video review and editing capabilities - Reorder workflow: translations now happen BEFORE QC Review step - Add language tabs to switch between translated languages in QC - Add video mode tabs (Original Video / Accessible Video) - Add interactive timeline preview showing video segments and AD cues - Enable pause point adjustment with millisecond precision - Add TTS regeneration queue for selective cue re-synthesis - Add re-render controls with optional Whisper refinement - Persist video segments and TTS MP3s to GCS for editability - Add new RENDERING_QC job status for re-render operations - Create 5 new API endpoints for accessible video editing - Add rerender_accessible_video.py Celery task Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-11 08:32:27 -06:00

1 2

100 commits