video-accessibility

Author	SHA1	Message	Date
Vadym Samoilenko	000e99c2d0	feat(audit): add missing AuditAction enum values for clients, orgs, invitations, QC, briefs, share	2026-05-14 11:28:30 +01:00
Vadym Samoilenko	4645e67611	fix(glossary-list): show real embedding progress in glossary list view - Add current_version_embedding_status/embedded_count/term_count to GlossaryResponse - Batch-fetch current versions in list endpoint (single extra query, not N queries) - Add get_versions_by_ids() helper to glossary_service - Fix GlossaryList.tsx: embeddingBadge('') → embeddingBadge(g) with real status + pct Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-13 19:00:56 +01:00
Vadym Samoilenko	ca312d48fa	chore(lint): fix all ruff errors — 0 warnings remaining - B904 (55): add `from err` / `from None` to raise-in-except across 13 files - F821 (1): add missing HTTPException import in routes_language_qc.py - F841 (7): remove unused variable assignments (current_user, job_title, tts_provider, etc.) - W293 (13): strip trailing whitespace from blank lines - C416 (4): rewrite unnecessary dict comprehensions as dict() - C401 (1): rewrite unnecessary generator as set comprehension - E701 (4): split multi-statement lines in cost_tracker.py - E741 (1): rename ambiguous `l` to `lang` in cloud_run_dispatch.py - B007 (4): prefix unused loop variables with _ in tts.py, video_renderer.py - I001 (1): sort imports in tasks/__init__.py (move stdlib to top) - E402 (3): move threading/time/signals imports to top of tasks/__init__.py - UP042 (9): replace (str, Enum) with StrEnum in all model/schema enums Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-13 17:13:08 +01:00
Vadym Samoilenko	290d5e32e6	fix: 7 caption/AD quality bugs + retranslation error handling Bug fixes: - Bug 1a: source_has_ad flag prevents AI generating AD over existing professional AD; JobBrief/Job models, gemini service prompt conditional, NewBrief UI checkbox - Bug 1b: disable native textTracks on video element to prevent double captions - Bug 2: caption ALL audible speech including off-screen narrators (prompt fix) - Bug 3: DCMP §6.01 disfluency removal for EN/ES/FR/DE/IT (prompt + post-pass) - Bug 4: VTT cue settings (line:0%, position:) preserved through parser round-trip - Bug 5: Whisper word-level timestamp alignment via new caption_aligner service - Bug 6: assert_cue_alignment used .start/.end; renamed to .start_time/.end_time - New migration: backfill source_has_ad=False on existing jobs and job_briefs Also fix retranslation error handling to preserve existing GCS URIs on failure so video_native captions remain accessible if retranslation fails. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-07 15:38:20 +01:00
Vadym Samoilenko	fddf803b74	feat(translation): enforce EN-first pipeline with cue-preserving translations All translations now derive strictly from the approved English master VTT, eliminating the cue-count and timestamp drift reported by linguists (e.g. PL AD = 11 cues vs EN AD = 17 cues). Key changes: - Remove video_native translation mode entirely; all languages go through translate_vtt() which guarantees 1:1 cue alignment with EN master - Transcreation languages now use translate_vtt(style="transcreate") — same cue-preserving contract, culturally-adapted instructions - Post-translation cue alignment validator added (VTTEditor.assert_cue_alignment) - After ingestion, job moves to PENDING_QC (EN-only) instead of TRANSLATING; translation pipeline dispatches automatically when EN QC is approved - New POST /jobs/{id}/retranslate-language endpoint for PM/admin to fix legacy video_native jobs on demand - Frontend: origin badge (EN-aligned / transcreated / video-native warning), EN-first gate banner on target-language cards, Re-translate from EN button with confirm modal, removed translation mode selector from NewJob Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-06 12:11:35 +01:00
Vadym Samoilenko	68ac65ac05	fix(briefs): fix Project/Assign-To dropdowns and expand Requested Outputs Projects: - PM now sees all active projects (same as admin/production) — was filtering to empty when pm_client_ids and org memberships were both unset Assign To: - Replaced useOrganizations()+useOrgMembers() with a new GET /admin/brief-assignees endpoint accessible to all authenticated users — returns active admin/PM/production users sorted by name; shows role next to name in dropdown Requested Outputs: - Added SDH Captions (VTT), Descriptive Transcript, Accessible Video (MP4) - Accessible Video shows Pause Insert / Voice Overlay radio selector - Added descriptive_transcript field to RequestedOutputs model (backend + frontend) Access: - Brief routes now open to 'client' role in addition to admin/PM/production Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 17:54:21 +01:00
Vadym Samoilenko	2f4925353a	feat(pause-insert): adaptive buffer, forward-snap, timeline drag + share link fix Backend (Phase A): - A1: Adaptive silence buffer — natural_gap_ms persisted per cue; renderer computes per-cue silence_before/silence_after instead of fixed 500ms; per-cue silence files - A2: Forward-preferred snap — snap_pause_point prefers boundaries up to 4s ahead over boundaries within 1.5s behind, reducing mid-scene cuts - A3: Min-gap validation — pause points with < 200ms gap trigger forward search to the next acceptable gap - natural_gap_ms added to PausePointData model and api.ts type - New config fields: whisper_snap_forward_window, whisper_snap_backward_window, ad_silence_buffer_default, ad_silence_buffer_min_after, ad_min_acceptable_gap - Tests: test_whisper_snap.py (13 tests), test_video_renderer_buffers.py Frontend (Phase B): - B1: Drag pause-point markers — pointer state machine with 3px move threshold, clamp to min/max bounds, click-without-move still opens PausePointEditor - B2: Drag freeze blocks — orange blocks translate with linked pause point - B3: Time tooltip visible during drag, hidden on release - Tests: TimelinePreview.drag.test.tsx (10 tests) Fixes: - Share link pointed to ai-sandbox.oliver.solutions — added app_url to Settings with correct optical-dev.oliver.solutions default; share_url now configurable via APP_URL env var - Removed all ai-sandbox.oliver.solutions references from docker-compose, apache config, docs, and scripts Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 16:09:09 +01:00
Vadym Samoilenko	5d8d992e5a	feat(briefs+notify+downloads): fix projects dropdown, add assignee, expand languages, fix PM email, add Download All - NewBrief: use useAllProjects() (was useProjects('') which never fired) - NewBrief: expand languages from 12 to 52 options with region variants - NewBrief: add Assign To dropdown from org members - Backend: add GET /clients/all-projects endpoint for cross-client project listing - Backend: add assignee_id to JobBriefCreate/JobBriefResponse models + routes - notify.py: send completion email to PMs (pm_client_ids) not client user — fixes email never arriving (was looking up users._id by client entity ID) - Downloads: add Download All button that fetches all files sequentially Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 21:47:28 +01:00
Vadym Samoilenko	f681bd4f53	feat: add Stop Process button to cancel in-progress jobs Adds POST /jobs/{id}/cancel endpoint that revokes the Celery task and sets status to 'cancelled'. Shows a confirmation widget in the job detail sidebar for admin/production roles when the job is in an active processing state. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 19:50:39 +01:00
Vadym Samoilenko	105895dd14	feat: apply EN source VTT changes to all target languages When a reviewer saves the source language VTT during QC and confirms the re-translate dialog, all target languages are re-translated via Celery. Job transitions to `translating` and returns to `pending_qc` when done. Existing polling in useJob covers progress display. - schemas/job.py: add `retranslate_languages: bool` to VttUpdateRequest - audit_log.py: add VTT_RETRANSLATE audit action - translate_and_synthesize_task: accept languages/retranslate params, filter to specified languages, skip video render, return to PENDING_QC - routes_jobs.py: add _trigger_retranslation helper, call after VTT save - types/api.ts: add retranslate_languages to VttUpdateRequest - useJob.ts: invalidate all lang VTTs on retranslate - QCDetail.tsx: confirmation dialog when saving source VTT with targets Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 17:13:06 +01:00
Vadym Samoilenko	31199f8705	chore: push all session changes — backend hardening, tests, apache config, deploy scripts Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 15:52:14 +01:00
Vadym Samoilenko	e4b350cd7d	feat(ux): R-8 linguist language warn, PM CC editing, timeline right-click + CC insert R-8 — Linguist language competence: - Add User.languages[] BCP-47 field to backend model + UserResponse schema - Frontend: show amber warning in assign modal when selected linguist has no competence listed for the target language PM VTT editing (FinalDetail): - PM and ADMIN can now edit captions/AD in the final review stage - VttEditor becomes read-write with onCueSave wired to updateVttMutation - Other roles remain read-only Timeline right-click + add pause: - Right-click anywhere on the timeline opens a context menu showing the timestamp - If near a pause point marker: "Edit timing" + "Regenerate TTS" options - If on empty space: "Add AD cue at Xs" → inserts a new AD cue in the editor - Pause point markers widened from 1px → 2px (3px on hover) for easier clicking - Right-click on a pause point marker directly opens the editor VttEditor insertAtTimeMs prop: - New prop triggers programmatic insert at a specific video timestamp - Used by the timeline right-click "Add AD cue here" action Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 10:51:31 +01:00
Vadym Samoilenko	ff372c7322	fix(security): close MT-17/18/19, restore cross-tenant tests, quick wins Blocks 1–5 of stabilization plan: SECURITY - validation.py: restore settings.upload_max_video_bytes (T-14 regression fix) and JSON object key validation that was incorrectly removed - MT-18: add accessible_org_ids filter to list_for_reviewer/list_for_linguist so reviewers/linguists only see jobs from their own org in QC queue - MT-17: add Membership.team_ids[], write to it on invitation acceptance and direct team add/remove; migration backfills from Team.member_user_ids - MT-19: validate all target_team_ids belong to invitation's org_id at creation TESTS - Restore test_cross_tenant_isolation.py (was deleted, only .pyc remained) - Extend with MT-18 reviewer org isolation tests QUICK WINS - W-8: remove time.sleep(1) + dead debug block from POST /jobs (task was undefined — would have caused NameError → HTTP 500 on every job creation) - T-13: warn at startup when REDIS_URL configured but connection failed - T-16: skip language_qc lifespan migration when count=0 (no DB scan on startup) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-30 10:32:23 +01:00
Vadym Samoilenko	595897e61a	feat(w-12): JobBrief model, endpoints, migration + brief→job linkage - JobBrief model (DRAFT→SUBMITTED→APPROVED→FULFILLED) with 6 CRUD endpoints: list, create, get, patch (DRAFT only), submit, approve - All endpoints use MembershipContext; read=VIEWER, mutate=MANAGER, approve=ADMIN for org-scoped access - create_job accepts brief_id Form field; validates APPROVED brief, copies organization_id/project_id/deadline from brief, marks brief FULFILLED after job insert - organization_id now populated from project client_id on job create (fixes missing multi-tenant field on new jobs) - migration_2026-04-29-000001: job_briefs collection + 4 indexes - Wired briefs router into main.py Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 20:38:08 +01:00
Vadym Samoilenko	3e3be935c6	feat(w-13): structured Job.failure schema, PROCESSING_FAILED status, audit actions Add JobFailure model (step, type, message, retriable, occurred_at, retry_count) to job.py. Add PROCESSING_FAILED to JobStatus (legacy TTS_FAILED/RENDER_FAILED preserved for back-compat). Add missing Job fields that existed in DB but not the Pydantic model: organization_id, brief_id, gcs_prefix, initial_linguist_id, initial_reviewer_id, failure, retry_count. Add JOB_TASK_FAILED, JOB_RETRY, JOB_BULK_RETRY to AuditAction enum. Add migration 2026-04-29-000000: processing_failed in schema validator + compound indexes (failure.step/status) and (status/org_id/created_at). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 20:27:28 +01:00
Vadym Samoilenko	bdfa0f82ab	fix(lint): restore baseline lint count — no new errors introduced QCDetail.tsx: 4 new `any` types replaced with `unknown` + type casts. backend: ruff auto-fix sorted imports, removed unused imports, updated Optional[X] → X \| None in routes_share + share_token model. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 19:16:35 +01:00
Vadym Samoilenko	abf81515a4	feat(pm15): share read-only link for client preview Backend: - ShareToken model (share_tokens collection) - POST /jobs/{id}/share — create token (PM/PROD/ADMIN) - GET /jobs/{id}/share — list active tokens - DELETE /jobs/{id}/share/{token_id} — revoke token - GET /public/share/{token} — unauthenticated preview with signed GCS URLs (6h TTL) Returns video, captions, AD for all languages Frontend: - ShareView.tsx — public page at /share/:token with language switcher, video player, download tiles - App.tsx — /share/:token route (no auth wrapper) - QCDetail.tsx — "↗ Share link" button in header → modal to generate + copy link Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 18:56:44 +01:00
Vadym Samoilenko	13db347d65	feat(pr3+pr4): deadline field, job clone, reject categories, reviewed-cues gate PM-1 (deadline): - Job model: add deadline field (job-level PM deadline) - POST /jobs: accept deadline as ISO date form param - JobsList: deadline column with overdue highlight (red + warning icon) - NewJob: date picker for deadline field - useMultiUpload: pass deadline to batch job creation PM-2 (clone job): - POST /jobs/{id}/clone: creates config copy in 'created' state, no reupload - useCloneJob hook, Clone button in JobsList actions - navigate to cloned job on success R-4 (reject categories): - LanguageQCState: add reject_category field - reject_language service: accept optional category (timing/mistranslation/terminology/profanity/length/other) - RejectLanguageRequest: add category field - QCDetail reject modal: category pill-selector before free-text notes R-2 (reviewed-cues tracking): - LanguageQCState: add reviewed_cues (int) + total_cues (nullable) - POST /jobs/{id}/languages/{lang}/mark-cue-reviewed endpoint - QCDetail: progress bar + approve gated at 80% for reviewer (admin bypasses) - markCueReviewed API client method Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 18:39:05 +01:00
Vadym Samoilenko	a168af1aa7	feat: two-stage QC (linguist→reviewer), project picker, comments, email notifications, deadlines - Two-stage QC workflow: linguist edits + submits → reviewer approves/rejects per language. New statuses: in_progress, pending_review, in_review. New service functions: submit_for_review, open_review, assign_reviewer, reassign_reviewer, add_comment. Linguist and reviewer deadlines. - Reject now resets language to in_progress so linguist can iterate without full re-assignment. - QC comment threads per language (append-only), visible to all assignees. - Email notifications via Mailgun on: assignment, submit-for-review, comment, approve, reject. Best-effort (failures do not roll back QC actions). asyncio.gather for parallel fan-out. - New audit actions: LANGUAGE_QC_REVIEWER_ASSIGN/REASSIGN, LANGUAGE_QC_SUBMIT, LANGUAGE_QC_OPEN_REVIEW, LANGUAGE_QC_COMMENT. - Inline project picker in NewJob: "＋ Create new project…" option with name, default languages, default linguist, default reviewer. Pre-fills languages on the new job. - Project model extended with default_languages, default_linguist_id, default_reviewer_id. - RBAC: CLIENT org-members can now create projects (backend guard relaxed). - LinguistQueue: role toggle "As linguist / As reviewer" + new status tabs. - QCDetail: two-slot assignment cards (linguist + reviewer), deadline display, role-aware action buttons, comments panel with optimistic insert and 15s refetch. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 16:59:40 +01:00
Vadym Samoilenko	e48d63bdbd	fix: generate valid ObjectId for audit log entries default_factory=PyObjectId produced "" (empty string) since Annotated[str, ...] is a type annotation, not a callable factory. Replace with lambda: str(ObjectId()) to generate a real unique ID. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 15:06:15 +01:00
Vadym Samoilenko	a3b300b76a	docs: add canonical documentation + audit cleanup - AGENTS.md: canonical project entry point (Quick Nav, pipeline, constraints) - docs/: complete docs tree — architecture, API spec, DB schema, infra, runbook, requirements, tech stack, principles, reference ADRs, guides, tasks backlog, testing strategy - tests/README.md: test commands, structure, known gaps - README.md / CLAUDE.md / DEPLOYMENT.md: updated with canonical doc links - .archive/: backup of pre-documentation-pipeline originals - backend/uv.lock: uv dependency lockfile - Delete committed __pycache__ .pyc files (should have been gitignored) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 14:22:51 +01:00
Vadym Samoilenko	fa351e4d25	feat: per-client glossary — hybrid exact/vector retrieval + AI injection Adds full glossary system so Gemini uses client-approved terminology when generating subtitles and translations (critical for 3M brand names and product codes across 16 target locales). Backend: - lib/locales.py: BCP-47 locale registry, normalises xlsx fr_fr → fr-FR - models/glossary.py: Glossary / GlossaryVersion / GlossaryTerm + enums - services/glossary_service.py: xlsx parse (openpyxl), ingest to Mongo, hybrid retrieval (Aho-Corasick exact + Atlas Vector Search), prompt block - services/embedding_service.py: Gemini text-embedding-004, batch 100, retry - tasks/embed_glossary.py: Celery background task for async embedding - api/v1/routes_glossaries.py: CRUD endpoints under /clients/{id}/glossaries - gemini.py: _build_glossary_block(), {GLOSSARY} injection in all 4 call sites - tts.py / gemini_tts.py: pass full locale codes (no split("-")[0] truncation) - tasks/translate_and_synthesize.py: glossary lookup + injection per language - prompts: {GLOSSARY} placeholder in ingestion, targeted, transcreation prompts - pyproject.toml: +openpyxl, +pyahocorasick Frontend: - routes/admin/glossaries/: GlossaryList, GlossaryUpload, GlossaryDetail - App.tsx: 3 new routes under /admin/clients/:clientId/glossaries - ClientDetail.tsx: Glossaries card with count + quick links - types/api.ts: Glossary, GlossaryVersion, GlossaryDetail, GlossaryTerm types - lib/api.ts: 7 new API methods (upload, list, detail, terms, versions, activate, archive) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 13:03:38 +01:00
Vadym Samoilenko	05f25a1141	feat: per-language QC workflow with linguist assignment - Job.language_qc dict tracks per-language status (pending/in_review/approved/rejected) with full event history; qc_assignments denormalized array enables efficient queue queries - language_qc service handles assign/reassign/approve/reject/reopen with atomic DB updates, audit logging, and auto-advancement to pending_final_review when all languages approved - Linguists can only edit VTT and trigger re-renders for their assigned language (403 guard) - return_to_qc resets all language statuses while preserving assignments - routes_language_qc.py: 7 new endpoints; /me/language-qc-queue for linguist queue - Startup migration idempotently seeds language_qc for all existing jobs - Frontend: LanguageQCState types, API methods, LinguistQueue page, QCDetail redesigned with per-language status badges, assignment dropdown, inline approve/reject buttons, progress bar, and reject modal; My QC Queue sidebar link Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 12:09:40 +01:00
Vadym Samoilenko	bab30e1508	feat: VTT version control — snapshots, diff, restore Backend: - VttVersion model (vtt_version.py): immutable snapshot per job/lang/kind/version - vtt_versioning service: create_version (atomic counter + GCS snapshot), list_versions, get_version, restore_version, diff_versions (difflib line-level) - routes_vtt_versions.py: GET /versions, GET /versions/{v}, GET /versions/diff, POST /versions/{v}/restore (PRODUCTION/ADMIN only, overwrites live file + audit log) - Hook create_version into update_job_vtt_content before each live-file overwrite - Mongo indexes: unique (job_id, lang, kind, version) + (job_id, created_at) Frontend: - VttVersionSummary / VttVersionFull / VttDiffResponse types - api.ts: listVttVersions, getVttVersion, diffVttVersions, restoreVttVersion - VersionsTab.tsx: lang/kind switcher, version list with A/B compare buttons, inline diff viewer (color-coded +/−), content viewer, restore with confirm dialog - JobDetail.tsx: new "VTT Versions" tab wired to VersionsTab Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 11:46:21 +01:00
Vadym Samoilenko	00fb1aacc6	feat(saas): Phase 2 — invitation flow, email templates, MS SSO zero-membership Backend: - models/invitation.py — Invitation model + create/accept/preview schemas - routes_invitations.py — org-scoped POST/GET/DELETE + public preview/accept endpoints Single-use token via find_one_and_update; sha256(token) stored in DB, plaintext in email URL - emailer.py — _send() helper; send_invitation_email, send_welcome_email, send_password_reset_email send_completion_email refactored to use _send() - migration_2026-04-28-000002 — creates invitations collection with TTL index (30d audit trail) - routes_auth.py — new MS SSO users provisioned with zero memberships instead of role=PRODUCTION; they land on "no access" page until an admin invites them - main.py — registers invitations_org_router and invitations_router Frontend: - routes/AcceptInvite.tsx — public page at /accept-invite?token=... Four states: new user (name+password), existing user (confirm), MS user, already-member - App.tsx — /accept-invite route outside RequireAuth - types/api.ts — Invitation, InvitationCreate, InvitationPreview, InvitationAcceptRequest/Response - lib/api.ts — listInvitations, createInvitation, revokeInvitation, previewInvitation, acceptInvitation - hooks/useClients.ts — useInvitations, useCreateInvitation, useRevokeInvitation Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 16:52:08 +01:00
Vadym Samoilenko	6f1be645ce	feat(saas): Phase 0+1 — Organization/Membership entities and dev branch Introduces the multi-tenant SaaS foundation alongside the existing client/team/project model (zero-downtime shim period): Backend: - app/models/organization.py — Organization + OrgRole enum (OWNER/ADMIN/MANAGER/MEMBER/VIEWER) - app/models/membership.py — Membership model with MemberDetail for enriched responses - app/services/membership_service.py — upsert/remove/list/has_org_role helpers - app/api/v1/routes_organizations.py — /organizations CRUD + /members sub-resource + /me/memberships - main.py — registers organizations router - migrations: create memberships collection (unique index) + backfill from pm_client_ids/team members Frontend: - types/api.ts — Organization, OrgRole, Membership, OrganizationCreateRequest types; Client marked @deprecated - hooks/useClients.ts — useOrganizations, useOrganization, useOrgMembers, useAddOrgMember, useUpdateOrgMember, useRemoveOrgMember, useMyMemberships - lib/api.ts — listOrganizations, getOrganization, createOrganization, updateOrganization, listOrgMembers, addOrgMember, updateOrgMember, removeOrgMember, getMyMemberships Reads fall back to the clients collection during transition; all writes go to organizations. Existing /clients endpoints and hooks are untouched. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 16:46:24 +01:00
Vadym Samoilenko	269ab09fa6	fix: serialize Client/Team/Project with id not _id + guard undefined client hooks Pydantic v2 + FastAPI serializes Field(alias="_id") as _id in JSON, so client.id was always undefined on the frontend — causing option values to fall back to text content ("3M") and firing /clients/3M/teams 404s. - Remove Field(alias="_id") from Client/Team/Project models; id is now a plain string field populated explicitly in _client_from_doc etc. - API now returns id not _id, matching the TypeScript Client interface - Add clientId !== "undefined" guard to useTeams, usePMs, useProjects Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 16:18:28 +01:00
Vadym Samoilenko	2b721d182b	feat: Client → Team → Project isolation system with Project Manager role Backend: - New UserRole.PROJECT_MANAGER with pm_client_ids[] on User model - New models: Client (slug-based), Team (member_user_ids[]), Project (client-scoped) - Job model gains project_id field - New GET/POST/PATCH/DELETE /clients, /clients/{id}/teams, /clients/{id}/projects, /clients/{id}/pm routes (admin-only client CRUD; PM or admin for teams/projects) - get_accessible_project_ids() helper: staff→all, PM→their clients' projects, CLIENT→projects from teams they belong to (with legacy owner fallback) - list_jobs, get_job, bulk_download, get_vtt_content, delete_job all use new isolation Frontend: - UserRole type gains 'project_manager' - Job, JobCreateRequest gain project_id field - Client, Team, Project, PMUser types added - ApiClient: full client/team/project/PM CRUD methods - useClients hook with all query/mutation hooks - Admin pages: ClientList + ClientDetail (teams, members, projects, PM assignment) - NewJob form: client + project picker (shown when clients exist) - Sidebar: Clients nav item for admin and project_manager roles - Routes: /admin/clients and /admin/clients/:clientId behind RoleGate Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 15:11:13 +01:00
Vadym Samoilenko	ae2c474061	feat: integrate oliver-cost-tracker SDK into video-accessibility Add AI cost tracking to all Gemini and TTS call sites: - config.py: add COST_TRACKER_* env vars (base_url, api_key, source_app, outbox_path, enabled) - dependencies.py: add get_cost_tracker() factory (lru_cache, graceful degradation if SDK not installed) - models/job.py: add cost_tracker_project_id field for cost attribution - services/gemini.py: - add import time, _record_gemini_usage() helper (reads usage_metadata) - add _cost_ctx kwarg to extract_accessibility, extract_accessibility_targeted, transcreate_content, translate_vtt, rewrite_tts_cue - record usage after every generate_content call via asyncio.create_task() - tasks/ingest_and_ai.py: pass _cost_ctx (user_id, job_id, project_id) to extract_accessibility - tasks/translate_and_synthesize.py: build _cost_ctx from job_doc and pass to transcreate_content + translate_vtt calls - tasks/tts_synthesis.py: add user_id + cost_project_id kwargs, add _record_tts_cost() helper (records len(text) chars to cost tracker) - pyproject.toml: document SDK install instructions (comment) - .env.prod.example: add COST_TRACKER_* vars Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 11:30:46 +01:00
Vadym Samoilenko	cf761c4bb6	feat: add linguist role and user management navigation - Add LINGUIST role to UserRole enum (backend + frontend) - Grant linguists access to QC Review, Final Review, review notes, and VTT editing - Add MongoDB migration to update schema validator with linguist role - Add admin seed: vadymsamoilenko@oliver.agency is promoted to admin on startup - Add User Management sidebar link for admin users - Fix Login.tsx role type cast to use UserRole instead of hardcoded union Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 11:46:33 +01:00
Vadym Samoilenko	6f963ff7c4	feat: DCMP compliance, descriptive transcript, new languages, QA bug fixes - Rewrote VTT translation to two-step (text-only → Gemini → apply to original timestamps) preventing caption timing desync - Added polling fallback for all processing states and Safari visibilitychange WebSocket reconnect - Added 11 new TTS languages (cs, da, fi, hu, no, sk, sv, es-419, pt-BR, fr-CA) - Updated caption/AD prompts to DCMP Captioning Key & Description Key standards (line splitting, ♪ music notation, italic tags, caption positioning, ethics guidelines) - Added descriptive transcript generation (WCAG 2.1 §1.2.1) combining captions + AD into plain text - Fixed amix normalize=0 to prevent audio loss in rendered videos - Fixed AD re-timing double-count when source_ms is None - Fixed cue block numbering to be 1-based in VttEditor and Timeline Preview Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 11:50:43 +00:00
Vadym Samoilenko	f4ddcce066	fix: resolve QA-reported bugs — MP3/VTT desync, crashes, notifications, and more BUG-1 & BUG-2 — Wrong audio plays after re-render / MP3 doesn't match text Root cause: audio files were named by index (cue_0.mp3, cue_1.mp3). When a cue was inserted or deleted, all following indices shifted but old MP3 files kept their original names, so re-render would play the wrong audio for the wrong cue. Fix: renamed files to cue_N_CONTENTHASH.mp3 and introduced an ad_cue_manifest stored in the job document that maps each cue index to its correct GCS URI. Re-render now reads from the manifest instead of guessing by filename. Also: editing AD cue text in the VTT editor now automatically queues TTS regeneration for changed cues — no more silent mismatches. BUG-3 — App crash / state desync when uploading VTT or clearing TTS queue Fixed handleVttFileUpload to only update local editor state after the server confirms the save — previously local state was updated first, so a network error left the UI showing content that wasn't actually saved. Fixed handleClearRegenerationQueue to only remove items from local state if the server removal succeeded — previously all items were cleared regardless. BUG-4 — AI generates different audio descriptions every time Added GenerateContentConfig(temperature=0.2, top_p=0.8, top_k=40) to the Gemini API call so output is more consistent across runs. BUG-5 — On-screen text inconsistently described Strengthened the AI prompt rule from a vague suggestion to a mandatory requirement with an explicit format: "Text on screen reads: [exact text]". Applied to both gemini_ingestion.md and gemini_ingestion_targeted.md. BUG-6 — No notification when re-render finishes Added rendering_qc toast notification and a dismissible green banner that appears in QCDetail when re-render transitions to pending_qc. The banner auto-dismisses after 10 seconds. Also increased WebSocket reconnect attempts from 5 to 15 and capped backoff at 60s to prevent falling back to manual refresh. BUG-7 — Timeline preview looks accurate but isn't after edits Added isStale prop to TimelinePreview. The timeline now shows an amber tint and "Preview may be outdated" label whenever there are unsaved pause point changes, pending TTS regenerations, or a new VTT has been uploaded. BUG-8 — ElevenLabs API errors break TTS with no fallback Added try/except fallback chain in _synthesize_single_cue: if the configured provider fails, it automatically retries with google, then gemini. BUG-9 — Concurrent re-render requests cause race conditions Made the PENDING_QC → RENDERING_QC status transition conditional (only succeeds if the job is still in PENDING_QC). Returns HTTP 409 if a re-render is already in progress. The completion transition back to PENDING_QC is also conditional so a cancelled/overridden render doesn't corrupt job state. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-24 13:23:55 +00:00
Vadym Samoilenko	c413fcb747	feat: add SDH (Subtitles for Deaf and Hard of Hearing) caption output SDH captions extend standard VTT with speaker identification labels, sound effects [PHONE RINGS], music notation ♪, and off-screen indicators. - Add sdh_vtt flag to RequestedOutputs model and frontend form - Add sdh_captions_vtt_gcs field to LangOutput model - Inject SDH generation instructions into both Gemini prompts via {SDH_FIELD} and {SDH_GUIDELINES} placeholders when requested - Upload sdh_captions.vtt to GCS in ingest task - Pass SDH through video_native translation (Gemini generates it directly) and traditional translation (translate source SDH VTT via Gemini) - Expose sdh_captions_vtt in downloads endpoint and bulk zip export Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 15:02:18 +00:00
Vadym Samoilenko	2e8a8dc287	feat: add brand context, ethics guidelines, and improved AD prompt rules - Add brand_context field (job model, API, frontend form) so clients can list brand names present in their video; Gemini uses these names instead of generic descriptors (e.g. "Sellotape" not "sticky tape") - Add ethical guidelines section to both Gemini prompts covering person-first language, consistent race/gender description only when plot-relevant, no guessing at unconfirmed identity - Revamp audio description rules: priority ordering (essential → high-priority → time-permitting), pre-teaching placement, no cinematic jargon, succinct style replacing the former "20% longer" instruction - Thread brand_context through full stack: routes → job doc → ingest task → translate task → both Gemini prompt templates Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 14:46:09 +00:00
Vadym Samoilenko	1e177a6d5c	feat: add ElevenLabs voice selection to frontend and backend Add dynamic ElevenLabs voice catalog with provider toggle in the UI, allowing users to browse ElevenLabs voices, configure stability and similarity boost settings, and preview/synthesize with ElevenLabs TTS. Backend: - New elevenlabs_voices.py service with 1-hour cached API fetching - TTS routes support ?provider= query param for voices and options - Preview endpoint routes to ElevenLabs or Gemini based on provider - stability/similarity_boost params flow through TTS synthesis pipeline - TTSPreferences model extended with ElevenLabs-specific fields - Deprecated hardcoded elevenlabs_voices config (now fetched dynamically) Frontend: - Provider toggle (Gemini/ElevenLabs) in VoiceSelector - ElevenLabsSettingsPanel with stability and similarity boost sliders - VoicePreviewButton supports provider-specific preview parameters - API client passes provider param to voices, options, and preview endpoints - New VoiceInfo, ProviderVoicesResponse, ProviderOptionsResponse types Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-03 13:58:56 +00:00
michael	f47820a6a4	fix: make source_ms optional for backward compatibility with existing jobs Existing jobs in the database don't have source_ms field. Making it optional allows the API to load these jobs without validation errors. The re-render task already handles the fallback to original_ms when source_ms is None. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-12 07:04:08 -06:00
michael	a6cd4cde07	fix: store source video coordinates in pause points for correct re-rendering The re-render task was using pause point coordinates from the accessible video timeline (which includes freeze frame durations) instead of the original source video coordinates. This caused pause points to exceed the source video duration and get clamped incorrectly. Changes: - Add source_ms field to PausePointData model to store source video cut point - Update video_renderer.py to populate source_ms when building pause points - Update rerender_accessible_video.py to use source_ms for placement calculations - Apply user adjustments as relative offsets (delta-based adjustment) - Update API responses and TypeScript types to include source_ms - Add backward compatibility fallback for jobs without source_ms Note: Existing jobs need to be re-processed from initial render to populate the new source_ms field. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-11 10:48:41 -06:00
michael	aa6777d2c2	feat: add QC accessible video review and editing capabilities - Reorder workflow: translations now happen BEFORE QC Review step - Add language tabs to switch between translated languages in QC - Add video mode tabs (Original Video / Accessible Video) - Add interactive timeline preview showing video segments and AD cues - Enable pause point adjustment with millisecond precision - Add TTS regeneration queue for selective cue re-synthesis - Add re-render controls with optional Whisper refinement - Persist video segments and TTS MP3s to GCS for editability - Add new RENDERING_QC job status for re-render operations - Create 5 new API endpoints for accessible video editing - Add rerender_accessible_video.py Celery task Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-11 08:32:27 -06:00
michael	e44210ea64	feat: auto-rewrite TTS cues that fail synthesis When TTS synthesis fails after 3 retries, the system now: - Sends problematic cue text to Gemini for TTS-safe rewriting - Updates the VTT file in GCS with rewritten text - Retries TTS synthesis with the new text - Records successful rewrites in job.tts_rewrites field UI changes: - JobDetail shows amber caution box with original/rewritten text - JobsList shows warning icon next to jobs with rewrites - Error display clarifies text shown is "after rewrite attempt" Files changed: - backend/app/models/job.py: Add tts_rewrites field - backend/app/prompts/gemini_tts_rewrite.md: New prompt template - backend/app/services/gemini.py: Add rewrite_tts_cue method - backend/app/tasks/tts_synthesis.py: Add VTT update utilities - backend/app/tasks/translate_and_synthesize.py: Rewrite+retry logic - frontend/src/types/api.ts: Add TTSRewriteItem type - frontend/src/routes/jobs/JobDetail.tsx: Caution display - frontend/src/routes/jobs/JobsList.tsx: Warning indicator 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-05 14:42:50 -06:00
michael	c512bdc184	feat: use AD VTT pause points instead of Gemini video analysis Optimize the accessible video workflow by eliminating the dedicated Gemini video analysis call for pause point estimation. Instead: - Use AD VTT cue start times as initial pause points for Whisper refinement - Add user-selectable accessible video method (pause_insert/overlay) at QC approval - Add bulk approval API endpoint with method selection - Add method selector UI to QCDetail page - Add bulk approval modal to QCList for jobs with accessible video Benefits: - Eliminates expensive Gemini API call with video upload - Faster workflow (~5-15 seconds saved per job) - Cost savings on Gemini video analysis - User control over accessible video integration method Backend changes: - Add accessible_video_method to RequestedOutputs and ApproveSourceRequest - Add POST /jobs/bulk/approve endpoint - Replace Gemini call with _build_placements_from_ad_vtt() helper - Mark analyze_accessible_video_placement() as deprecated Frontend changes: - Add method selector radio buttons to QCDetail - Add bulk approval modal with method selection to QCList - Update API client and React Query hooks 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-03 19:05:45 -06:00
michael	c1c0b876fc	feat: add RENDER_FAILED status with error propagation to GUI - Add RENDER_FAILED job status for when video rendering fails - Fix _check_accessible_video_completion to detect failures and transition job status accordingly (was stuck in RENDERING_VIDEO forever) - Store detailed error info in job.error including failed_languages array - Call completion check after failures to properly update job status - Broadcast WebSocket notification on render failures Frontend: - Add render_failed to JobStatus type and StatusBadge (red styling) - Add tts_failed and render_failed to JobsList STATUS_LABELS - Enhance JobDetail error display with: - Warning icon and prominent styling - Error type and message - Failed languages list with per-language errors - Timestamp of when error occurred - Update ProgressIndicator to handle failed states with red dot 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-01 10:18:27 -06:00
michael	d2d8e32819	feat: add video-native translation mode for multi-language content Add a new "Video Native Mode" translation option that re-processes the video through Gemini for each target language, generating captions and audio descriptions directly from visual context. This produces more natural and culturally appropriate content compared to traditional VTT text translation. Changes: - Add translation_mode field to RequestedOutputs (video_native | traditional) - Create gemini_ingestion_targeted.md prompt for target language generation - Add extract_accessibility_targeted() method to Gemini service - Modify translate_and_synthesize task to handle both translation modes - Add Translation Mode UI selector in NewJob screen (video_native is default) - Remove transcreation UI (replaced by video_native mode) - Remove Google Translate service (replaced by Gemini translation) - Add LanguageSelector component with searchable dropdown 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-31 13:50:05 -06:00
michael	e8b940aee8	feat: add TTS_FAILED status and robust error handling for TTS synthesis Add comprehensive error handling for TTS synthesis failures: Backend: - Add TTS_FAILED status to JobStatus enum for failed synthesis jobs - Add TTSSynthesisError exception with cue index and context tracking - Improve null-safe error handling in Gemini TTS response parsing - Add _synthesize_cue_with_retry() with exponential backoff (3 attempts) - Enhanced error logging with text preview and model context Frontend: - Add TTS_FAILED status styling (red badge) in StatusBadge component - Add tts_failed to JobStatus TypeScript type 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-30 14:26:07 -06:00
michael	396e4e74e0	feat: add rendering_video status for accessible video processing When jobs with accessible video option enabled enter video rendering phase, the status now transitions to 'rendering_video' so users can see why processing is taking longer. This provides better visibility into the video rendering pipeline. Changes: - Added RENDERING_VIDEO status to JobStatus enum - Updated render_accessible_video task to set new status - Added status display to StatusBadge, jobStatusMessages - Included new status in JobsList Translation filter group 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-27 06:49:46 -06:00
michael	6effe58dc9	feat: add video review with timestamped notes to Final Review Add a comprehensive video review feature to the Final Review page that allows reviewers to watch videos with caption overlays and add timestamped notes. Backend: - New ReviewNote model for MongoDB with job_id, asset_key, timestamp, content - CRUD API endpoints at /jobs/{job_id}/review-notes - Owner-only edit/delete permissions (admins can bypass) - Database indexes for efficient querying Frontend: - VideoReviewPlayer component with video player and caption overlay - NotesSidebar for viewing/adding notes with auto-highlight when video reaches timestamp - SyncedCaptionList with auto-scroll and click-to-seek - AssetTabs for switching between languages and accessible videos - React Query hooks with 30s polling for collaborative updates Features: - Notes persist to database and are shared across all reviewers - Notes highlight for 5 seconds when video playback reaches their timestamp - Click note to seek video to that position - Pause video to add note at current timestamp - Accessible videos use retimed captions when available 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-26 15:30:00 -06:00
michael	80d3866d32	feat: add accessible video (MP4 with embedded audio descriptions) Add new deliverable type that renders video with audio descriptions embedded. Supports two AI-determined methods: - Direct Overlay: ducks original audio and overlays AD TTS (for minimal dialogue) - Pause-Insert: freeze-frame video, insert AD, re-time subtitles (for significant dialogue) Backend: - Add Pydantic schemas for Gemini analysis response - Add Gemini prompt and analyze_accessible_video_placement() method - Add video_renderer.py service using FFmpeg for both rendering methods - Add vtt_retimer.py service for pause-insert subtitle adjustment - Add render_accessible_video.py Celery task - Modify TTS service to return individual per-cue segments - Update translate_and_synthesize.py to save segments and trigger rendering - Update download endpoint to include accessible video outputs Frontend: - Add accessible_video_mp4 checkbox to NewJob form - Update TypeScript types for new deliverable 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-26 11:06:41 -06:00
michael	865fcdc246	feat: add TTS settings panel with model, speed, and style options - Add model selection (flash vs pro) for quality control - Add speed slider (0.5x - 2.0x) for pacing adjustment - Add style presets (neutral, calm, energetic, professional, warm, documentary) - Add custom style prompt option for advanced customization - New /tts/options endpoint returns available TTS options - Voice preview now tests all settings so users hear exact output - Backward compatible: all new fields have sensible defaults 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-22 15:22:14 -06:00
michael	29643f6683	upgrade TTS to Gemini TTS with voice selection and preview - Add Gemini TTS service with 30 voices and 24 languages - Add TTS API endpoints for voice listing and preview - Add per-language voice selection in job creation form - Add voice override at QC approval stage - Add VoiceSelector and VoicePreviewButton components - Update TTSPreferences model with provider and voice mapping 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-22 14:41:57 -06:00
michael	58a4f1f627	add support for non-English original video uploads - Upload form now has "English / Different language" radio with optional language hint - Gemini auto-detects language and saves outputs to outputs.{detected_language} - QC review dynamically loads/saves VTT for source language - New APPROVED_SOURCE status for non-English videos (APPROVED_ENGLISH kept for backwards compat) - Translation pipeline reads from source language and passes source_language to Google Translate - All existing English jobs continue to work unchanged 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-22 10:33:58 -06:00
michael	aefd559e68	added production user role and made it default for new MSAL users - production can access everything EXCEPT user management - that's only for admin	2025-10-10 10:07:30 -05:00

52 commits
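
Commit f4ddcce066 in the list above replaces index-only audio names (cue_0.mp3) with cue_N_CONTENTHASH.mp3 plus an ad_cue_manifest that maps each cue index to its GCS URI. The sketch below illustrates that idea only and is not the repository's code: the helper names, the SHA-256 digest truncated to 12 characters, and the gs://example-bucket/job-1 prefix are all assumptions.

```python
import hashlib


def cue_filename(index: int, text: str) -> str:
    """Build a name shaped like cue_N_CONTENTHASH.mp3 (hash choice is an assumption)."""
    digest = hashlib.sha256(text.encode("utf-8")).hexdigest()[:12]
    return f"cue_{index}_{digest}.mp3"


def build_manifest(cues: list[str], prefix: str) -> dict[int, str]:
    """Map each cue index to the URI of the audio for that cue's current text."""
    return {i: f"{prefix}/{cue_filename(i, text)}" for i, text in enumerate(cues)}


def cues_needing_tts(old: dict[int, str], new: dict[int, str]) -> list[int]:
    """Indices whose URI differs from the old manifest: edited, inserted, or shifted cues."""
    return [i for i, uri in new.items() if old.get(i) != uri]


# Hypothetical job: a cue is inserted between two existing cues.
prefix = "gs://example-bucket/job-1"  # placeholder URI, not a real bucket
before = build_manifest(["A door opens.", "She waves."], prefix)
after = build_manifest(["A door opens.", "He enters.", "She waves."], prefix)

print(cues_needing_tts(before, after))  # → [1, 2]
```

Because the name depends on both index and text, a stale file can never be mistaken for the audio of a different cue. The cost in this sketch is that a cue whose index shifts is re-synthesized even though its text is unchanged.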