video-accessibility

Author	SHA1	Message	Date
Vadym Samoilenko	bbd324e3c7	feat: add Project Manager role + client/team assignment panel in admin user editor - Add project_manager to all role dropdowns (UserList filter, create modal, UserDetail edit form) - Add indigo badge color for project_manager in user list table - Expose pm_client_ids in UserResponse schema and all admin user endpoints - Add pm_client_ids to frontend User type - Add UserAssignmentsPanel to UserDetail sidebar: PM users see client toggle list; other roles see client → team membership picker - Add flexible hooks (useTeamsForClient, useAssignPMAny, useRemovePMAny, useAddTeamMemberAny, useRemoveTeamMemberAny) - Fix useClient guard against literal "undefined" string causing 404 requests Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 15:58:55 +01:00
Vadym Samoilenko	26bfedd7c7	feat: add cost_tracker_project_id assignment UI to QC and Final Review - PATCH /jobs/{job_id} endpoint for updating title and cost_tracker_project_id - cost_tracker_project_id exposed on JobResponse (GET /jobs/{id}) - Inline project ID field in QCDetail and FinalDetail — saved via PATCH - "AI Cost Dashboard" link in UserList header - cost_tracker_project_id added to Job type and JobUpdateRequest schema Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 11:40:13 +01:00
michael	0c3102b77f	feat: add Return to QC action for jobs in resting statuses Allow production/admin users to move jobs back to pending_qc from completed, pending_final_review, rejected, qc_feedback, tts_failed, render_failed, approved_english, and approved_source statuses. Includes single-job endpoint, bulk endpoint, JobDetail inline form with required notes, bulk action in JobsList with confirmation modal, and a Review Notes card on the job overview tab. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-14 13:18:02 -06:00
michael	e371dc401a	feat: add save button to voice settings panel for TTS regeneration Add ability to save voice settings changes in QC Review screen without needing to approve the job. When saved, all TTS segments are regenerated across all languages with the new voice settings. Changes: - Add PUT /jobs/{id}/tts-preferences endpoint to update TTS preferences - Add UpdateTTSPreferencesRequest schema - Add updateTTSPreferences API method and useUpdateTTSPreferences hook - Add Save Voice Settings button with change detection to QCDetail Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-12 09:05:56 -06:00
michael	f47820a6a4	fix: make source_ms optional for backward compatibility with existing jobs Existing jobs in the database don't have source_ms field. Making it optional allows the API to load these jobs without validation errors. The re-render task already handles the fallback to original_ms when source_ms is None. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-12 07:04:08 -06:00
michael	a6cd4cde07	fix: store source video coordinates in pause points for correct re-rendering The re-render task was using pause point coordinates from the accessible video timeline (which includes freeze frame durations) instead of the original source video coordinates. This caused pause points to exceed the source video duration and get clamped incorrectly. Changes: - Add source_ms field to PausePointData model to store source video cut point - Update video_renderer.py to populate source_ms when building pause points - Update rerender_accessible_video.py to use source_ms for placement calculations - Apply user adjustments as relative offsets (delta-based adjustment) - Update API responses and TypeScript types to include source_ms - Add backward compatibility fallback for jobs without source_ms Note: Existing jobs need to be re-processed from initial render to populate the new source_ms field. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-11 10:48:41 -06:00
michael	aa6777d2c2	feat: add QC accessible video review and editing capabilities - Reorder workflow: translations now happen BEFORE QC Review step - Add language tabs to switch between translated languages in QC - Add video mode tabs (Original Video / Accessible Video) - Add interactive timeline preview showing video segments and AD cues - Enable pause point adjustment with millisecond precision - Add TTS regeneration queue for selective cue re-synthesis - Add re-render controls with optional Whisper refinement - Persist video segments and TTS MP3s to GCS for editability - Add new RENDERING_QC job status for re-render operations - Create 5 new API endpoints for accessible video editing - Add rerender_accessible_video.py Celery task Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-11 08:32:27 -06:00
michael	83e4752327	feat: add server-side zip download for bulk job downloads Replace sequential browser-based bulk download with server-side zip generation. When users select "Download All Files" from bulk actions, the system now creates a single organized .zip file containing all job assets. Changes: - Add POST /jobs/bulk/download endpoint that streams zip to client - Add BulkDownloadRequest schema for the new endpoint - Create zip_download.py service with streaming zip generation - Update frontend to call new endpoint and download single zip file - Organize files in zip by job title and language subdirectories Zip structure: accessible_video_YYYYMMDD_HHMMSS.zip └── {job_title}/ ├── source.mp4 └── {lang}/ ├── captions.vtt ├── ad.vtt ├── ad.mp3 ├── accessible_video.mp4 └── accessible_captions.vtt 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-04 15:57:57 -06:00
michael	c512bdc184	feat: use AD VTT pause points instead of Gemini video analysis Optimize the accessible video workflow by eliminating the dedicated Gemini video analysis call for pause point estimation. Instead: - Use AD VTT cue start times as initial pause points for Whisper refinement - Add user-selectable accessible video method (pause_insert/overlay) at QC approval - Add bulk approval API endpoint with method selection - Add method selector UI to QCDetail page - Add bulk approval modal to QCList for jobs with accessible video Benefits: - Eliminates expensive Gemini API call with video upload - Faster workflow (~5-15 seconds saved per job) - Cost savings on Gemini video analysis - User control over accessible video integration method Backend changes: - Add accessible_video_method to RequestedOutputs and ApproveSourceRequest - Add POST /jobs/bulk/approve endpoint - Replace Gemini call with _build_placements_from_ad_vtt() helper - Mark analyze_accessible_video_placement() as deprecated Frontend changes: - Add method selector radio buttons to QCDetail - Add bulk approval modal with method selection to QCList - Update API client and React Query hooks 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-03 19:05:45 -06:00
michael	3689653135	refactor: remove manual language selection, always auto-detect Remove the "Original Video Language" control from job upload form. All videos now use AI auto-detection for source language, simplifying the UX and eliminating potential for incorrect manual selection. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-03 18:42:45 -06:00
michael	12cae0919a	feat: implement full-gap-overlap algorithm for AD pause insertion Changes pause point calculation to use the entire gap between sentences as a buffer on BOTH sides of the audio description: - pause_point: Just BEFORE next sentence starts (gap.end - 50ms) - resume_from: Just AFTER previous sentence ends (gap.start + 50ms) This means a small portion of video plays twice (the gap duration), but creates a much more natural listening experience by maximizing the breathing room around audio descriptions. Changes: - whisper_service.py: snap_pause_point() now returns (pause_point, resume_from) - video_renderer.py: Uses resume_from for current_time after freeze segment - vtt_retimer.py: Calculates effective_offset including overlap duration - accessible_video.py: Added resume_from field to ADPlacementCue schema 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-28 22:51:49 -06:00
michael	8806289eca	feat: improve pause point precision with sentence boundary detection - Update Gemini prompt to require transcription with precise timestamps - Add sentence_boundaries output field for validation - Add pause_point_rationale field to explain each pause point choice - Emphasize terminal punctuation only (., ?, !) - never commas - Expand Whisper search window from ±5s to ±10s - Increase post-pause buffer from 50ms to 175ms 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-28 14:41:12 -06:00
michael	05bde8326d	feat: add Whisper-based pause point refinement for audio descriptions Implements word-level speech analysis using faster-whisper to refine AD pause points. Gemini's timestamps are snapped to natural speech gaps (sentence/phrase boundaries) to prevent pauses mid-word. Key changes: - Add WhisperService for transcription and gap detection - Add dedicated Celery task routed to 'whisper' queue - Integrate refinement into render_accessible_video task - Cache Whisper transcripts in MongoDB for reuse across languages - Add dedicated whisper-worker with concurrency=1 to prevent OOM Configuration: - Uses faster-whisper 'base' model (multilingual, ~145MB) - 5-second search window after Gemini's recommended point - Falls back to original timestamp if no gap found Infrastructure: - New Docker stage: whisper-worker - New Cloud Run service: accessible-video-whisper-worker - Updated docker-compose.yml with whisper-worker service 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-27 08:27:48 -06:00
michael	150a3e27bd	fix: include client_id in JobResponse for user filter The Created By filter dropdown was empty because client_id was not being returned by the API. Added client_id to JobResponse schema and included it in the list_jobs response. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-27 07:28:05 -06:00
michael	46b0f2c092	feat: add filtering, sorting, and table view to All Jobs tab - Add created_by_name field to JobResponse schema and API - Batch-fetch user names in list_jobs endpoint for efficiency - Convert JobsList from card layout to sortable data table - Add search box (job name, filename, created by user) - Add user filter dropdown (populated from current jobs) - Add status filter dropdown (individual statuses from current jobs) - Add date range filter (All Time, Last 7 Days, Last 30 Days) - Add sortable columns: Job Name, Created By, Date Created, Status - Fetch all jobs for full client-side filtering capability - Add responsive horizontal scroll for mobile 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-27 07:16:21 -06:00
michael	3cdea9dfec	fix: video review caption sync and event listener issues - Fix video event listeners not re-attaching when video element remounts (add activeTab?.videoUrl to useEffect dependency array) - Add retimed_captions_vtt to VTT API response for accessible videos - Use retimed captions for accessible video tab in VideoReviewPlayer 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-26 16:23:48 -06:00
michael	6effe58dc9	feat: add video review with timestamped notes to Final Review Add a comprehensive video review feature to the Final Review page that allows reviewers to watch videos with caption overlays and add timestamped notes. Backend: - New ReviewNote model for MongoDB with job_id, asset_key, timestamp, content - CRUD API endpoints at /jobs/{job_id}/review-notes - Owner-only edit/delete permissions (admins can bypass) - Database indexes for efficient querying Frontend: - VideoReviewPlayer component with video player and caption overlay - NotesSidebar for viewing/adding notes with auto-highlight when video reaches timestamp - SyncedCaptionList with auto-scroll and click-to-seek - AssetTabs for switching between languages and accessible videos - React Query hooks with 30s polling for collaborative updates Features: - Notes persist to database and are shared across all reviewers - Notes highlight for 5 seconds when video playback reaches their timestamp - Click note to seek video to that position - Pause video to add note at current timestamp - Accessible videos use retimed captions when available 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-26 15:30:00 -06:00
michael	80d3866d32	feat: add accessible video (MP4 with embedded audio descriptions) Add new deliverable type that renders video with audio descriptions embedded. Supports two AI-determined methods: - Direct Overlay: ducks original audio and overlays AD TTS (for minimal dialogue) - Pause-Insert: freeze-frame video, insert AD, re-time subtitles (for significant dialogue) Backend: - Add Pydantic schemas for Gemini analysis response - Add Gemini prompt and analyze_accessible_video_placement() method - Add video_renderer.py service using FFmpeg for both rendering methods - Add vtt_retimer.py service for pause-insert subtitle adjustment - Add render_accessible_video.py Celery task - Modify TTS service to return individual per-cue segments - Update translate_and_synthesize.py to save segments and trigger rendering - Update download endpoint to include accessible video outputs Frontend: - Add accessible_video_mp4 checkbox to NewJob form - Update TypeScript types for new deliverable 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-26 11:06:41 -06:00
michael	29643f6683	upgrade TTS to Gemini TTS with voice selection and preview - Add Gemini TTS service with 30 voices and 24 languages - Add TTS API endpoints for voice listing and preview - Add per-language voice selection in job creation form - Add voice override at QC approval stage - Add VoiceSelector and VoicePreviewButton components - Update TTSPreferences model with provider and voice mapping 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-22 14:41:57 -06:00
michael	58a4f1f627	add support for non-English original video uploads - Upload form now has "English / Different language" radio with optional language hint - Gemini auto-detects language and saves outputs to outputs.{detected_language} - QC review dynamically loads/saves VTT for source language - New APPROVED_SOURCE status for non-English videos (APPROVED_ENGLISH kept for backwards compat) - Translation pipeline reads from source language and passes source_language to Google Translate - All existing English jobs continue to work unchanged 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-22 10:33:58 -06:00
michael	665b49c3f1	added MSAL microsoft authentication	2025-10-10 09:19:39 -05:00
michael	0910ade371	more fixes for refresh token - this time maintaining the username and role properly across refresh	2025-10-08 23:09:29 -05:00
michael	af2562096a	initial commit	2025-08-24 16:28:33 -05:00

23 commits