video-accessibility

Author	SHA1	Message	Date
Vadym Samoilenko	2f4925353a	feat(pause-insert): adaptive buffer, forward-snap, timeline drag + share link fix Backend (Phase A): - A1: Adaptive silence buffer — natural_gap_ms persisted per cue; renderer computes per-cue silence_before/silence_after instead of fixed 500ms; per-cue silence files - A2: Forward-preferred snap — snap_pause_point prefers boundaries up to 4s ahead over boundaries within 1.5s behind, reducing mid-scene cuts - A3: Min-gap validation — pause points with < 200ms gap trigger forward search to the next acceptable gap - natural_gap_ms added to PausePointData model and api.ts type - New config fields: whisper_snap_forward_window, whisper_snap_backward_window, ad_silence_buffer_default, ad_silence_buffer_min_after, ad_min_acceptable_gap - Tests: test_whisper_snap.py (13 tests), test_video_renderer_buffers.py Frontend (Phase B): - B1: Drag pause-point markers — pointer state machine with 3px move threshold, clamp to min/max bounds, click-without-move still opens PausePointEditor - B2: Drag freeze blocks — orange blocks translate with linked pause point - B3: Time tooltip visible during drag, hidden on release - Tests: TimelinePreview.drag.test.tsx (10 tests) Fixes: - Share link pointed to ai-sandbox.oliver.solutions — added app_url to Settings with correct optical-dev.oliver.solutions default; share_url now configurable via APP_URL env var - Removed all ai-sandbox.oliver.solutions references from docker-compose, apache config, docs, and scripts Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 16:09:09 +01:00
Vadym Samoilenko	98764f5065	fix(tts-worker): make concurrency configurable via TTS_WORKER_CONCURRENCY env var Hardcoded --concurrency=8 with 512MB memory limit caused 1162+ OOM restarts. Default is 2; set TTS_WORKER_CONCURRENCY in .env.production to override. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 10:22:06 +01:00
Vadym Samoilenko	5db01248b6	fix: pass USE_CELERY_FALLBACK to containers and show real error in failure UI - docker-compose.yml: add USE_CELERY_FALLBACK env var to api and worker services so cloud_run_dispatch uses Celery on optical-dev - JobDetail.tsx: show actual error message instead of generic "Processing failed at ." when failure step is unknown; also show job.error string when no structured failure object exists Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 17:48:02 +01:00
Vadym Samoilenko	1e5a07b06e	fix(deploy): change API host port to 8012 (8010 also occupied) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 22:02:44 +01:00
Vadym Samoilenko	582f8ad2e8	fix(deploy): change API host port 8003→8010, move image to video-accessibility repo Port 8003 is occupied by infra-api-1 on optical-dev server. Artifact Registry repo renamed from nexus to video-accessibility. cloudbuild.yaml defaults _TAG to 'latest' for manual runs. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 22:02:14 +01:00
Vadym Samoilenko	ea21cace96	feat: replace SDK with direct HTTP integration to centralized cost tracker - New services/cost_tracker.py: sync httpx preflight()/record() + async wrappers; BudgetExceeded exception; no-op when COST_TRACKER_BASE_URL is empty - Preflight budget check added before ingestion (Gemini), per-language translation (video-native + traditional), and per-language TTS dispatch - _record_gemini_usage and _record_tts_cost now call cost_tracker directly; removes broken asyncio.get_event_loop() hack from sync Celery worker - Fix: _cost_ctx now threaded into extract_accessibility_targeted (video-native path) - Fix: user_id/cost_project_id now propagated through dispatch_language_tts → synthesize_cue_task.s() and the rerender_accessible_video.py re-render path - Remove oliver-cost-tracker SDK dependency (was commented-out/never installed) - Drop cost_tracker_outbox_path setting and get_cost_tracker() factory - Update COST_TRACKER_BASE_URL default to optical-dev.oliver.solutions in .env.prod.example, docker-compose.yml, and all Cloud Run service yamls - Cloud Run yamls use Secret Manager ref (cost-tracker-api-key) for the API key Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 13:36:15 +01:00
michael	9580979ac8	feat: add environment-based worker concurrency for Cloud Run mode Allow configuring Celery worker concurrency via environment variables to take advantage of Cloud Run autoscaling: - Add WORKER_CONCURRENCY, WHISPER_WORKER_CONCURRENCY, FFMPEG_WORKER_CONCURRENCY settings to config.py with recommended values documented - Update Dockerfile to use ${WORKER_CONCURRENCY} and ${WHISPER_WORKER_CONCURRENCY} environment variables instead of hardcoded values - Update docker-compose.yml to pass concurrency env vars to worker commands - Add WHISPER_SERVICE_URL and FFMPEG_SERVICE_URL to relevant workers Recommended settings: Local mode: WHISPER=1, FFMPEG=1 (CPU/RAM constrained) Cloud Run mode: WHISPER=10, FFMPEG=20 (match autoscaling limits) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-02 10:27:07 -06:00
michael	6689778be7	feat: add dedicated TTS worker with parallel per-cue synthesis Break out TTS synthesis into a dedicated Celery worker (tts queue) with concurrency=8 for parallel processing. Each AD cue is now synthesized as a separate task, enabling up to 8 cues to be processed simultaneously. Key changes: - Add tts_synthesis.py with synthesize_cue_task for per-cue synthesis - Refactor translate_and_synthesize.py to dispatch cue tasks in parallel - Add tts-worker service to docker-compose.yml (concurrency=8) - Add Cloud Run service config for production deployment Benefits: - Parallel synthesis even for single jobs (e.g., 50 cues → 8 concurrent) - Natural rate limiting across multiple concurrent jobs - Fault tolerance with per-cue retries and GCS persistence 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-30 14:23:11 -06:00
michael	e8fde7962f	chore: increase accessible-video-worker concurrency from 4 to 8 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-28 23:20:58 -06:00
michael	54638d1065	feat: switch Whisper model from large-v3 to medium Medium model is faster and uses less memory (~1.5GB vs ~3GB) while still providing good multilingual transcription quality. Updated in: - config.py - docker-compose.yml - whisper-worker-service.yaml - cloudbuild.yaml - Dockerfile (pre-download) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-28 22:35:47 -06:00
michael	6baf6cc254	fix: set WHISPER_MODEL env var default to large-v3 Environment variables were overriding config.py with 'base' model. Updated defaults in: - docker-compose.yml - whisper-worker-service.yaml 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-28 21:56:41 -06:00
michael	1395807b23	feat: increase whisper worker memory to 8GB for large-v3 model - docker-compose.yml: 4G -> 8G limit, 2G -> 4G reservation - whisper-worker-service.yaml: 4Gi -> 8Gi limit, 2Gi -> 4Gi request - cloudbuild.yaml: 4Gi -> 8Gi, WHISPER_MODEL=base -> large-v3 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-28 21:23:29 -06:00
michael	05bde8326d	feat: add Whisper-based pause point refinement for audio descriptions Implements word-level speech analysis using faster-whisper to refine AD pause points. Gemini's timestamps are snapped to natural speech gaps (sentence/phrase boundaries) to prevent pauses mid-word. Key changes: - Add WhisperService for transcription and gap detection - Add dedicated Celery task routed to 'whisper' queue - Integrate refinement into render_accessible_video task - Cache Whisper transcripts in MongoDB for reuse across languages - Add dedicated whisper-worker with concurrency=1 to prevent OOM Configuration: - Uses faster-whisper 'base' model (multilingual, ~145MB) - 5-second search window after Gemini's recommended point - Falls back to original timestamp if no gap found Infrastructure: - New Docker stage: whisper-worker - New Cloud Run service: accessible-video-whisper-worker - Updated docker-compose.yml with whisper-worker service 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-27 08:27:48 -06:00
michael	901083b426	fix: ensure temp files use shared volume with correct permissions - Modified render_accessible_video.py to explicitly pass TMPDIR to tempfile.TemporaryDirectory() so files are created in shared volume - Updated docker-compose.yml to run containers as root initially, chown /shared-tmp to app:app, then switch to app user for celery - This ensures both worker containers can access the same temp files 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-27 06:15:45 -06:00
michael	c3bc0bd523	fix: add shared volume between worker containers for temp files The main worker and ffmpeg-worker run in separate containers with isolated filesystems. Add a shared volume mounted at /shared-tmp and set TMPDIR so Python's tempfile module uses it. This allows ffmpeg-worker to access video files created by the main worker. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-27 06:05:06 -06:00
michael	bf1c321088	feat: add dedicated ffmpeg queue to prevent server overload Add a dedicated Celery queue (ffmpeg) with concurrency=1 to serialize all FFmpeg operations. This prevents CPU spikes when multiple render tasks run in parallel with multiple languages. Changes: - Add ffmpeg_operations.py with run_ffmpeg_command and run_ffprobe_command tasks - Update VideoRendererService to dispatch ffmpeg commands via the queue - Add ffmpeg-worker service to docker-compose with --concurrency=1 - Configure main worker to exclude the ffmpeg queue 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-26 17:56:23 -06:00
michael	22e1744f25	suppress verbose MongoDB INFO logs in docker - Add all logging components with verbosity 0 in mongod.conf - Add WiredTiger verbose=[] to suppress checkpoint messages - Add --quiet flag to mongod command - Change healthcheck from mongosh to TCP port check to avoid connection metadata spam every 30s - Increase healthcheck interval from 30s to 60s 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-22 11:26:44 -06:00
michael	d4417acd35	env variables for MSAL added to docker compose	2025-10-10 10:32:54 -05:00
michael	2f90da34c9	configured tts key in docker-compose	2025-10-08 18:44:28 -05:00
michael	99b49fbc1e	added config for mongodb to reduce log verbosity	2025-10-08 18:34:06 -05:00
michael	7921079446	reduced verbosity of mongodb logs	2025-10-08 18:30:08 -05:00
michael	38e6f37450	fixed overly aggressive security algorithm	2025-10-08 17:49:20 -05:00
michael	aafc4078ae	bug fixes for frontend when trying to npm run build	2025-10-08 16:47:14 -05:00
michael	1a1ed3048d	wrote docker files and deployment instructions	2025-10-08 16:00:12 -05:00
michael	de61d0bd39	removed mongodb change stream monitoring, added global websockets monitoring for notifications, broke symmetry between toasts and persistent notifications (and refined which notifications get sent and how)	2025-08-25 15:48:18 -05:00
michael	af2562096a	initial commit	2025-08-24 16:28:33 -05:00

26 commits