video-accessibility

Author	SHA1	Message	Date
Vadym Samoilenko	77a9d3b255	fix(docker): add ffmpeg to base image — fixes pydub AudioSegment in worker ffmpeg was missing from the base image, causing all pydub operations (AudioSegment.from_file, export) to fail in worker and tts-worker containers. Moved ffmpeg install from whisper-worker stage to the shared base stage so all container variants (api, worker, tts-worker, whisper-worker) have it. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 19:12:57 +01:00
Vadym Samoilenko	89fa87ba8a	refactor(docker): remove ffmpeg from api/worker images — runs on Cloud Run Jobs Heavy pipeline tasks (ingest, translate, render, tts) now dispatch to va-worker Cloud Run Job which has its own Dockerfile.cloudrun with ffmpeg. API and lightweight Celery worker (notify/embed) don't need it. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 22:08:25 +01:00
Vadym Samoilenko	85e1e852ed	fix: add --no-root to poetry install in Dockerfiles (Poetry 2.x)	2026-04-29 14:35:28 +01:00
Vadym Samoilenko	fd154e7799	fix: upgrade poetry in Dockerfile from 1.8.2 to 2.1.4 poetry.lock was generated with 2.1.4 — using 1.8.2 caused incompatible lock file error and failed Docker builds. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 14:21:42 +01:00
Vadym Samoilenko	743a8597c2	fix: auto-sync poetry.lock during Docker build Prevents build failures when pyproject.toml changes without a lock regen. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 14:21:01 +01:00
michael	9580979ac8	feat: add environment-based worker concurrency for Cloud Run mode Allow configuring Celery worker concurrency via environment variables to take advantage of Cloud Run autoscaling: - Add WORKER_CONCURRENCY, WHISPER_WORKER_CONCURRENCY, FFMPEG_WORKER_CONCURRENCY settings to config.py with recommended values documented - Update Dockerfile to use ${WORKER_CONCURRENCY} and ${WHISPER_WORKER_CONCURRENCY} environment variables instead of hardcoded values - Update docker-compose.yml to pass concurrency env vars to worker commands - Add WHISPER_SERVICE_URL and FFMPEG_SERVICE_URL to relevant workers Recommended settings: Local mode: WHISPER=1, FFMPEG=1 (CPU/RAM constrained) Cloud Run mode: WHISPER=10, FFMPEG=20 (match autoscaling limits) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-02 10:27:07 -06:00
michael	e8fde7962f	chore: increase accessible-video-worker concurrency from 4 to 8 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-28 23:20:58 -06:00
michael	54638d1065	feat: switch Whisper model from large-v3 to medium Medium model is faster and uses less memory (~1.5GB vs ~3GB) while still providing good multilingual transcription quality. Updated in: - config.py - docker-compose.yml - whisper-worker-service.yaml - cloudbuild.yaml - Dockerfile (pre-download) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-28 22:35:47 -06:00
michael	4f82fad5dd	feat: pre-download Whisper large-v3 model during Docker build Downloads the model (~3GB) at build time to avoid cold start delays. Also updated comment to reflect large-v3 memory usage (~4-6GB). 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-28 21:25:44 -06:00
michael	05bde8326d	feat: add Whisper-based pause point refinement for audio descriptions Implements word-level speech analysis using faster-whisper to refine AD pause points. Gemini's timestamps are snapped to natural speech gaps (sentence/phrase boundaries) to prevent pauses mid-word. Key changes: - Add WhisperService for transcription and gap detection - Add dedicated Celery task routed to 'whisper' queue - Integrate refinement into render_accessible_video task - Cache Whisper transcripts in MongoDB for reuse across languages - Add dedicated whisper-worker with concurrency=1 to prevent OOM Configuration: - Uses faster-whisper 'base' model (multilingual, ~145MB) - 5-second search window after Gemini's recommended point - Falls back to original timestamp if no gap found Infrastructure: - New Docker stage: whisper-worker - New Cloud Run service: accessible-video-whisper-worker - Updated docker-compose.yml with whisper-worker service 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-27 08:27:48 -06:00
michael	6acb452cfa	fix: add render queue to Celery worker The accessible video render task was being dispatched to the 'render' queue but no worker was listening to it. Added 'render' to: - Dockerfile CMD args for worker queue list - celery_worker.py import and log message 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-26 11:39:34 -06:00
michael	093b55c473	fix: add ffmpeg to API container for TTS audio conversion The Gemini TTS service uses pydub which requires ffmpeg to convert audio formats. Previously only the Worker container had ffmpeg. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-22 14:55:14 -06:00
michael	06f958c974	fixed docker file for dependency installs	2025-10-08 17:06:41 -05:00
michael	1a1ed3048d	wrote docker files and deployment instructions	2025-10-08 16:00:12 -05:00
michael	af2562096a	initial commit	2025-08-24 16:28:33 -05:00

15 commits