video-accessibility

michael 05bde8326d	2025-12-27 08:27:48 -06:00
feat: add Whisper-based pause point refinement for audio descriptions

Implements word-level speech analysis using faster-whisper to refine AD pause points. Gemini's timestamps are snapped to natural speech gaps (sentence/phrase boundaries) to prevent pauses mid-word.

Key changes:
- Add WhisperService for transcription and gap detection
- Add dedicated Celery task routed to 'whisper' queue
- Integrate refinement into render_accessible_video task
- Cache Whisper transcripts in MongoDB for reuse across languages
- Add dedicated whisper-worker with concurrency=1 to prevent OOM

Configuration:
- Uses faster-whisper 'base' model (multilingual, ~145MB)
- 5-second search window after Gemini's recommended point
- Falls back to original timestamp if no gap found

Infrastructure:
- New Docker stage: whisper-worker
- New Cloud Run service: accessible-video-whisper-worker
- Updated docker-compose.yml with whisper-worker service

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
__pycache__	initial commit	2025-08-24 16:28:33 -05:00
app	feat: add Whisper-based pause point refinement for audio descriptions	2025-12-27 08:27:48 -06:00
tests	initial commit	2025-08-24 16:28:33 -05:00
.dockerignore	fixed dockerignore	2025-10-08 17:17:39 -05:00
.dockerignore.old	wrote docker files and deployment instructions	2025-10-08 16:00:12 -05:00
.env	initial commit	2025-08-24 16:28:33 -05:00
.env.example	initial commit	2025-08-24 16:28:33 -05:00
.gitignore	fixed front end build errors	2025-10-10 10:26:57 -05:00
celery_worker.py	feat: add dedicated ffmpeg queue to prevent server overload	2025-12-26 17:56:23 -06:00
cors-config.json	initial commit	2025-08-24 16:28:33 -05:00
create_test_users.py	added production user role and made it default for new MSAL users - production can access everything EXCEPT user management - that's only for admin	2025-10-10 10:07:30 -05:00
debug_login.py	initial commit	2025-08-24 16:28:33 -05:00
Dockerfile	feat: add Whisper-based pause point refinement for audio descriptions	2025-12-27 08:27:48 -06:00
Dockerfile.old	wrote docker files and deployment instructions	2025-10-08 16:00:12 -05:00
gunicorn_conf.py	initial commit	2025-08-24 16:28:33 -05:00
migrate.py	initial commit	2025-08-24 16:28:33 -05:00
optical-414516-80e2475f6412.json	initial commit	2025-08-24 16:28:33 -05:00
poetry.lock	upgrade to Gemini 3 Pro preview model	2025-12-22 14:02:02 -06:00
pyproject.toml	feat: add Whisper-based pause point refinement for audio descriptions	2025-12-27 08:27:48 -06:00
setup_secrets.py	initial commit	2025-08-24 16:28:33 -05:00
simple_login_test.py	initial commit	2025-08-24 16:28:33 -05:00
test_auth.py	initial commit	2025-08-24 16:28:33 -05:00
test_db.py	initial commit	2025-08-24 16:28:33 -05:00
test_endpoint.py	initial commit	2025-08-24 16:28:33 -05:00
test_mp3_serving.py	initial commit	2025-08-24 16:28:33 -05:00
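
The pause-point refinement described in the latest commit message (snap a suggested timestamp to a speech gap within a 5-second window, otherwise keep the original) could look roughly like the sketch below. This is not the repository's WhisperService code. The `Word` type, the `refine_pause_point` function, the `min_gap` threshold of 0.3 s, and the choice of the first qualifying gap's midpoint are all assumptions made for illustration. Only the 5-second window and the fallback behaviour come from the commit message. The word timings are assumed to be available as start/end times in seconds, sorted by start time.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    """Hypothetical container for one transcribed word with timings in seconds."""
    text: str
    start: float
    end: float


def refine_pause_point(
    words: List[Word],
    suggested: float,
    window: float = 5.0,   # search window after the suggested point (from the commit message)
    min_gap: float = 0.3,  # assumed minimum silence length; not taken from the source
) -> float:
    """Return a pause time that falls in a silence between words.

    Scans consecutive word pairs (assumed sorted by start time) and returns
    the midpoint of the first gap that is at least `min_gap` long and whose
    midpoint lies in [suggested, suggested + window]. If no such gap exists,
    the original `suggested` timestamp is returned unchanged.
    """
    limit = suggested + window
    for prev, nxt in zip(words, words[1:]):
        gap = nxt.start - prev.end
        if gap < min_gap:
            continue  # too short to be a natural pause
        midpoint = (prev.end + nxt.start) / 2
        if suggested <= midpoint <= limit:
            return midpoint
    return suggested  # fallback: no suitable gap found


if __name__ == "__main__":
    demo = [
        Word("Hello", 0.0, 0.5),
        Word("world.", 0.55, 1.0),
        Word("Next", 1.8, 2.2),
        Word("sentence", 2.25, 2.9),
    ]
    # 0.7 s falls inside "world."; the sketch moves it into the following silence.
    print(refine_pause_point(demo, suggested=0.7))
```

Picking the first gap keeps the pause close to the suggested point. Preferring the longest gap in the window would be an equally plausible policy, and the commit message does not say which one the project uses.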