video-accessibility/backend/app/services
Vadym Samoilenko fddf803b74 feat(translation): enforce EN-first pipeline with cue-preserving translations
All translations now derive strictly from the approved English master VTT,
eliminating the cue-count and timestamp drift reported by linguists
(e.g. PL AD = 11 cues vs EN AD = 17 cues).

Key changes:
- Remove video_native translation mode entirely; all languages go through
  translate_vtt() which guarantees 1:1 cue alignment with EN master
- Transcreation languages now use translate_vtt(style="transcreate") —
  same cue-preserving contract, culturally-adapted instructions
- Post-translation cue alignment validator added (VTTEditor.assert_cue_alignment)
- After ingestion, job moves to PENDING_QC (EN-only) instead of TRANSLATING;
  translation pipeline dispatches automatically when EN QC is approved
- New POST /jobs/{id}/retranslate-language endpoint for PM/admin to fix
  legacy video_native jobs on demand
- Frontend: origin badge (EN-aligned / transcreated / video-native warning),
  EN-first gate banner on target-language cards, Re-translate from EN button
  with confirm modal, removed translation mode selector from NewJob

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-06 12:11:35 +01:00
..
audit_logger.py chore: push all session changes — backend hardening, tests, apache config, deploy scripts 2026-04-30 15:52:14 +01:00
cloud_run_dispatch.py feat(translation): enforce EN-first pipeline with cue-preserving translations 2026-05-06 12:11:35 +01:00
cost_tracker.py chore: push all session changes — backend hardening, tests, apache config, deploy scripts 2026-04-30 15:52:14 +01:00
descriptive_transcript.py chore: push all session changes — backend hardening, tests, apache config, deploy scripts 2026-04-30 15:52:14 +01:00
elevenlabs_voices.py chore: push all session changes — backend hardening, tests, apache config, deploy scripts 2026-04-30 15:52:14 +01:00
emailer.py test: fix all unit tests — 168 passing, 0 failures 2026-04-30 14:02:04 +01:00
embedding_service.py fix: switch embedding model to gemini-embedding-001 2026-04-29 16:02:12 +01:00
ffmpeg_http_service.py chore: push all session changes — backend hardening, tests, apache config, deploy scripts 2026-04-30 15:52:14 +01:00
gcs.py test: fix all unit tests — 168 passing, 0 failures 2026-04-30 14:02:04 +01:00
gemini.py feat(translation): enforce EN-first pipeline with cue-preserving translations 2026-05-06 12:11:35 +01:00
gemini_tts.py fix(tts): convert lameenc bytearray to bytes before GCS upload 2026-04-30 19:35:28 +01:00
glossary_service.py fix(glossary+jobs): add debug logging for glossary failures and fix AllJobs filter stale state 2026-04-30 21:25:41 +01:00
language_qc.py feat(translation): enforce EN-first pipeline with cue-preserving translations 2026-05-06 12:11:35 +01:00
membership_service.py fix(membership): correct \$unwind option preserveNullAndEmpty → preserveNullAndEmptyArrays 2026-05-01 18:58:07 +01:00
microsoft_auth.py chore: push all session changes — backend hardening, tests, apache config, deploy scripts 2026-04-30 15:52:14 +01:00
secrets_manager.py chore: push all session changes — backend hardening, tests, apache config, deploy scripts 2026-04-30 15:52:14 +01:00
tts.py test: fix all unit tests — 168 passing, 0 failures 2026-04-30 14:02:04 +01:00
validation.py chore: push all session changes — backend hardening, tests, apache config, deploy scripts 2026-04-30 15:52:14 +01:00
video_renderer.py fix(renderer): skip ffprobe Phase 3.5 — use pre-computed freeze duration 2026-05-01 17:29:33 +01:00
vtt_retimer.py fix: use actual freeze segment durations for VTT subtitle retiming 2026-01-05 15:52:57 -06:00
vtt_versioning.py chore: push all session changes — backend hardening, tests, apache config, deploy scripts 2026-04-30 15:52:14 +01:00
websocket.py fix(authz): MT-3/6/7/8 org isolation + P1 English-first QC enforcement 2026-05-01 11:43:10 +01:00
websocket_publisher.py chore: push all session changes — backend hardening, tests, apache config, deploy scripts 2026-04-30 15:52:14 +01:00
whisper_http_service.py chore: push all session changes — backend hardening, tests, apache config, deploy scripts 2026-04-30 15:52:14 +01:00
whisper_service.py feat(pause-insert): adaptive buffer, forward-snap, timeline drag + share link fix 2026-05-01 16:09:09 +01:00
zip_download.py feat: DCMP compliance, descriptive transcript, new languages, QA bug fixes 2026-03-27 11:50:43 +00:00