amazon-transcreation

Author	SHA1	Message	Date
DJP	bb8ed2a004	Round 2.7: three broken promises — empty TM, supplementary files, new-TM casing Bug 1: Empty tm_channels was silently re-defaulted to [campaign channel] in both agent_single.py and job_tasks.py via `or [channel]`. Python's `or` treats [] as falsy, so the frontend's empty-list intent was lost. Fixed by replacing `or` with an explicit `is not None` check at both sites. Empty list now means "load no TMs"; None still falls back. Bug 2: Supplementary files dropped by Agent1Validator. The validator built FileManifest(...) with explicit kwargs but forgot supplementary_files, so the raw field from _resolve_file_manifest never reached agent_single.run(). Files were uploaded to disk but never inlined into the LLM context. Fixed by adding supplementary_files=raw.get("supplementary_files", []) to the validator's FileManifest construction. Bug 3: New TM channels lowercased in StepReview.tsx, breaking case-sensitive file lookup. On Linux, "flat_primecbmt_nl-be.json" ≠ "flat_PrimeCBMT_nl-be.json", so the file was silently skipped and zero TM entries loaded. Legacy channels worked only because the hardcoded CHANNEL_FILE_MAP has lowercase keys mapping to canonically-cased filenames; auto-discovered channels (PrimeCBM, PrimeCBMT, etc.) had no such safety net. Two-part fix: 3a. StepReview.tsx no longer lowercases tm_channels — preserves case end-to-end from registry → frontend → backend → disk. 3b. _resolve_all_tm_paths builds a case-insensitive index of the locale's TM directory once per call and resolves filenames against it. Forgives any historical case-drift between registry and disk. Verified end-to-end with a standalone test script run inside the backend container: 8/8 assertions pass covering empty tm_channels, supplementary file passthrough, exact-case lookups, lowercase fallback, missing channels, legacy MASS in both cases, and empty tm_channels yielding no TM paths. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-11 10:57:21 -04:00
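Bug 1 and fix 3b above can be sketched as follows. This is a minimal illustration, not the code from agent_single.py, job_tasks.py, or _resolve_all_tm_paths: the function names `effective_tm_channels` and `resolve_tm_file` and their parameters are hypothetical, and only the `flat_<channel>_<lc>.json` filename pattern is taken from the commit messages.

```python
from pathlib import Path


def effective_tm_channels(tm_channels, campaign_channel):
    """Fall back to the campaign channel only when the field is missing (None)."""
    # `tm_channels or [campaign_channel]` would also replace [], because an
    # empty list is falsy; the explicit None check keeps "load no TMs" intact.
    return tm_channels if tm_channels is not None else [campaign_channel]


def resolve_tm_file(tm_dir, channel, locale_code):
    """Resolve flat_<channel>_<locale>.json against tm_dir, ignoring case."""
    # Build the lowercase-name -> real-path index once per call.
    index = {p.name.lower(): p for p in Path(tm_dir).iterdir() if p.is_file()}
    wanted = f"flat_{channel}_{locale_code}.json".lower()
    return index.get(wanted)  # None when no file matches in any casing
```

With this shape, `effective_tm_channels([], "MASS")` returns `[]` while `effective_tm_channels(None, "MASS")` returns `["MASS"]`, and a lowercased channel name still resolves to the canonically cased file on a case-sensitive filesystem.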
DJP	d3f6a57386	Round 2.5 feedback: TM replacements take effect, supplementary files reach LLM, larger briefs fit, free-text channel uploads TM upload-replacement bug (critical): - Uploads were writing to /storage/clients/<uuid>/tm/... but the pipeline reads from /storage/amazon/tm/... — replacements were silently ignored - upload_tm_file now writes to the canonical pipeline path /storage/amazon/tm/<locale>/flat_<channel>_<lc>.json (overwrites in place) - Filename casing is preserved when an existing file is being replaced (the on-disk seeded files use mixed casing: flat_MASS, flat_value, flat_PrimeSpeed); falls back to CHANNEL_FILE_MAP, then user-typed case - Registry upsert by (client_id, locale_code, channel): replaces row in place rather than inserting duplicates - Verified: replacement file at canonical path, registry COUNT=1, no dupes Supplementary files now reach the LLM (critical): - New supplementary_files field on FileManifest - _resolve_file_manifest scans /storage/jobs/<job_id>/supplementary/ and populates the manifest, with per-locale gating by filename prefix (e.g. de-DE_glossary.txt only goes to de-DE; global_brief.txt goes to all) - _format_supplementary_for_prompt reads each file (.txt/.md/.json/.csv/.tsv/.docx) and inlines its text into the LLM user message under a "## SUPPLEMENTARY MATERIAL" header, capped at 40k chars per file - .docx files are extracted via inline zipfile read (no new dependency) New job wizard: - Per-supplementary-file locale dropdown ("Global" or one of 12 locales) - Filename gets prefixed with the locale on upload (de-DE_brief.docx) Admin TM upload: - Channel field is now a free-text input with autocomplete suggestions (datalist of known channels) — lets users add brand-new channels like PrimeCBM that didn't exist before Pipeline scaling: - Bumped dynamic max_tokens tiers: 80+ lines now gets 64k output budget (was 32k); 132-line briefs no longer truncate. Sonnet 4.6 caps at 64k - Added stop_reason logging — "max_tokens" stop now shows up in logs loud and clear rather than silently truncating Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 14:28:20 -04:00
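The per-locale gating and the 40k-character cap described in this commit might look roughly like the sketch below. The names `supplementary_for_locale`, `cap_text`, and `LOCALE_PREFIX` are hypothetical, the prefix regex is a guess at the filename convention, and treating unprefixed files as global is an assumption the commit message does not confirm.

```python
import re

# Assumed prefix shape: "<ll-CC>_" marks a locale-specific file (de-DE_glossary.txt).
LOCALE_PREFIX = re.compile(r"^([a-z]{2}-[A-Z]{2})_")

# Per-file cap taken from the commit message ("capped at 40k chars per file").
MAX_CHARS_PER_FILE = 40_000


def supplementary_for_locale(filenames, locale):
    """Keep files prefixed with this locale plus files that carry no locale prefix."""
    selected = []
    for name in filenames:
        match = LOCALE_PREFIX.match(name)
        if match is None:
            # "global_" or unprefixed: assumed here to apply to every locale.
            selected.append(name)
        elif match.group(1) == locale:
            selected.append(name)
    return selected


def cap_text(text, limit=MAX_CHARS_PER_FILE):
    """Truncate one file's text before it is inlined into the prompt."""
    return text[:limit]
```

Given `["de-DE_glossary.txt", "global_brief.txt", "fr-FR_brief.docx"]`, the de-DE locale would receive the first two files and fr-FR the last two.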
DJP	9825b0497c	Round 2 feedback: parser fix, dynamic max_tokens, polling, TM auto-discovery, reviewer comments in export A1 Export columns shifted (critical): - V25 LLM occasionally emits 12/13-col tables with Copy Type/Char Limit prefix - Parser now anchors on "Option 1" header position; robust to any prefix shift - Verified with 23/23 unit tests covering 11/12/13-col variants - Source-line block in prompt no longer uses pipe separators (defence in depth) A2 Linguistic summary fallback: - Drop the metadata key/value table fallback on Tab 2 - Show "No linguistic summary was generated" when the agent didn't produce one A3 Dashboard stuck on "Running": - useJobs / useJob now poll every 5s while any job/locale is in an active state - Stops polling once everything is COMPLETED or ERROR B1 TM auto-config: respect empty selection - Send no TM files when user unchecks all (was auto-adding campaign channel) - Backend distinguishes empty list vs missing field B2 Auto-discover channels from TM registry: - New GET /api/v1/files/tm/channels endpoint reads distinct channels from registry - Frontend StepConfigure fetches channels per client; falls back to static list - Pipeline TM resolution falls back to flat_<Channel>_<lc>.json pattern for any registered channel (no hardcoded map needed for new channels like PrimeCBM) B3 Job inputs visible on monitoring: - New "Inputs sent to the agent" card on /jobs/[id] showing AI model, TM files, supplementary file list, and context override - New GET /api/v1/jobs/{id}/supplementary endpoint listing on-disk supplementary files C1 Context cap (large briefs truncating): - max_tokens scales with source line count (8k/16k/32k/64k by tier) - 172-line briefs now have ~64k output budget instead of fixed 16k D1 Reviewer comments in xlsx export: - Export endpoint now copies xlsx to temp path on download, queries Feedback joined with User, and appends "Reviewer (Name): comment" to the rationale cells of options that have feedback - Original generated file remains untouched D2 Hide Clients & Voice from sidebar (page still reachable by URL) D3 Remove dead notifications + settings icons from header D4 Cost by Locale table added to Analytics with total + avg cost per brief Makefile seed target now also runs register_storage_files so TM registry is populated from disk on first setup (deploy.sh already does this via --init). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-05-04 16:12:47 -04:00
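The A1 fix (anchoring on the "Option 1" header so that extra leading columns cannot shift the export) can be illustrated with a small sketch. The helper names `split_row` and `option_cells` are hypothetical, and the sample column names other than "Option 1", "Copy Type", and "Char Limit" are made up for illustration; the real parser is not shown in the log.

```python
def split_row(line):
    """Split one markdown table row into stripped cell strings."""
    text = line.strip()
    # Drop exactly one leading and one trailing pipe so empty edge cells survive.
    if text.startswith("|"):
        text = text[1:]
    if text.endswith("|"):
        text = text[:-1]
    return [cell.strip() for cell in text.split("|")]


def option_cells(header_line, row_line):
    """Return the row's cells starting at the column headed 'Option 1'."""
    header = split_row(header_line)
    start = header.index("Option 1")  # raises ValueError if the header is absent
    return split_row(row_line)[start:]
```

Because the slice starts wherever "Option 1" appears, a table with a "Copy Type / Char Limit" prefix and one without it yield the same option cells.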
DJP	d5fa4e49f7	Fix markdown table parser losing backtranslations/rationales, add model selection, update help page The V25 table has duplicate column names (Backtranslation x3, Rationale x3). The dict-based parser collapsed these — only the last value survived (Option 3's "N/A"), causing all BT/rationale fields to be "N/A" in the output Excel. Fixed by switching to positional list-based parsing instead of dicts. Also adds per-job model selection (Sonnet 4.6 / Opus 4.6) through the full stack: DB column, API schema, job wizard UI dropdown, pipeline contracts, and LLM client with model-aware cost tracking. Includes Alembic migration. Updated help page and README to reflect single-agent pipeline, multi-TM selection, flat locale grid, model selector, and linguistic summary. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-14 12:40:17 -04:00
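The duplicate-column collapse described in this commit is easy to reproduce. The sketch below assumes the option columns come in (Option, Backtranslation, Rationale) triples; that layout, the sample cell values, and the function name `parse_options` are illustrative assumptions rather than the project's actual parser.

```python
HEADER = ["Option 1", "Backtranslation", "Rationale",
          "Option 2", "Backtranslation", "Rationale"]
ROW = ["Jetzt kaufen", "Buy now", "Direct CTA", "N/A", "N/A", "N/A"]

# Dict-based parsing: repeated keys overwrite each other, so only the last
# "Backtranslation" and "Rationale" values survive.
as_dict = dict(zip(HEADER, ROW))


def parse_options(cells, width=3):
    """Positional parsing: read consecutive (text, backtranslation, rationale) triples."""
    options = []
    for i in range(0, len(cells) - width + 1, width):
        text, backtranslation, rationale = cells[i:i + width]
        options.append({
            "text": text,
            "backtranslation": backtranslation,
            "rationale": rationale,
        })
    return options
```

Here `as_dict["Backtranslation"]` ends up as `"N/A"` (the last option's value), while `parse_options(ROW)[0]["backtranslation"]` keeps `"Buy now"`.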
DJP	d56311b862	Implement standalone agent feedback: consolidated locale selector, multi-TM selection, single-agent pipeline, and linguistic summary Four changes from user testing feedback: 1. Merge MAIN/DERIVED locale selectors into single 12-locale grid, auto-classify locale_type 2. Add multi-TM channel selection (checkbox grid, tm_channels JSON column, multi-file resolution) 3. Replace 6-agent pipeline with single V25-based agent (feature-flagged via USE_SINGLE_AGENT) 4. Replace Excel Tab 2 metadata with linguistic summary from agent output Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-14 12:09:51 -04:00
DJP	5e0a148b96	feat: add token usage tracking, feedback highlighting, cost on cards, help page - Wire token usage from LLM agents through pipeline context to DB and frontend - Agents 2 and 4 accumulate input/output tokens and cost into PipelineContext - job_tasks.py saves token totals to locale instance after pipeline completion - Monitoring cards show total tokens and estimated cost instead of broken 0/0 - Make feedback highlighting bolder: colored card borders, stronger button states - Add estimated cost display to dashboard job cards - Add Help page with full documentation and link in sidebar navigation - Comprehensive README with ASCII architecture diagrams Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 16:47:36 -04:00
DJP	f2398e04c4	feat: add real-time progress tracking and admin job deletion Progress tracking: - Add progress (0-100%) and current_stage columns to locale_instances - Wire orchestrator on_progress callback to update DB at each pipeline stage - Agent 4 reports batch-level sub-progress (e.g. "Translating batch 2/4") - Frontend reads real progress/stage data instead of hardcoding 50% - Stages: Loading Files → Matching TM → Ranking → Translating (per-batch) → Reviewing → Formatting → Complete Job deletion: - DELETE /jobs/{id} endpoint (admin-only, 403 for non-admins) - Cannot delete running jobs (must cancel first) - Cascades to locale instances, output rows, source lines - Frontend: Delete button with confirmation on job monitoring page (admin only) Also: compute live duration_seconds from started_at, map pipeline stages to UI status badges. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 16:18:59 -04:00
DJP	7a0971a029	feat: implement real LLM agents 2-5 for live transcreation pipeline Replace all stub agents with working Claude API-powered agents: - Agent 2 (TM Retrieval): LLM semantic matching of source lines against TM entries - Agent 3 (Ranker): Deterministic ranking with confidence tiers (high/moderate/low) - Agent 4 (Transcreator): Batched creative transcreation with voice profiles, reference files, backtranslations - Agent 5 (Compliance): Deterministic checks for character limits, blacklist terms, domain substitution Also fixes TM file loader to handle real compact JSONL format (locale code regex-based parsing), and adds file manifest resolution for reference files (glossary, blacklist, TOV, locale considerations). Verified end-to-end: 53-line de-DE brief produces real German translations with TM matching, confidence-based option counts (1/2/3), backtranslations, and compliance validation. ~$0.49 total cost. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 15:26:41 -04:00
DJP	98fa16bfc3	feat: complete Phase 1-2 scaffold — backend, frontend, pipeline skeleton Full-stack Amazon AI Transcreation Platform with: - FastAPI backend (async, PostgreSQL, Redis, Celery) with 11 DB tables - JWT auth (SSO-ready abstract provider pattern) - 6-agent pipeline orchestrator with deterministic modules - Next.js 14 frontend with Amazon branding (Ember fonts, orange/dark theme) - Job wizard, monitoring HUD, output review, admin screens - 154 TM/reference files imported, 12 locales configured - Docker Compose for all services Agents 2-5 (TM retrieval, ranker, transcreator, compliance) are stubs pending Phase 3 LLM integration. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 12:31:43 -04:00

9 commits