amazon-transcreation

Author	SHA1	Message	Date
DJP	9825b0497c	Round 2 feedback: parser fix, dynamic max_tokens, polling, TM auto-discovery, reviewer comments in export A1 Export columns shifted (critical): - V25 LLM occasionally emits 12/13-col tables with Copy Type/Char Limit prefix - Parser now anchors on "Option 1" header position; robust to any prefix shift - Verified with 23/23 unit tests covering 11/12/13-col variants - Source-line block in prompt no longer uses pipe separators (defence in depth) A2 Linguistic summary fallback: - Drop the metadata key/value table fallback on Tab 2 - Show "No linguistic summary was generated" when the agent didn't produce one A3 Dashboard stuck on "Running": - useJobs / useJob now poll every 5s while any job/locale is in an active state - Stops polling once everything is COMPLETED or ERROR B1 TM auto-config: respect empty selection - Send no TM files when user unchecks all (was auto-adding campaign channel) - Backend distinguishes empty list vs missing field B2 Auto-discover channels from TM registry: - New GET /api/v1/files/tm/channels endpoint reads distinct channels from registry - Frontend StepConfigure fetches channels per client; falls back to static list - Pipeline TM resolution falls back to flat_<Channel>_<lc>.json pattern for any registered channel (no hardcoded map needed for new channels like PrimeCBM) B3 Job inputs visible on monitoring: - New "Inputs sent to the agent" card on /jobs/[id] showing AI model, TM files, supplementary file list, and context override - New GET /api/v1/jobs/{id}/supplementary endpoint listing on-disk supplementary files C1 Context cap (large briefs truncating): - max_tokens scales with source line count (8k/16k/32k/64k by tier) - 172-line briefs now have ~64k output budget instead of fixed 16k D1 Reviewer comments in xlsx export: - Export endpoint now copies xlsx to temp path on download, queries Feedback joined with User, and appends "Reviewer (Name): comment" to the rationale cells of options that have feedback - Original generated file remains untouched D2 Hide Clients & Voice from sidebar (page still reachable by URL) D3 Remove dead notifications + settings icons from header D4 Cost by Locale table added to Analytics with total + avg cost per brief Makefile seed target now also runs register_storage_files so TM registry is populated from disk on first setup (deploy.sh already does this via --init). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-05-04 16:12:47 -04:00
DJP	d56311b862	Implement standalone agent feedback: consolidated locale selector, multi-TM selection, single-agent pipeline, and linguistic summary Four changes from user testing feedback: 1. Merge MAIN/DERIVED locale selectors into single 12-locale grid, auto-classify locale_type 2. Add multi-TM channel selection (checkbox grid, tm_channels JSON column, multi-file resolution) 3. Replace 6-agent pipeline with single V25-based agent (feature-flagged via USE_SINGLE_AGENT) 4. Replace Excel Tab 2 metadata with linguistic summary from agent output Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-14 12:09:51 -04:00
DJP	e97d4f81b7	fix: improve TM parser EN/TX split and fix report SQL errors The compact TM format parser was storing the combined EN+TX text in both fields, causing the LLM retrieval agent to fail at matching source lines against TM entries — resulting in all-low confidence tiers. Added _split_en_tx() heuristic that detects the language boundary at the first non-ASCII sentence. Also includes raw _text in LLM prompt for context. Fixed get_jobs_over_time GroupingError by using literal_column for date_trunc, added date filters to status_breakdown, and fixed Decimal serialization in locale stats. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 17:47:53 -04:00
DJP	7a0971a029	feat: implement real LLM agents 2-5 for live transcreation pipeline Replace all stub agents with working Claude API-powered agents: - Agent 2 (TM Retrieval): LLM semantic matching of source lines against TM entries - Agent 3 (Ranker): Deterministic ranking with confidence tiers (high/moderate/low) - Agent 4 (Transcreator): Batched creative transcreation with voice profiles, reference files, backtranslations - Agent 5 (Compliance): Deterministic checks for character limits, blacklist terms, domain substitution Also fixes TM file loader to handle real compact JSONL format (locale code regex-based parsing), and adds file manifest resolution for reference files (glossary, blacklist, TOV, locale considerations). Verified end-to-end: 53-line de-DE brief produces real German translations with TM matching, confidence-based option counts (1/2/3), backtranslations, and compliance validation. ~$0.49 total cost. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 15:26:41 -04:00
DJP	f271343bc0	feat: wire job wizard and dashboard to real backend API - Job wizard now calls real API: create job → upload source → launch - Dashboard and monitoring pages use live data instead of mock data - Monitoring page polls every 3s while job is active - Backend enriches job responses with client_name, created_by_name, source_line_count from eager-loaded relationships - Frontend response mappers handle backend→frontend type differences (lowercase enum values, field name mapping, computed progress/stage) - Source file parser accepts column aliases (Line type, Context notes) with case-insensitive matching for real-world Excel files - Clients list endpoint accessible to all authenticated users - Fixed uploadSource to use PUT, uploadSupplementary per-file - Removed all hardcoded mock data from useJobs hook Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 14:18:47 -04:00
DJP	98fa16bfc3	feat: complete Phase 1-2 scaffold — backend, frontend, pipeline skeleton Full-stack Amazon AI Transcreation Platform with: - FastAPI backend (async, PostgreSQL, Redis, Celery) with 11 DB tables - JWT auth (SSO-ready abstract provider pattern) - 6-agent pipeline orchestrator with deterministic modules - Next.js 14 frontend with Amazon branding (Ember fonts, orange/dark theme) - Job wizard, monitoring HUD, output review, admin screens - 154 TM/reference files imported, 12 locales configured - Docker Compose for all services Agents 2-5 (TM retrieval, ranker, transcreator, compliance) are stubs pending Phase 3 LLM integration. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-10 12:31:43 -04:00

6 commits