semblance

Vadym Samoilenko 915c81b8f1 Complete phases D–G: quota enforcement, token invalidation, admin writes, backfill	2026-04-24 18:34:48 +01:00

Backend:
- token_version in JWT (bump_token_version, get_token_version on User model); jwt_required checks tv claim → 401 on mismatch; login routes embed version
- Quota pre-flight in all 3 LLM public methods (QuotaExceededError bubbles up)
- AI runner catches QuotaExceededError → sets status paused_quota + emits WS event
- Admin routes: POST /users (create), POST /users/<id>/reset-password, POST /pricing, GET /focus-groups with aggregated cost; PUT /users/<id> now bumps token_version on disable or role change
- backfill_usage.py: idempotent estimated-event generator for historical data, tiktoken for GPT models, char/3.8 for Gemini, --dry-run flag

Frontend:
- 402 interceptor dispatches quota_exceeded CustomEvent
- adminApi: createUser, resetPassword, createPricing, listFocusGroups
- UsersTab: New User dialog + Reset Password in edit dialog
- PricingTab: New Price dialog (model, provider, input/output/cached prices)
- FocusGroupsTab: focus groups table sorted by total cost
- Admin.tsx: 4th tab (Focus Groups)
- FocusGroupSession: admin-only cost badge + dismissable quota exceeded banner

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

focus_group.py	Add LLM usage tracking infrastructure (Phases A-C)	2026-04-24 18:08:27 +01:00
folder.py	Apply Jintech security audit remediation (sprint 3) — 87/92 findings fixed	2026-03-20 12:51:18 +00:00
model_pricing.py	Add LLM usage tracking infrastructure (Phases A-C)	2026-04-24 18:08:27 +01:00
persona.py	Apply Jintech security audit remediation (sprint 3) — 87/92 findings fixed	2026-03-20 12:51:18 +00:00
quota.py	Add LLM usage tracking infrastructure (Phases A-C)	2026-04-24 18:08:27 +01:00
usage_event.py	Add LLM usage tracking infrastructure (Phases A-C)	2026-04-24 18:08:27 +01:00
user.py	Complete phases D–G: quota enforcement, token invalidation, admin writes, backfill	2026-04-24 18:34:48 +01:00