semblance-dev

Author	SHA1	Message	Date
Vadym Samoilenko	3e9ccafad2	Add LLM usage tracking infrastructure (Phases A-C) - Model renames: gpt-5.2 → gpt-5.4-2026-03-05, gemini-3-pro-preview → gemini-3.1-pro-preview; retire gpt-4.1 via alias fallback - New: llm_usage_context.py (ContextVar-based attribution), model_pricing.py (tiered pricing + 60s cache), usage_event.py (append-only telemetry), quota.py (user/FG quota enforcement with 80% warning) - Wire _record_usage into all 3 LLM methods; set_llm_context at every service entry point - Fix admin_required decorator (was sync, never awaited User.find_by_id); add active_required and with_user_context decorators - Inject user_id into ContextVar from JWT on every authenticated request - Add DB indexes for usage_events, model_pricing, users collections - Seed script for model pricing (gpt-5.4 single-tier, gemini-3.1 two-tier 200k threshold) - Fix parse_json_response NameError (logger undefined at module level) - 70 passing tests: conftest.py with sys.modules stubs, test_usage_infrastructure.py (52 tests), rewrite stale test_llm_service.py (18 tests) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 18:08:27 +01:00
michael	b1be8f8c38	Add model alias for legacy gpt-5 database entries Focus groups created before the gpt-5.2 rename have llm_model='gpt-5' stored in MongoDB. Without an alias, the backend falls through to the Gemini provider and fails with an aiohttp AssertionError. Adds MODEL_ALIASES mapping and _resolve_model() helper so gpt-5 is transparently resolved to gpt-5.2. Also updates all llm_model checks to accept both values. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 12:06:10 -06:00
michael	b0445de18b	Update GPT-5 to GPT-5.2 and lower default reasoning effort to low Swap model ID from gpt-5 to gpt-5.2 across all backend services, frontend components, and documentation. Change default reasoning effort from medium to low for faster responses. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 11:24:01 -06:00
Michael Clervi	893b537b67	changed permissions	2025-12-19 19:26:16 +00:00
michael	4d9b0afde7	upgraded to Gemini 3.0 Pro (gemini-3-pro-preview) from Gemini 2.5 Pro - Upgraded google-genai package from 1.31.0 to 1.52.0 - Updated DEFAULT_MODEL in llm_service.py to gemini-3-pro-preview - Updated all backend routes, services, and models with new model string - Updated all frontend components with new model string and display labels - Updated CLAUDE.md documentation 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-01 10:37:36 -06:00
michael	6a40936508	major refactor of entire application - migrate sync -> async including pymongo -> motor, flask -> quart, google-generativeai -> google-genai	2025-08-27 15:20:56 -05:00
michael	da8639aee8	fixed folders again, bug fixes for gpt-5, adjusted response length calculation, cosmetic UI changes, other bug fixes	2025-08-09 10:08:45 -05:00
michael	fbb444037a	fixed folders to be database instead of local storage based, implemented gpt-5, fixed key theme export quotes	2025-08-09 06:38:49 -05:00
michael	b649793013	added gpt-4.1 support among other things	2025-08-05 17:38:13 -05:00
michael	da7b2c0448	initial commit	2025-08-04 09:07:59 -05:00

10 commits