cohorta

Author	SHA1	Message	Date
Vadym Samoilenko	4c307bf00d	feat: increase trial credits from 10 to 50 on signup Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-23 21:05:13 +01:00
Vadym Samoilenko	a9a5fff659	feat: full visual rebrand + landing redesign + auth page refresh + email fix - Landing: extract 513-line monolith into 12 focused section components (Hero, StatsBand, FeatureGrid, HowItWorks, LivePreview, Comparison, UseCases, Testimonials, Pricing, FAQ, FinalCTA, TrustBar) - Auth pages: replace flat orange panel with animated live mock (real persona SVGs, typewriter messages, theme bars); Login label fixed to "Email or username"; Register wires ?plan= badge - Brand: new Logo SVG (C-arc + 3 figures + wordmark/tagline), expanded palette tokens, fluid display type scale, framer-motion shared variants - Header: scroll progress bar, removed non-functional language pill - Footer: fixed all dead links, legal stubs, new logo - Legal: /about /privacy /terms /cookies /gdpr real pages added - Email: FROM_EMAIL default fixed to noreply@ai-impress.com (verified apex domain), HTML template rewritten to match new brand - Tooling: Playwright screenshot script for visual self-check Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-23 21:02:03 +01:00
Vadym Samoilenko	9d2f1f2c7d	feat: complete AIMPRESS visual rebrand — warm palette, new landing, real dashboard Some checks failed Deploy to Production / deploy (push) Failing after 0s Details - Replace cyan/violet design tokens with warm dark slate + orange (#E89B3C) palette - Add Space Grotesk display font; new utilities: .outline-display, .orange-band, .corner-card, .persona-orb - New brand components: Logo (hexagonal SVG), Header (pill nav + glass blur), Footer (4-col), PublicLayout, AppLayout, UserDropdown - Rewrite Index.tsx as full sales funnel: Hero → Stats → Orange band → How it works → Pricing (API) → FAQ → Final CTA - Rewrite Dashboard.tsx with real API data: credits balance, MTD spend, personas count, focus groups count, active tasks, recent transactions - Rewrite auth pages (Login, Register, VerifyEmail, NotFound, Billing) with two-column orange-panel layout - Replace hardcoded mock numbers in Dashboard with billingApi / personasApi / focusGroupsApi / usageApi calls - Delete legacy components: Navigation.tsx, Hero.tsx, FeatureCard.tsx - Add nested layout routing in App.tsx: PublicLayout for guests, AppLayout for protected routes - Color sweep inner pages: replace all purple-500/600 with primary token - Purge all semblance / Oliver / optical-dev references; rename semblance_app_documentation.md → cohorta_app_documentation.md; update backend scripts to cohorta_db Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-23 19:44:02 +01:00
Vadym Samoilenko	e01569c412	feat: commit all app changes — billing API, new auth, design overhaul All checks were successful Deploy to Production / deploy (push) Successful in 2m23s Details Includes frontend redesign (Navigation, billingApi), backend updates (auth routes, admin routes, LLM service refactor), MSAL removal, and dependency updates. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-23 19:04:43 +01:00
Vadym Samoilenko	5491d2d73d	Rebrand to Cohorta + full UI redesign + registration with email verification Some checks failed Deploy to Production / deploy (push) Failing after 0s Details - Complete dark-theme redesign inspired by ai-impress.com (navy + cyan + violet palette) - New Syne display font + gradient logo mark + SVG favicon - New Navigation: glass-morphism, gradient logo, Get Started CTA - New Hero: animated glow orbs, mock focus-group chat UI, stats row - New landing: Features grid, How-It-Works steps, CTA banner - New Footer: AImpress LTD branding, © AImpress LTD. All rights reserved. - New Login page: dark card, password visibility toggle, link to Register - New Register page: full form, benefits row, 50 free credits pitch - New VerifyEmail page: token verification flow with auto-redirect - Backend: email_service.py using Resend API for verification emails - Backend: /api/auth/register, /verify-email, /resend-verification endpoints - User model: email_verified, email_verify_token, email_verify_expires fields - Gitea Actions CI/CD: auto-deploy to aimpress server on push to main Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-23 18:40:08 +01:00
Vadym Samoilenko	7b6a7c7347	Fix admin filters: ISO Z parsing crash + All time period returning month data Two bugs caused filters to show 0 and period selector to have no effect: 1. Python < 3.11 can't parse JS toISOString() Z suffix — every request with a period filter threw ValueError → 500 → frontend received no data. Fixed with _parse_iso() helper that replaces Z with +00:00 before fromisoformat(). 2. 'All time' sends no from/to params, but backend defaulted to _month_start() instead of omitting the ts filter. Fixed with _period_match() helper that returns {} (no filter) when both from and to are absent. Also: stale _user_mtd_cost reference in get_user route replaced with _user_period_cost(user_id, None, None); adminApi types updated with from/to. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 19:17:57 +01:00
Vadym Samoilenko	ad619d45fc	Improve live token extraction: warn on missing usage_metadata, capture thinking tokens - Add WARNING log when usage_metadata/usage is None so zero-cost events are visible in logs instead of silently disappearing - Capture thoughts_token_count from Gemini thinking models into reasoning field (already included in candidates_token_count for billing, now also tracked separately) - Add same warning for OpenAI missing usage object Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 19:13:39 +01:00
Vadym Samoilenko	d0ad8e67be	Fix backfill: use accumulated conversation context for prompt estimation Old logic used output text length as a proxy for prompt tokens — completely wrong. Real Gemini calls send the full conversation history as context, so prompt grows with every turn. New logic: - completion_tokens = len(response_text) / 3.8 (what was generated) - prompt_tokens = base_template + sum(all_prior_messages_in_fg) / 3.8 - persona_response base: 1500 tok (template + persona details + topic) - moderator base: 1200 tok (moderator template + fg context) - persona_generate base: 2500 tok (persona-detailed-generation.md template) Also: - Sorts messages chronologically per focus group before processing - Accumulates context correctly so turn N includes turns 0..N-1 as context - Idempotency via pre-fetched set instead of per-doc find_one queries - cost_usd breakdown now has correct input/output split (not 40/60 guess) - Dry-run prints per-focus-group cost estimates for sanity checking Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 19:11:01 +01:00
Vadym Samoilenko	57508e8e55	Add period selector to all cost-bearing admin tabs - New usePeriod hook (day/week/month/all/custom presets) with from/to ISO string outputs - New PeriodSelector component (button group + custom date inputs) - UsersTab, UsageTab, FocusGroupsTab all wired up with period state - Backend /admin/users and /admin/focus-groups now accept from/to query params - MTD Cost column header now reflects selected period label (e.g. "Cost (MTD)") - Logout clears local state only (no account sign-out) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 19:03:16 +01:00
Vadym Samoilenko	d7ee22e557	Fix backfill pricing: read from model_pricing collection + --delete-existing-estimates flag	2026-04-24 18:57:25 +01:00
Vadym Samoilenko	66c8e1762e	Fix backfill: handle list-type persona fields	2026-04-24 18:53:41 +01:00
Vadym Samoilenko	539c5eaaee	Fix backfill script: use focus_group_messages collection + correct field names	2026-04-24 18:49:59 +01:00
Vadym Samoilenko	bc4138f332	Final pieces: decorators on LLM routes, usage self-service, billing page, WS events Backend: - @active_required + @with_user_context applied to all LLM-invoking routes in personas.py, focus_group_ai.py, ai_personas.py - backend/app/routes/usage.py: GET /api/usage/me (MTD summary by feature), GET /api/usage/focus-groups/<id> (owner or admin) - Registered usage_bp in app/__init__.py - llm_service._record_usage now emits usage_update WS event to focus group room Frontend: - useMyUsage + useFocusGroupUsage hooks - MyUsage.tsx: personal billing dashboard (cost cards + per-feature table) - /billing route (ProtectedRoute) + Billing nav link - FocusGroupSession: quota_warning amber banner with Progress bar, quota_exceeded + quota_warning WS events wired via websocketServiceNew Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 18:43:13 +01:00
Vadym Samoilenko	915c81b8f1	Complete phases D–G: quota enforcement, token invalidation, admin writes, backfill Backend: - token_version in JWT (bump_token_version, get_token_version on User model); jwt_required checks tv claim → 401 on mismatch; login routes embed version - Quota pre-flight in all 3 LLM public methods (QuotaExceededError bubbles up) - AI runner catches QuotaExceededError → sets status paused_quota + emits WS event - Admin routes: POST /users (create), POST /users/<id>/reset-password, POST /pricing, GET /focus-groups with aggregated cost; PUT /users/<id> now bumps token_version on disable or role change - backfill_usage.py: idempotent estimated-event generator for historical data, tiktoken for GPT models, char/3.8 for Gemini, --dry-run flag Frontend: - 402 interceptor dispatches quota_exceeded CustomEvent - adminApi: createUser, resetPassword, createPricing, listFocusGroups - UsersTab: New User dialog + Reset Password in edit dialog - PricingTab: New Price dialog (model, provider, input/output/cached prices) - FocusGroupsTab: focus groups table sorted by total cost - Admin.tsx: 4th tab (Focus Groups) - FocusGroupSession: admin-only cost badge + dismissable quota exceeded banner Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 18:34:48 +01:00
Vadym Samoilenko	015e6cc5cc	Add Phase D admin panel: user management + usage analytics Backend: /api/admin/* blueprint with user CRUD (list, get, update, disable/enable), usage summary aggregation (group by user/model/feature/ day/focus_group), usage event drill-down, and pricing list. Fixed admin_required decorator (async-safe). Added find_all/count/update helpers to User model. Frontend: /admin page (AdminRoute guard, 3 tabs) — Users table with search/filter/edit dialog, Usage tab with KPI cards + bar chart + events table, Pricing tab showing active model rows with tier details. Admin nav link visible only to admin role. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 18:26:05 +01:00
Vadym Samoilenko	3e9ccafad2	Add LLM usage tracking infrastructure (Phases A-C) - Model renames: gpt-5.2 → gpt-5.4-2026-03-05, gemini-3-pro-preview → gemini-3.1-pro-preview; retire gpt-4.1 via alias fallback - New: llm_usage_context.py (ContextVar-based attribution), model_pricing.py (tiered pricing + 60s cache), usage_event.py (append-only telemetry), quota.py (user/FG quota enforcement with 80% warning) - Wire _record_usage into all 3 LLM methods; set_llm_context at every service entry point - Fix admin_required decorator (was sync, never awaited User.find_by_id); add active_required and with_user_context decorators - Inject user_id into ContextVar from JWT on every authenticated request - Add DB indexes for usage_events, model_pricing, users collections - Seed script for model pricing (gpt-5.4 single-tier, gemini-3.1 two-tier 200k threshold) - Fix parse_json_response NameError (logger undefined at module level) - 70 passing tests: conftest.py with sys.modules stubs, test_usage_infrastructure.py (52 tests), rewrite stale test_llm_service.py (18 tests) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 18:08:27 +01:00
Vadym Samoilenko	2e85fc1acc	Fix root cause: naive vs aware datetime crash + stuck AI mode indicator The autonomous loop was crashing on every decision with: TypeError: can't subtract offset-naive and offset-aware datetimes because MongoDB stores created_at without timezone info but the code compared it against datetime.now(timezone.utc). - conversation_context_service: make created_at timezone-aware before subtraction (replace tzinfo=utc when naive) - DiscussionPanel: fix sync effect — when server reports AI mode is inactive, always clear localAiModeActive regardless of its value, so the "AI is generating..." spinner doesn't get stuck when the backend fails/stops before the frontend has confirmed AI mode started Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 19:30:04 +00:00
Vadym Samoilenko	7e72d07329	Fix AI loop hanging: add asyncio.wait_for timeouts on LLM calls The autonomous conversation loop could hang indefinitely because self.response_timeout=30 was defined but never used in wait_for(). - autonomous_conversation_controller: wrap generate_persona_response() with asyncio.wait_for(timeout=120s); 30s was too short for production LLMs, raised to 120s; TimeoutError returns an error dict so the loop can continue or count toward consecutive_silence limit - conversation_decision_service: add asyncio.wait_for(timeout=60s) around LLMService.generate_content() for the decision call; add asyncio import and explicit TimeoutError handling Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 19:17:36 +00:00
Vadym Samoilenko	283b31e786	Fix AI mode: race condition, split-brain UI, and stuck local state - Backend: set status to ai_mode in the route handler before submitting to AI runner, eliminating the race condition where frontend's immediate status poll read the old status - Frontend: replace all raw isAiModeActive prop usages with effectiveAiModeActive in DiscussionPanel (13 locations) so ReasoningPanel, status text, loading indicator, and manual/AI controls all reflect the correct state instantly on Start AI Mode click - Frontend: add useEffect to sync localAiModeActive back to null once the parent prop catches up, preventing permanent override after natural session end - These fixes also unblock the 3-second AI message polling which was never activating due to isAiModeActive staying false Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 19:02:15 +00:00
Vadym Samoilenko	b4978989a5	Fix AI autonomous mode: cross-loop WebSocket emit + polling fallback The AI Runner runs on a dedicated background thread with its own asyncio event loop. When it emitted WebSocket events via sio.emit(), the call happened on the wrong loop (AI Runner's vs ASGI/Quart's), causing silent failures — messages were saved to MongoDB but never reached the frontend. Additionally, the frontend HTTP polling fallback was never enabled when WebSocket appeared connected, leaving no way to discover missed messages. - websocket_manager_async.py: store ASGI main loop reference; detect cross-loop calls in emit_to_focus_group and use run_coroutine_threadsafe to schedule emits on the correct loop - __init__.py: register the ASGI event loop with the WebSocket manager in before_serving hook - FocusGroupSession.tsx: always poll fetchMessages every 3s during AI mode as a reliability fallback regardless of WebSocket status Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 18:22:24 +00:00
Vadym Samoilenko	33272cc677	Allow document uploads (PDF, DOCX, TXT, etc.) as focus group assets - Expand allowed file types from images-only to also include: PDF, DOCX, DOC, TXT, MD, CSV, XLSX, XLS, PPTX, PPT, RTF - validate_asset_file: skip PIL validation for non-image files; 50MB limit for docs / 10MB for images - Correct MIME type detection for document extensions - Store asset_type: "document"\|"image" in metadata - ImageDescriptionService: text files → LLM summary; binary docs → label; images → existing multimodal flow Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 17:08:30 +00:00
Vadym Samoilenko	4b47b334d7	Fix data isolation + conversation/decision 500 errors Data isolation: - GET /tasks/<id>: verify requesting user owns the task (403 if not) - DELETE /tasks/<id>: same ownership check - GET /tasks/status: add @jwt_required() - GET /personas/<id>: add ownership check (403 if created_by != user) - GET /focus-groups/<id>: add ownership check - GET /focus-groups/<id>/messages: add ownership check - POST/DELETE /focus-groups/<id>/participants: add ownership check Fix conversation/decision 500: - Convert POST /conversation/decision to async 202+background (was synchronous LLM → timed out / LLM errors → 500) - Frontend polls waitForTaskResult for decision result before calling generateResponseAsync - GET /conversation/insights: return empty insights (200) on LLM error instead of 500 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 17:02:10 +00:00
Vadym Samoilenko	1b387daacf	Migrate task result delivery from WebSocket to HTTP polling Backend: - task_manager.py: add result/error/completed_at storage, TTL sweeper (5min), store_task_result() helper - tasks.py: add GET /<task_id> endpoint returning stored result; cancel route stores 'cancelled' status - __init__.py: start TTL sweeper on app startup - All 8 bg functions: store result before emitting lightweight WS hint (no payload data) Frontend: - src/lib/taskPolling.ts: waitForTaskResult() — polls GET /tasks/{id} every 2s, WS hint triggers immediate poll, 5min timeout - src/hooks/useTaskPolling.ts: drop-in replacement for useCancellableGeneration using polling - Migrate 6 Promise-based WS listeners → waitForTaskResult() in DiscussionPanel, FocusGroupSession (×2), PersonaProfile, PersonaModificationModal, useDiscussionGuideGeneration - Migrate 3 hook-based consumers → useTaskPolling in AIRecruiter, SyntheticUsers, BulkExportProgressModal Fixes WS Promise leak: polling survives disconnects, background tabs, page reloads. WS events retained as zero-payload hints for near-zero latency when connected. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 16:46:58 +00:00
Vadym Samoilenko	c7034634e3	Fix all async LLM routes: bypass GCP 30s load balancer timeout Convert 6 synchronous LLM routes to async 202+WebSocket pattern: - generate-response (focus_group_ai): persona chat response - generate-key-themes (focus_group_ai): discussion analysis - modify-with-ai (personas): AI persona modification - export-profile (personas): markdown profile export - describe-asset (focus_groups): image AI description Each route now returns 202 + task_id immediately, runs LLM in asyncio background task, delivers result via WebSocket task_completed event. Frontend listeners updated to wait for ws:task_completed instead of HTTP response body. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 15:56:54 +00:00
Vadym Samoilenko	f4a587c4f7	Fix 500: add current_app import to focus_groups route Missing import caused NameError when starting background discussion guide generation task. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 15:35:29 +00:00
Vadym Samoilenko	d8a5d6643f	Fix discussion guide 504: async flow + WebSocket delivery - Backend: /generate-discussion-guide now returns task_id immediately (202) and runs generation as a background asyncio task, delivering the guide via WebSocket task_completed event (bypasses GCP LB 30s timeout) - Frontend: useDiscussionGuideGeneration awaits ws:task_completed event to resolve the guide Promise instead of waiting on the HTTP response Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 15:32:08 +00:00
Vadym Samoilenko	6917518d11	Fix NameError: _fg_logger undefined in update_focus_group route _fg_logger was used but never defined, causing a NameError on every PUT /focus-groups/:id request that included llm_model (i.e. all autosave and handleSubmit updates) — resulting in a 500 Internal Server Error. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 15:26:08 +00:00
Vadym Samoilenko	f359157949	Fix focus group create: 500 on update + 400 on autosave - FocusGroup.update: use matched_count > 0 instead of modified_count > 0 so updates succeed even when data is unchanged (was returning 500) - useFocusGroupAutoSave: skip save if name is empty (not all-fields-empty) preventing 400 Bad Request when autosave fires before name is filled Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 15:20:40 +00:00
Vadym Samoilenko	f60d86e8cb	Fix task_completed WebSocket payload too large Don't send full persona objects in WS event — only send counts. Frontend navigates to list page where personas load from API. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 14:11:14 +00:00
Vadym Samoilenko	aa4090888d	Fix persona generation 504: async flow + remove debug logging - Backend: /generate-personas-full now returns task_id immediately (202) and runs generation as a background asyncio task, delivering results via WebSocket task_completed event (bypasses GCP LB 30s timeout) - Frontend: AIRecruiter listens for ws:task_completed to process personas instead of awaiting the long HTTP response - Remove 53 debug console.log calls from websocketServiceNew.ts including session_id exposure and a self-test emit that was firing fake events - Remove debug logs from WebSocketContextNew, AIRecruiter, personaGenerator Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 14:00:07 +00:00
Vadym Samoilenko	c00728f375	Fix Gemini LLM AssertionError: force httpx transport over aiohttp google-genai SDK uses aiohttp when it's available in the environment (installed via llama-index-core), causing AssertionError (connector is None) on async requests. Pass httpx_async_client in HttpOptions to bypass aiohttp. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 13:05:27 +00:00
Vadym Samoilenko	7f0df54de3	Fix domain typo: oliver.solution → oliver.solutions across all files Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 13:40:00 +00:00
Vadym Samoilenko	cf698e1e87	Remove create_default_user call from run.py (method removed in security remediation)	2026-03-20 13:34:26 +00:00
Vadym Samoilenko	05d7ea68e2	Fix make_serializable import — move to utils/__init__.py (was shadowed by utils/ package)	2026-03-20 13:33:11 +00:00
Vadym Samoilenko	d1788a4017	Fix .dockerignore — exclude *.txt but keep requirements.txt	2026-03-20 13:28:32 +00:00
Vadym Samoilenko	4a6b4d6fe0	Dockerize backend — replace systemd service with docker-compose - Add backend/Dockerfile (python:3.12-slim) - Add docker-compose.yml (backend :5137 + mongo:7) - Add backend/.dockerignore - Rewrite deploy.sh: build frontend locally, rsync dist/, docker compose up --build - Remove semblance.service (no longer needed) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 13:21:34 +00:00
Vadym Samoilenko	bb4dca0fe8	Update production URL to optical-dev.oliver.solution Replace ai-sandbox.oliver.solutions with optical-dev.oliver.solution across all config, env, docs, and source files. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 12:55:45 +00:00
Vadym Samoilenko	3e1865edbd	Apply Jintech security audit remediation (sprint 3) — 87/92 findings fixed - Fix missing await on FocusGroup.get_messages() (N-L1) - Replace time.sleep with asyncio.sleep in key_theme_service and focus_group_service (N-P10) - Replace flask import with quart in focus_groups.py (N-S3) - Add logger.error before all 500 returns in focus_groups.py (N-P6) - Add logging to silent except blocks across routes (N-M10, N-M11) - Add @rate_limit to 6 remaining AI endpoints (N-H4) - Add --confirm flag to populate scripts before delete_many (S-H2) - Remove hardcoded Azure ID fallbacks from msal_service.py and msalConfig.ts (A-M2, F-H4) - Centralize make_serializable() in utils.py, remove duplicates from 3 route files (N-P7) - Replace all datetime.utcnow() with datetime.now(timezone.utc) across entire backend (M-L2) - AuthContext.tsx: only mark token validated on 200 success, not on non-401 errors (F-H2) - Rename authType → auth_type in auth.py (N-S4) - Add security_report.md and security_report.pdf with full 92-finding status Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 12:51:18 +00:00
michael	c7ff1755ee	Add architecture document generator and PDF Create comprehensive technical architecture document (PDF) with 11 chapters covering system architecture, frontend/backend design, data model, auth, WebSocket communication, LLM integration, and core feature flows. Includes 11 Mermaid diagrams rendered as PNGs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 11:39:50 -06:00
michael	b1be8f8c38	Add model alias for legacy gpt-5 database entries Focus groups created before the gpt-5.2 rename have llm_model='gpt-5' stored in MongoDB. Without an alias, the backend falls through to the Gemini provider and fails with an aiohttp AssertionError. Adds MODEL_ALIASES mapping and _resolve_model() helper so gpt-5 is transparently resolved to gpt-5.2. Also updates all llm_model checks to accept both values. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 12:06:10 -06:00
michael	b0445de18b	Update GPT-5 to GPT-5.2 and lower default reasoning effort to low Swap model ID from gpt-5 to gpt-5.2 across all backend services, frontend components, and documentation. Change default reasoning effort from medium to low for faster responses. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 11:24:01 -06:00
michael	d4ce5d99bc	Fix audience_brief and research_objective dropped in Stage 2 persona generation Stage 2 (detailed persona generation) was ignoring the audience brief and research objective, causing the LLM to guess research context from demographics alone. Now passes both values through to generate_persona() in all three endpoints (generate-personas-full, complete-and-save-persona, complete-persona) and auto-generates prompt customization via customize_persona_prompt() when they are provided. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 08:40:17 -06:00
michael	850cb25067	Fix GPT-5 Responses API crash on reasoning items with None content Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 08:05:52 -06:00
michael	1708bd75a4	Remove SDK version logging on every Gemini call 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 14:30:42 -06:00
michael	7f4f659501	Bump google-genai to >=1.56.0 to fix aiohttp AssertionError Version 1.52.0 has a known bug where aiohttp connector is None. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 14:07:38 -06:00
michael	d79c202e8f	Fix NameError: use print instead of logger in get_gemini_client logger is not defined at module level where get_gemini_client() lives. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 13:59:30 -06:00
michael	c9848210bb	Add full traceback and SDK version logging for AssertionError debug This will help identify where exactly the AssertionError is occurring in the google-genai SDK and what version is installed on the server. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 13:49:47 -06:00
michael	5a825934f8	Add verbose exception debugging for empty error messages Log full exception details: type, module, str, repr, args, and __dict__ to diagnose why Gemini errors are producing empty messages. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 13:39:15 -06:00
michael	b50c0fa2a4	Fix empty error messages from Google GenAI SDK Catch genai_errors.APIError specifically and extract e.code and e.message attributes for proper error logging. The generic str(e) was returning empty strings for Google API errors, making debugging impossible. - Import google.genai.errors for specific exception handling - Add APIError catch before generic Exception in generate_content() - Add APIError catch before generic Exception in generate_contextual_response() - Properly categorize errors by HTTP code for retry logic (429/500+ retryable) - Fix time.sleep to await asyncio.sleep in contextual response handler 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 13:26:55 -06:00
michael	6ee80e67aa	Create fresh LLM clients per call instead of caching The previous event loop tracking approach still caused issues - when replacing a cached client, its garbage collection triggers aclose() which tries to close the aiohttp session on the wrong event loop. Simplest fix: create a fresh client for each call. The overhead is minimal compared to the actual LLM API call, and this completely avoids all event loop mismatch issues in ASGI environments. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-19 16:56:36 -06:00

1 2

83 commits