semblance

Author	SHA1	Message	Date
Vadym Samoilenko	7b6a7c7347	Fix admin filters: ISO Z parsing crash + All time period returning month data Two bugs caused filters to show 0 and period selector to have no effect: 1. Python < 3.11 can't parse JS toISOString() Z suffix — every request with a period filter threw ValueError → 500 → frontend received no data. Fixed with _parse_iso() helper that replaces Z with +00:00 before fromisoformat(). 2. 'All time' sends no from/to params, but backend defaulted to _month_start() instead of omitting the ts filter. Fixed with _period_match() helper that returns {} (no filter) when both from and to are absent. Also: stale _user_mtd_cost reference in get_user route replaced with _user_period_cost(user_id, None, None); adminApi types updated with from/to. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 19:17:57 +01:00
Vadym Samoilenko	ad619d45fc	Improve live token extraction: warn on missing usage_metadata, capture thinking tokens - Add WARNING log when usage_metadata/usage is None so zero-cost events are visible in logs instead of silently disappearing - Capture thoughts_token_count from Gemini thinking models into reasoning field (already included in candidates_token_count for billing, now also tracked separately) - Add same warning for OpenAI missing usage object Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 19:13:39 +01:00
Vadym Samoilenko	d0ad8e67be	Fix backfill: use accumulated conversation context for prompt estimation Old logic used output text length as a proxy for prompt tokens — completely wrong. Real Gemini calls send the full conversation history as context, so prompt grows with every turn. New logic: - completion_tokens = len(response_text) / 3.8 (what was generated) - prompt_tokens = base_template + sum(all_prior_messages_in_fg) / 3.8 - persona_response base: 1500 tok (template + persona details + topic) - moderator base: 1200 tok (moderator template + fg context) - persona_generate base: 2500 tok (persona-detailed-generation.md template) Also: - Sorts messages chronologically per focus group before processing - Accumulates context correctly so turn N includes turns 0..N-1 as context - Idempotency via pre-fetched set instead of per-doc find_one queries - cost_usd breakdown now has correct input/output split (not 40/60 guess) - Dry-run prints per-focus-group cost estimates for sanity checking Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 19:11:01 +01:00
Vadym Samoilenko	57508e8e55	Add period selector to all cost-bearing admin tabs - New usePeriod hook (day/week/month/all/custom presets) with from/to ISO string outputs - New PeriodSelector component (button group + custom date inputs) - UsersTab, UsageTab, FocusGroupsTab all wired up with period state - Backend /admin/users and /admin/focus-groups now accept from/to query params - MTD Cost column header now reflects selected period label (e.g. "Cost (MTD)") - Logout clears local state only (no account sign-out) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 19:03:16 +01:00
Vadym Samoilenko	d7ee22e557	Fix backfill pricing: read from model_pricing collection + --delete-existing-estimates flag	2026-04-24 18:57:25 +01:00
Vadym Samoilenko	66c8e1762e	Fix backfill: handle list-type persona fields	2026-04-24 18:53:41 +01:00
Vadym Samoilenko	539c5eaaee	Fix backfill script: use focus_group_messages collection + correct field names	2026-04-24 18:49:59 +01:00
Vadym Samoilenko	bc4138f332	Final pieces: decorators on LLM routes, usage self-service, billing page, WS events Backend: - @active_required + @with_user_context applied to all LLM-invoking routes in personas.py, focus_group_ai.py, ai_personas.py - backend/app/routes/usage.py: GET /api/usage/me (MTD summary by feature), GET /api/usage/focus-groups/<id> (owner or admin) - Registered usage_bp in app/__init__.py - llm_service._record_usage now emits usage_update WS event to focus group room Frontend: - useMyUsage + useFocusGroupUsage hooks - MyUsage.tsx: personal billing dashboard (cost cards + per-feature table) - /billing route (ProtectedRoute) + Billing nav link - FocusGroupSession: quota_warning amber banner with Progress bar, quota_exceeded + quota_warning WS events wired via websocketServiceNew Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 18:43:13 +01:00
Vadym Samoilenko	915c81b8f1	Complete phases D–G: quota enforcement, token invalidation, admin writes, backfill Backend: - token_version in JWT (bump_token_version, get_token_version on User model); jwt_required checks tv claim → 401 on mismatch; login routes embed version - Quota pre-flight in all 3 LLM public methods (QuotaExceededError bubbles up) - AI runner catches QuotaExceededError → sets status paused_quota + emits WS event - Admin routes: POST /users (create), POST /users/<id>/reset-password, POST /pricing, GET /focus-groups with aggregated cost; PUT /users/<id> now bumps token_version on disable or role change - backfill_usage.py: idempotent estimated-event generator for historical data, tiktoken for GPT models, char/3.8 for Gemini, --dry-run flag Frontend: - 402 interceptor dispatches quota_exceeded CustomEvent - adminApi: createUser, resetPassword, createPricing, listFocusGroups - UsersTab: New User dialog + Reset Password in edit dialog - PricingTab: New Price dialog (model, provider, input/output/cached prices) - FocusGroupsTab: focus groups table sorted by total cost - Admin.tsx: 4th tab (Focus Groups) - FocusGroupSession: admin-only cost badge + dismissable quota exceeded banner Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 18:34:48 +01:00
Vadym Samoilenko	015e6cc5cc	Add Phase D admin panel: user management + usage analytics Backend: /api/admin/* blueprint with user CRUD (list, get, update, disable/enable), usage summary aggregation (group by user/model/feature/ day/focus_group), usage event drill-down, and pricing list. Fixed admin_required decorator (async-safe). Added find_all/count/update helpers to User model. Frontend: /admin page (AdminRoute guard, 3 tabs) — Users table with search/filter/edit dialog, Usage tab with KPI cards + bar chart + events table, Pricing tab showing active model rows with tier details. Admin nav link visible only to admin role. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 18:26:05 +01:00
Vadym Samoilenko	3e9ccafad2	Add LLM usage tracking infrastructure (Phases A-C) - Model renames: gpt-5.2 → gpt-5.4-2026-03-05, gemini-3-pro-preview → gemini-3.1-pro-preview; retire gpt-4.1 via alias fallback - New: llm_usage_context.py (ContextVar-based attribution), model_pricing.py (tiered pricing + 60s cache), usage_event.py (append-only telemetry), quota.py (user/FG quota enforcement with 80% warning) - Wire _record_usage into all 3 LLM methods; set_llm_context at every service entry point - Fix admin_required decorator (was sync, never awaited User.find_by_id); add active_required and with_user_context decorators - Inject user_id into ContextVar from JWT on every authenticated request - Add DB indexes for usage_events, model_pricing, users collections - Seed script for model pricing (gpt-5.4 single-tier, gemini-3.1 two-tier 200k threshold) - Fix parse_json_response NameError (logger undefined at module level) - 70 passing tests: conftest.py with sys.modules stubs, test_usage_infrastructure.py (52 tests), rewrite stale test_llm_service.py (18 tests) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 18:08:27 +01:00
Vadym Samoilenko	2e85fc1acc	Fix root cause: naive vs aware datetime crash + stuck AI mode indicator The autonomous loop was crashing on every decision with: TypeError: can't subtract offset-naive and offset-aware datetimes because MongoDB stores created_at without timezone info but the code compared it against datetime.now(timezone.utc). - conversation_context_service: make created_at timezone-aware before subtraction (replace tzinfo=utc when naive) - DiscussionPanel: fix sync effect — when server reports AI mode is inactive, always clear localAiModeActive regardless of its value, so the "AI is generating..." spinner doesn't get stuck when the backend fails/stops before the frontend has confirmed AI mode started Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 19:30:04 +00:00
Vadym Samoilenko	7e72d07329	Fix AI loop hanging: add asyncio.wait_for timeouts on LLM calls The autonomous conversation loop could hang indefinitely because self.response_timeout=30 was defined but never used in wait_for(). - autonomous_conversation_controller: wrap generate_persona_response() with asyncio.wait_for(timeout=120s); 30s was too short for production LLMs, raised to 120s; TimeoutError returns an error dict so the loop can continue or count toward consecutive_silence limit - conversation_decision_service: add asyncio.wait_for(timeout=60s) around LLMService.generate_content() for the decision call; add asyncio import and explicit TimeoutError handling Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 19:17:36 +00:00
Vadym Samoilenko	283b31e786	Fix AI mode: race condition, split-brain UI, and stuck local state - Backend: set status to ai_mode in the route handler before submitting to AI runner, eliminating the race condition where frontend's immediate status poll read the old status - Frontend: replace all raw isAiModeActive prop usages with effectiveAiModeActive in DiscussionPanel (13 locations) so ReasoningPanel, status text, loading indicator, and manual/AI controls all reflect the correct state instantly on Start AI Mode click - Frontend: add useEffect to sync localAiModeActive back to null once the parent prop catches up, preventing permanent override after natural session end - These fixes also unblock the 3-second AI message polling which was never activating due to isAiModeActive staying false Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 19:02:15 +00:00
Vadym Samoilenko	b4978989a5	Fix AI autonomous mode: cross-loop WebSocket emit + polling fallback The AI Runner runs on a dedicated background thread with its own asyncio event loop. When it emitted WebSocket events via sio.emit(), the call happened on the wrong loop (AI Runner's vs ASGI/Quart's), causing silent failures — messages were saved to MongoDB but never reached the frontend. Additionally, the frontend HTTP polling fallback was never enabled when WebSocket appeared connected, leaving no way to discover missed messages. - websocket_manager_async.py: store ASGI main loop reference; detect cross-loop calls in emit_to_focus_group and use run_coroutine_threadsafe to schedule emits on the correct loop - __init__.py: register the ASGI event loop with the WebSocket manager in before_serving hook - FocusGroupSession.tsx: always poll fetchMessages every 3s during AI mode as a reliability fallback regardless of WebSocket status Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 18:22:24 +00:00
Vadym Samoilenko	33272cc677	Allow document uploads (PDF, DOCX, TXT, etc.) as focus group assets - Expand allowed file types from images-only to also include: PDF, DOCX, DOC, TXT, MD, CSV, XLSX, XLS, PPTX, PPT, RTF - validate_asset_file: skip PIL validation for non-image files; 50MB limit for docs / 10MB for images - Correct MIME type detection for document extensions - Store asset_type: "document"|"image" in metadata - ImageDescriptionService: text files → LLM summary; binary docs → label; images → existing multimodal flow Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 17:08:30 +00:00
Vadym Samoilenko	4b47b334d7	Fix data isolation + conversation/decision 500 errors Data isolation: - GET /tasks/<id>: verify requesting user owns the task (403 if not) - DELETE /tasks/<id>: same ownership check - GET /tasks/status: add @jwt_required() - GET /personas/<id>: add ownership check (403 if created_by != user) - GET /focus-groups/<id>: add ownership check - GET /focus-groups/<id>/messages: add ownership check - POST/DELETE /focus-groups/<id>/participants: add ownership check Fix conversation/decision 500: - Convert POST /conversation/decision to async 202+background (was synchronous LLM → timed out / LLM errors → 500) - Frontend polls waitForTaskResult for decision result before calling generateResponseAsync - GET /conversation/insights: return empty insights (200) on LLM error instead of 500 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 17:02:10 +00:00
Vadym Samoilenko	1b387daacf	Migrate task result delivery from WebSocket to HTTP polling Backend: - task_manager.py: add result/error/completed_at storage, TTL sweeper (5min), store_task_result() helper - tasks.py: add GET /<task_id> endpoint returning stored result; cancel route stores 'cancelled' status - __init__.py: start TTL sweeper on app startup - All 8 bg functions: store result before emitting lightweight WS hint (no payload data) Frontend: - src/lib/taskPolling.ts: waitForTaskResult() — polls GET /tasks/{id} every 2s, WS hint triggers immediate poll, 5min timeout - src/hooks/useTaskPolling.ts: drop-in replacement for useCancellableGeneration using polling - Migrate 6 Promise-based WS listeners → waitForTaskResult() in DiscussionPanel, FocusGroupSession (×2), PersonaProfile, PersonaModificationModal, useDiscussionGuideGeneration - Migrate 3 hook-based consumers → useTaskPolling in AIRecruiter, SyntheticUsers, BulkExportProgressModal Fixes WS Promise leak: polling survives disconnects, background tabs, page reloads. WS events retained as zero-payload hints for near-zero latency when connected. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 16:46:58 +00:00
Vadym Samoilenko	c7034634e3	Fix all async LLM routes: bypass GCP 30s load balancer timeout Convert 6 synchronous LLM routes to async 202+WebSocket pattern: - generate-response (focus_group_ai): persona chat response - generate-key-themes (focus_group_ai): discussion analysis - modify-with-ai (personas): AI persona modification - export-profile (personas): markdown profile export - describe-asset (focus_groups): image AI description Each route now returns 202 + task_id immediately, runs LLM in asyncio background task, delivers result via WebSocket task_completed event. Frontend listeners updated to wait for ws:task_completed instead of HTTP response body. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 15:56:54 +00:00
Vadym Samoilenko	f4a587c4f7	Fix 500: add current_app import to focus_groups route Missing import caused NameError when starting background discussion guide generation task. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 15:35:29 +00:00
Vadym Samoilenko	d8a5d6643f	Fix discussion guide 504: async flow + WebSocket delivery - Backend: /generate-discussion-guide now returns task_id immediately (202) and runs generation as a background asyncio task, delivering the guide via WebSocket task_completed event (bypasses GCP LB 30s timeout) - Frontend: useDiscussionGuideGeneration awaits ws:task_completed event to resolve the guide Promise instead of waiting on the HTTP response Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 15:32:08 +00:00
Vadym Samoilenko	6917518d11	Fix NameError: _fg_logger undefined in update_focus_group route _fg_logger was used but never defined, causing a NameError on every PUT /focus-groups/:id request that included llm_model (i.e. all autosave and handleSubmit updates) — resulting in a 500 Internal Server Error. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 15:26:08 +00:00
Vadym Samoilenko	f359157949	Fix focus group create: 500 on update + 400 on autosave - FocusGroup.update: use matched_count > 0 instead of modified_count > 0 so updates succeed even when data is unchanged (was returning 500) - useFocusGroupAutoSave: skip save if name is empty (not all-fields-empty) preventing 400 Bad Request when autosave fires before name is filled Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 15:20:40 +00:00
Vadym Samoilenko	f60d86e8cb	Fix task_completed WebSocket payload too large Don't send full persona objects in WS event — only send counts. Frontend navigates to list page where personas load from API. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 14:11:14 +00:00
Vadym Samoilenko	aa4090888d	Fix persona generation 504: async flow + remove debug logging - Backend: /generate-personas-full now returns task_id immediately (202) and runs generation as a background asyncio task, delivering results via WebSocket task_completed event (bypasses GCP LB 30s timeout) - Frontend: AIRecruiter listens for ws:task_completed to process personas instead of awaiting the long HTTP response - Remove 53 debug console.log calls from websocketServiceNew.ts including session_id exposure and a self-test emit that was firing fake events - Remove debug logs from WebSocketContextNew, AIRecruiter, personaGenerator Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 14:00:07 +00:00
Vadym Samoilenko	c00728f375	Fix Gemini LLM AssertionError: force httpx transport over aiohttp google-genai SDK uses aiohttp when it's available in the environment (installed via llama-index-core), causing AssertionError (connector is None) on async requests. Pass httpx_async_client in HttpOptions to bypass aiohttp. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 13:05:27 +00:00
Vadym Samoilenko	7f0df54de3	Fix domain typo: oliver.solution → oliver.solutions across all files Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 13:40:00 +00:00
Vadym Samoilenko	cf698e1e87	Remove create_default_user call from run.py (method removed in security remediation)	2026-03-20 13:34:26 +00:00
Vadym Samoilenko	05d7ea68e2	Fix make_serializable import — move to utils/__init__.py (was shadowed by utils/ package)	2026-03-20 13:33:11 +00:00
Vadym Samoilenko	d1788a4017	Fix .dockerignore — exclude *.txt but keep requirements.txt	2026-03-20 13:28:32 +00:00
Vadym Samoilenko	4a6b4d6fe0	Dockerize backend — replace systemd service with docker-compose - Add backend/Dockerfile (python:3.12-slim) - Add docker-compose.yml (backend :5137 + mongo:7) - Add backend/.dockerignore - Rewrite deploy.sh: build frontend locally, rsync dist/, docker compose up --build - Remove semblance.service (no longer needed) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 13:21:34 +00:00
Vadym Samoilenko	bb4dca0fe8	Update production URL to optical-dev.oliver.solution Replace ai-sandbox.oliver.solutions with optical-dev.oliver.solution across all config, env, docs, and source files. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 12:55:45 +00:00
Vadym Samoilenko	3e1865edbd	Apply Jintech security audit remediation (sprint 3) — 87/92 findings fixed - Fix missing await on FocusGroup.get_messages() (N-L1) - Replace time.sleep with asyncio.sleep in key_theme_service and focus_group_service (N-P10) - Replace flask import with quart in focus_groups.py (N-S3) - Add logger.error before all 500 returns in focus_groups.py (N-P6) - Add logging to silent except blocks across routes (N-M10, N-M11) - Add @rate_limit to 6 remaining AI endpoints (N-H4) - Add --confirm flag to populate scripts before delete_many (S-H2) - Remove hardcoded Azure ID fallbacks from msal_service.py and msalConfig.ts (A-M2, F-H4) - Centralize make_serializable() in utils.py, remove duplicates from 3 route files (N-P7) - Replace all datetime.utcnow() with datetime.now(timezone.utc) across entire backend (M-L2) - AuthContext.tsx: only mark token validated on 200 success, not on non-401 errors (F-H2) - Rename authType → auth_type in auth.py (N-S4) - Add security_report.md and security_report.pdf with full 92-finding status Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 12:51:18 +00:00
michael	c7ff1755ee	Add architecture document generator and PDF Create comprehensive technical architecture document (PDF) with 11 chapters covering system architecture, frontend/backend design, data model, auth, WebSocket communication, LLM integration, and core feature flows. Includes 11 Mermaid diagrams rendered as PNGs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 11:39:50 -06:00
michael	b1be8f8c38	Add model alias for legacy gpt-5 database entries Focus groups created before the gpt-5.2 rename have llm_model='gpt-5' stored in MongoDB. Without an alias, the backend falls through to the Gemini provider and fails with an aiohttp AssertionError. Adds MODEL_ALIASES mapping and _resolve_model() helper so gpt-5 is transparently resolved to gpt-5.2. Also updates all llm_model checks to accept both values. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 12:06:10 -06:00
michael	b0445de18b	Update GPT-5 to GPT-5.2 and lower default reasoning effort to low Swap model ID from gpt-5 to gpt-5.2 across all backend services, frontend components, and documentation. Change default reasoning effort from medium to low for faster responses. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 11:24:01 -06:00
michael	d4ce5d99bc	Fix audience_brief and research_objective dropped in Stage 2 persona generation Stage 2 (detailed persona generation) was ignoring the audience brief and research objective, causing the LLM to guess research context from demographics alone. Now passes both values through to generate_persona() in all three endpoints (generate-personas-full, complete-and-save-persona, complete-persona) and auto-generates prompt customization via customize_persona_prompt() when they are provided. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 08:40:17 -06:00
michael	850cb25067	Fix GPT-5 Responses API crash on reasoning items with None content Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 08:05:52 -06:00
michael	1708bd75a4	Remove SDK version logging on every Gemini call 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 14:30:42 -06:00
michael	7f4f659501	Bump google-genai to >=1.56.0 to fix aiohttp AssertionError Version 1.52.0 has a known bug where aiohttp connector is None. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 14:07:38 -06:00
michael	d79c202e8f	Fix NameError: use print instead of logger in get_gemini_client logger is not defined at module level where get_gemini_client() lives. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 13:59:30 -06:00
michael	c9848210bb	Add full traceback and SDK version logging for AssertionError debug This will help identify where exactly the AssertionError is occurring in the google-genai SDK and what version is installed on the server. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 13:49:47 -06:00
michael	5a825934f8	Add verbose exception debugging for empty error messages Log full exception details: type, module, str, repr, args, and __dict__ to diagnose why Gemini errors are producing empty messages. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 13:39:15 -06:00
michael	b50c0fa2a4	Fix empty error messages from Google GenAI SDK Catch genai_errors.APIError specifically and extract e.code and e.message attributes for proper error logging. The generic str(e) was returning empty strings for Google API errors, making debugging impossible. - Import google.genai.errors for specific exception handling - Add APIError catch before generic Exception in generate_content() - Add APIError catch before generic Exception in generate_contextual_response() - Properly categorize errors by HTTP code for retry logic (429/500+ retryable) - Fix time.sleep to await asyncio.sleep in contextual response handler 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 13:26:55 -06:00
michael	6ee80e67aa	Create fresh LLM clients per call instead of caching The previous event loop tracking approach still caused issues - when replacing a cached client, its garbage collection triggers aclose() which tries to close the aiohttp session on the wrong event loop. Simplest fix: create a fresh client for each call. The overhead is minimal compared to the actual LLM API call, and this completely avoids all event loop mismatch issues in ASGI environments. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-19 16:56:36 -06:00
michael	94f98b837b	Add event loop tracking to LLM client getters The previous lazy initialization fix wasn't sufficient - the genai.Client internally caches async structures bound to the event loop at creation time. With ASGI servers like Hypercorn, subsequent requests may come on different event loop contexts, causing "Future attached to a different loop" errors. Now tracks which event loop the client was created on and recreates it if the loop has changed. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-19 16:46:35 -06:00
michael	36e1752fea	Remove __pycache__ files from git tracking These files are already in .gitignore but were committed previously. Removing them from tracking to prevent future conflicts. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-19 13:28:19 -06:00
Michael Clervi	893b537b67	changed permissions	2025-12-19 19:26:16 +00:00
michael	45c7a52aeb	Fix asyncio event loop mismatch in LLM service The genai.Client and AsyncOpenAI clients were being created at module import time, before the Quart/Hypercorn event loop existed. This caused "Future attached to a different loop" errors when async calls were made, resulting in autonomous focus group conversations stopping with "excessive_silence". Changed to lazy initialization - clients are now created on first use within the running event loop context via get_gemini_client() and get_openai_client() helper functions. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2025-12-19 13:24:23 -06:00
michael	bb16165a83	Remove unused eventlet import and dead emit_with_ack function Legacy code from Flask-SocketIO migration that's no longer used. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-12-09 11:51:23 -06:00

78 commits