modcomms

Oliver/modcomms

Commit graph

Author

SHA1

Message

Date

Vadym Samoilenko

a6fc149788

Replace WebSocket with REST polling to fix GCP LB 30s timeout

POST /api/analyze submits an analysis job and returns job_id instantly.
GET /api/analyze/{job_id} returns progress + result; frontend polls every 2s.

Analysis runs as asyncio.create_task in the background — each HTTP request
completes in milliseconds, well within the 30s GCP Load Balancer limit.
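
The submit-then-poll flow described above can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the repository's actual code: `JOBS`, `run_analysis`, `submit_job`, `get_job` and `poll_until_done` are hypothetical names, and the HTTP layer is left out so that only the `asyncio.create_task` pattern is shown.

```python
import asyncio
import uuid
from typing import Dict, Optional, Set

# Hypothetical in-memory job table: job_id -> status/progress/result.
JOBS: Dict[str, dict] = {}
# Keep strong references so background tasks are not garbage-collected.
_TASKS: Set[asyncio.Task] = set()


async def run_analysis(job_id: str, payload: str) -> None:
    """Stand-in for the real analysis pipeline; updates progress as it goes."""
    job = JOBS[job_id]
    job["status"] = "running"
    try:
        for step in range(1, 4):
            await asyncio.sleep(0.01)  # placeholder for slow work
            job["progress"] = step / 3
        job["result"] = f"analysed {len(payload)} bytes"
        job["status"] = "done"
    except Exception as exc:  # record the failure so pollers can see it
        job["status"] = "failed"
        job["result"] = str(exc)


async def submit_job(payload: str) -> str:
    """What a POST /api/analyze handler might do: schedule, then return at once."""
    job_id = uuid.uuid4().hex
    JOBS[job_id] = {"status": "queued", "progress": 0.0, "result": None}
    task = asyncio.create_task(run_analysis(job_id, payload))
    _TASKS.add(task)
    task.add_done_callback(_TASKS.discard)
    return job_id


def get_job(job_id: str) -> Optional[dict]:
    """What a GET /api/analyze/{job_id} handler might return."""
    return JOBS.get(job_id)


async def poll_until_done(job_id: str, interval: float = 2.0) -> dict:
    """Client-side loop: poll every `interval` seconds until the job finishes."""
    while True:
        job = get_job(job_id)
        if job is None:
            raise KeyError(job_id)
        if job["status"] in ("done", "failed"):
            return job
        await asyncio.sleep(interval)
```

Because `submit_job` only schedules the task, the request that triggers it finishes immediately, and each later poll is an independent short request.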

- Add backend/app/services/job_store.py: in-memory AnalysisJob store with
  30-min TTL cleanup
- Add backend/app/api/analysis_routes.py: POST + GET /api/analyze endpoints
  with full analysis pipeline (hash check, DB persistence, PDF pages, etc.)
- Remove backend/app/websocket/: handlers.py, manager.py, __init__.py
- Update backend/app/main.py: wire analysis_router, store analysis_service
  in app.state, drop all WebSocket imports and endpoint
- Update frontend/services/geminiService.ts: replace WS with fetch+poll;
  function signatures unchanged so App.tsx / WIPReviewer.tsx need no edits
- Remove VITE_BACKEND_WS_URL from vite.config.ts, deploy.sh, .env.deploy.example
- Update cloudrun.yaml: remove WebSocket-specific session affinity annotation
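
An in-memory job store with TTL cleanup, as mentioned in the first bullet, might look something like the sketch below. Only the `AnalysisJob` name and the 30-minute TTL come from the commit message; the fields, the `JobStore` class, and the injectable `clock` parameter are assumptions made for illustration.

```python
import time
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, Optional

TTL_SECONDS = 30 * 60  # 30-minute lifetime, as stated in the commit message


@dataclass
class AnalysisJob:
    # Field names are assumed; the real class may differ.
    job_id: str
    status: str = "queued"
    progress: float = 0.0
    result: Any = None
    created_at: float = field(default_factory=time.monotonic)


class JobStore:
    """Hypothetical per-process store that forgets jobs older than `ttl` seconds."""

    def __init__(self, ttl: float = TTL_SECONDS,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._ttl = ttl
        self._clock = clock  # injectable so expiry can be tested without waiting
        self._jobs: Dict[str, AnalysisJob] = {}

    def create(self, job_id: str) -> AnalysisJob:
        job = AnalysisJob(job_id=job_id, created_at=self._clock())
        self._jobs[job_id] = job
        return job

    def get(self, job_id: str) -> Optional[AnalysisJob]:
        self.cleanup()  # lazy expiry on every read
        return self._jobs.get(job_id)

    def cleanup(self) -> int:
        """Drop jobs older than the TTL and return how many were removed."""
        now = self._clock()
        expired = [jid for jid, job in self._jobs.items()
                   if now - job.created_at > self._ttl]
        for jid in expired:
            del self._jobs[jid]
        return len(expired)
```

A store like this lives in one process only, so a poll that reaches a different instance than the one that accepted the job would not find it.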

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-03-18 15:26:01 +00:00

Vadym Samoilenko

5c338c31fb

Fix WebSocket connection dropped during long proof analysis

- Add 25s heartbeat ping from backend to prevent Apache/proxy idle-timeout
  killing the connection during 1-3 min analysis runs
- Handle heartbeat silently in both analyzeProof and analyzeWIPProof frontend handlers
- Run PDF rasterization via asyncio.to_thread so heartbeats aren't blocked
- Wrap analyze_proof with asyncio.wait_for(timeout=300) for a hard 5-min cap
- Log dropped send_message calls in ConnectionManager instead of swallowing silently
- cloudrun.yaml: add sessionAffinity, startup probe, raise containerConcurrency 4→10,
  document DISABLE_AUTH option
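
The heartbeat, `asyncio.to_thread` and `asyncio.wait_for` items above can be combined as in the following sketch. It is an illustration under stated assumptions rather than the project's handler: `rasterize_pdf`, `heartbeat`, `analyze_with_heartbeat` and the `send` callback are hypothetical, and the real code sends messages over a WebSocket instead.

```python
import asyncio
import contextlib
import time
from typing import Awaitable, Callable, List

HEARTBEAT_SECONDS = 25  # interval from the commit message
ANALYSIS_TIMEOUT = 300  # hard 5-minute cap from the commit message

Send = Callable[[dict], Awaitable[None]]


def rasterize_pdf(pages: int, seconds_per_page: float = 0.0) -> List[str]:
    """Hypothetical blocking step standing in for PDF rasterization."""
    out = []
    for i in range(pages):
        time.sleep(seconds_per_page)  # blocks the calling thread
        out.append(f"page-{i + 1}.png")
    return out


async def heartbeat(send: Send, interval: float) -> None:
    """Send a small ping repeatedly; the caller cancels this when work is done."""
    while True:
        await asyncio.sleep(interval)
        await send({"type": "heartbeat"})


async def analyze_with_heartbeat(send: Send, pages: int, *,
                                 interval: float = HEARTBEAT_SECONDS,
                                 timeout: float = ANALYSIS_TIMEOUT,
                                 seconds_per_page: float = 0.0) -> List[str]:
    pinger = asyncio.create_task(heartbeat(send, interval))
    try:
        # The blocking work runs in a worker thread, so the event loop
        # stays free to send heartbeats; wait_for enforces the time cap.
        images = await asyncio.wait_for(
            asyncio.to_thread(rasterize_pdf, pages, seconds_per_page),
            timeout=timeout,
        )
        await send({"type": "result", "pages": images})
        return images
    finally:
        pinger.cancel()
        with contextlib.suppress(asyncio.CancelledError):
            await pinger
```

Note that when `wait_for` times out, the awaiting coroutine is released but the worker thread itself is not interrupted; it runs until the blocking function returns.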

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-03-18 11:23:59 +00:00

Vadym Samoilenko

82e38e8853

Add gemini-3-flash-preview fallback and Cloud Run service config

gemini_service.py: if the primary model (gemini-3.1-pro-preview) is
unavailable or returns a permission error, all three call sites now
automatically retry with gemini-3-flash-preview before propagating failure.
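
A fallback of this kind could be structured as in the sketch below. The two model names are taken from the commit message; everything else is assumed for illustration, including `ModelUnavailableError`, `generate_with_fallback`, and the `call_model` callable that stands in for whatever the Gemini client actually exposes. No real SDK calls are shown.

```python
from typing import Callable, Optional, Sequence

PRIMARY_MODEL = "gemini-3.1-pro-preview"
FALLBACK_MODEL = "gemini-3-flash-preview"


class ModelUnavailableError(Exception):
    """Hypothetical error for 'model unavailable' or 'permission denied'."""


def generate_with_fallback(
    call_model: Callable[[str, str], str],
    prompt: str,
    models: Sequence[str] = (PRIMARY_MODEL, FALLBACK_MODEL),
) -> str:
    """Try each model in order; re-raise the last error if all of them fail."""
    if not models:
        raise ValueError("at least one model name is required")
    last_error: Optional[ModelUnavailableError] = None
    for model in models:
        try:
            return call_model(model, prompt)
        except ModelUnavailableError as exc:
            last_error = exc  # remember the failure and try the next model
    assert last_error is not None
    raise last_error
```

Only the assumed availability error triggers the retry; any other exception propagates immediately, so genuine bugs are not masked by the fallback.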

cloudrun.yaml: new Cloud Run service definition that ensures stable
WebSocket operation — 10-minute request timeout (vs 60s default),
2 vCPU / 4Gi RAM for PDF rasterisation, min 1 warm instance to prevent
cold-start disconnects, and GEMINI_API_KEY sourced from Secret Manager
so the service can actually reach the Gemini API.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-03-02 11:18:57 +00:00

3 commits