hm_ai_qc_report_tool

Oliver/hm_ai_qc_report_tool

Fork 0

Commit graph

Author	SHA1	Message	Date
nickviljoen	6341714899	Split input/output token tracking; refresh provider pricing table UsageLog now records input_tokens and output_tokens separately and costs each side at its real rate. The old single 'blended' rate underpriced input-heavy workloads (vision/QC) and overpriced output-heavy ones. COST_PER_MILLION_TOKENS rebuilt against the live OpenAI, Gemini and Anthropic pricing pages (GPT-5.4 family, GPT-4.x, o4-mini; Gemini 2.5 Pro/Flash/Flash-Lite + 1.5 legacy; Claude 4.7/4.6/4.5 + 3.x legacy). Unknown models now warn instead of silently defaulting to $5/1M. Adds idempotent ALTER TABLE migration on startup so existing SQLite DBs pick up the new columns. Dashboard + API surface the input/output split. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-22 14:40:13 +02:00
nickviljoen	e910e00edf	Add Usage Dashboard with token tracking, cost estimates, and filters - New UsageLog model tracking every LLM API call (provider, model, tokens, estimated cost, user, module, check name) - Instrument LLMConfig.call_vision_api() to auto-log each call - New /usage tab in nav bar with dashboard showing: - Summary cards (total calls, tokens, estimated cost) - Breakdowns by provider, model, tool, and user - Recent API calls table - Time filters (All Time, 30 Days, 7 Days, Today) - Cost estimates based on per-model token pricing - Pass logged-in user through executor context for tracking Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-21 18:17:21 +02:00

Author

SHA1

Message

Date

nickviljoen

6341714899

Split input/output token tracking; refresh provider pricing table

UsageLog now records input_tokens and output_tokens separately and costs
each side at its real rate. The old single 'blended' rate underpriced
input-heavy workloads (vision/QC) and overpriced output-heavy ones.
COST_PER_MILLION_TOKENS rebuilt against the live OpenAI, Gemini and
Anthropic pricing pages (GPT-5.4 family, GPT-4.x, o4-mini; Gemini 2.5
Pro/Flash/Flash-Lite + 1.5 legacy; Claude 4.7/4.6/4.5 + 3.x legacy).
Unknown models now warn instead of silently defaulting to $5/1M.

Adds idempotent ALTER TABLE migration on startup so existing SQLite DBs
pick up the new columns. Dashboard + API surface the input/output split.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-22 14:40:13 +02:00

nickviljoen

e910e00edf

Add Usage Dashboard with token tracking, cost estimates, and filters

- New UsageLog model tracking every LLM API call (provider, model,
  tokens, estimated cost, user, module, check name)
- Instrument LLMConfig.call_vision_api() to auto-log each call
- New /usage tab in nav bar with dashboard showing:
  - Summary cards (total calls, tokens, estimated cost)
  - Breakdowns by provider, model, tool, and user
  - Recent API calls table
  - Time filters (All Time, 30 Days, 7 Days, Today)
- Cost estimates based on per-model token pricing
- Pass logged-in user through executor context for tracking

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-21 18:17:21 +02:00

2 commits