Problem: Complex client Excel files (30+ columns, merged cells, Q&A columns,
tier data) produced zero assets because the extraction was a dumb pipe dump
that lost all column context.
Fix:
- Smart Excel extraction: detects header rows, labels each value with its
column name, skips empty sheets, handles merged cells. Claude now sees
"Top 10 deliverables: Toolbox presentation deck | Tier A: Yes | 1"
instead of "Toolbox | Base | Toolbox presentation deck | ü' | Yes | 1"
- Two extraction modes on Upload tab:
- Normal: fast single-pass extraction (~$0.05)
- Deep Extraction: two-pass AI analysis (~$0.15-0.30)
Pass 1: Claude analyzes the spreadsheet structure
Pass 2: Claude extracts assets using the structural understanding
- Upload endpoint accepts ?mode=normal|deep query parameter
- Background parse shows "Deep extraction: analyzing structure (Pass 1 of 2)"
- Tested against both Wella files - header-aware extraction produces
clear labelled output
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
|
||
|---|---|---|
| .. | ||
| __init__.py | ||
| deps.py | ||
| efficiency.py | ||
| gmal.py | ||
| ingest.py | ||
| matching.py | ||
| projects.py | ||
| ratecard.py | ||