Problem: Complex client Excel files (30+ columns, merged cells, Q&A columns,
tier data) produced zero assets because the extraction was a dumb pipe dump
that lost all column context.
Fix:
- Smart Excel extraction: detects header rows, labels each value with its
column name, skips empty sheets, handles merged cells. Claude now sees
"Top 10 deliverables: Toolbox presentation deck | Tier A: Yes | 1"
instead of "Toolbox | Base | Toolbox presentation deck | ü' | Yes | 1"
- Two extraction modes on Upload tab:
- Normal: fast single-pass extraction (~$0.05)
- Deep Extraction: two-pass AI analysis (~$0.15-0.30)
Pass 1: Claude analyzes the spreadsheet structure
Pass 2: Claude extracts assets using the structural understanding
- Upload endpoint accepts ?mode=normal|deep query parameter
- Background parse shows "Deep extraction: analyzing structure (Pass 1 of 2)"
- Tested against both Wella files - header-aware extraction produces
clear labelled output
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
|
||
|---|---|---|
| .. | ||
| __init__.py | ||
| ai_descriptions.py | ||
| ai_matching.py | ||
| doc_parser.py | ||
| excel_parser.py | ||
| export_excel.py | ||
| export_pdf.py | ||
| match_refiner.py | ||
| ratecard_builder.py | ||
| rfp_analysis.py | ||
| team_shape.py | ||
| tier_expander.py | ||