- Added Flask web interface for batch metadata processing - Added Excel-based metadata lookup (Celum ID mapping) - Dual-sheet support: DSB (primary) and Medsurg (fallback) - Unicode/hieroglyph support for CGA region (Chinese, Japanese, Korean) - Multi-format support: PDF, images, Office docs, video - OCR with multi-language support (Tesseract) - Filename matching without extension (case-insensitive) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
1 line
51 B
Python
1 line
51 B
Python
"""Content extractors for different file types."""
|