Fix PyMuPDF: replace get_text("markdown") with get_text("text")

"markdown" format requires PyMuPDF >= 1.24.0, not available on server.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Vadym Samoilenko 2026-03-20 17:59:22 +00:00
parent 8670822a59
commit 6157fcbc4e


@@ -82,7 +82,7 @@ def _parse_pdf_with_pymupdf(file_path: str) -> str:
     parts = []
     with fitz.open(file_path) as doc:
         for page_num, page in enumerate(doc, 1):
-            text = page.get_text("markdown").strip()
+            text = page.get_text("text").strip()
             if text:
                 parts.append(f"## Page {page_num}\n\n{text}")
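
The per-page assembly in this hunk can be exercised without PyMuPDF installed. The sketch below is only an illustration: `join_pages` is a hypothetical helper, not part of the repository, and joining the parts with blank lines is an assumption, since the hunk does not show how `parts` is combined.

```python
from typing import Iterable, List


def join_pages(page_texts: Iterable[str]) -> str:
    """Hypothetical stand-in for the loop in _parse_pdf_with_pymupdf.

    Takes already-extracted page strings instead of a fitz document.
    """
    parts: List[str] = []
    for page_num, raw in enumerate(page_texts, 1):
        text = raw.strip()
        if text:  # pages with no extractable text are skipped
            parts.append(f"## Page {page_num}\n\n{text}")
    # Assumption: parts are joined with a blank line; the hunk does not show this step.
    return "\n\n".join(parts)
```

Page numbers follow the position in the document, so a skipped empty page leaves a gap in the numbering rather than renumbering later pages.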