Files
rag-from-scratch/requirements.txt
T
davide 346e336f1a step-2: add convert_pdf.py (pymupdf4llm, low-memory)
Converts PDFs in sources/ to Markdown using pymupdf4llm (pure C,
~30-50 MB RAM, no ML models). Output: step-2/<stem>/raw.md + clean.md.
2026-04-13 10:01:03 +02:00

6 lines
114 B
Plaintext

# Step 0-1 — Ispezione e verifica PDF
pdfplumber==0.11.9
# Step 2 — Conversione PDF → Markdown
pymupdf4llm