step-5: add adaptive chunker

chunker.py splits any revised Markdown (step-4) into RAG-ready chunks.
Supports 4 strategies driven by structure_profile.json: h3_aware,
h2_paragraph_split, paragraph, sliding_window. Respects MIN/MAX_CHARS
and sentence-level overlap. Updates .gitignore and README paths.
This commit is contained in:
2026-04-13 13:36:53 +02:00
parent 1631dff80d
commit 5126e0d971
3 changed files with 468 additions and 8 deletions
+3
View File
@@ -37,3 +37,6 @@ step-3/*/
step-4/*/
step-4/revision_log.md
# Output step-5 — chunk generati da chunker.py
step-5/*/