Troubleshooting
Common Issues
PDFs Not Chunking: Ensure PyMuPDF is installed and that the PDFs aren’t encrypted. Use --dry-run to debug.
Semantic Chunking Fails: Syntax errors cause a fallback to single chunks. Fix the code or disable --semantic-chunks.
Chunks Too Large: --context-window is a target, not a hard limit; use --max-chunk-size for strict limits.
Ignore Patterns Not Applied: Check your ignore patterns and use --dry-run to verify which files are skipped. If in doubt, put the ** wildcard before and after the folder name.
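To see why wrapping a folder name in ** can help, here is a minimal sketch using Python's fnmatch module. It assumes ignore patterns are compared against the whole relative path in fnmatch style, which may differ from how the tool actually matches them. The helper name is_ignored and the sample paths are hypothetical and exist only for illustration.

```python
from fnmatch import fnmatch


def is_ignored(path: str, patterns: list[str]) -> bool:
    """Return True if the path matches any pattern (fnmatch-style, whole path).

    Assumption: this mimics, but is not taken from, the tool's ignore logic.
    """
    return any(fnmatch(path, pattern) for pattern in patterns)


# Hypothetical paths, for illustration only.
paths = ["src/node_modules/lib/index.js", "src/app.py"]

# A bare folder name only matches a path that is exactly that name.
print([p for p in paths if is_ignored(p, ["node_modules"])])

# Wildcards on both sides match the folder name anywhere in the path.
print([p for p in paths if is_ignored(p, ["**node_modules**"])])
```

If a pattern behaves differently in the tool than in this sketch, --dry-run remains the authoritative check.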
FAQ
Q: How do I process only specific file types? A: Use --file-type, e.g., --file-type py.
Q: Can I customize ignore patterns? A: Yes, with --ignore and --unignore.
Q: Why isn’t my Python file split semantically? A: Check for syntax errors and ensure --semantic-chunks is enabled.
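A quick way to check a Python file for syntax errors before chunking is to parse it with the standard library's ast module. This is a minimal sketch, assuming the tool's semantic chunking rejects the same files that Python's own parser rejects, which the documentation above does not state. The function name find_syntax_error is hypothetical.

```python
import ast
from typing import Optional


def find_syntax_error(source: str, filename: str = "<string>") -> Optional[str]:
    """Return a description of the first syntax error, or None if the source parses."""
    try:
        ast.parse(source, filename=filename)
    except SyntaxError as exc:
        return f"{filename}:{exc.lineno}: {exc.msg}"
    return None


# Valid source parses cleanly, so no error is reported.
print(find_syntax_error("def ok():\n    return 1\n"))

# Malformed source yields the file name, line number, and parser message.
print(find_syntax_error("def broken(:\n    pass\n"))
```

To check a real file, read its text and pass it in along with the file name. Note that ast.parse uses the grammar of the interpreter running the check, so a file written for a newer Python version can fail under an older one.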