PDF to clean Markdown converter that preserves headings, tables and links, with a plugin hook for OCR (Claude PDF-to-MD OCR). Published on PyPI.
Turning real-world PDFs into clean, structured Markdown that keeps headings, tables and links intact, while staying easy to extend for scanned documents.
Built a Python package with a plugin architecture so OCR can be layered in via Claude PDF-to-MD OCR. Packaged and published on PyPI for reuse.
Let's discuss how we can help you achieve similar results