You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Comprehensive PDF parser focused on metadata-rich, layout-aware extraction. Combines PyMuPDF/pdfplumber text analysis, Camelot/Tabula tables, image and formula capture, plus column detection to preserve reading order. Ships with TOON export + token comparisons, CLI examples, and utilities for visual debug + dataset generation.