OmniDocs: Your Complete Toolkit for Intelligent Document Understanding
OmniDocs is a unified Python framework for intelligent document understanding. Extract text, tables, layout, and more from PDFs and images using state-of-the-art models, all with a single, powerful API.
Key Features
- Unified API: One simple interface for every document AI task.
- Layout Detection: YOLO, Surya, PaddleOCR, and more.
- OCR Extraction: PaddleOCR, Tesseract, EasyOCR, Surya OCR.
- Text Extraction: PyPDF2, PyMuPDF, pdfplumber, docling_parse, pdftext, Surya.
- Math Expression Extraction:
  - Donut: NAVER CLOVA Donut model for math/LaTeX extraction.
  - Nougat: Facebook's Nougat model for LaTeX from academic documents.
  - Surya Math: Surya-based mathematical expression extraction.
  - UniMERNet: Universal Mathematical Expression Recognition Network.
- Table Extraction: Camelot, Tabula, PDFPlumber, Table Transformer, TableFormer, Surya Table.
- Reading Order & Structure: Advanced document parsing that just works.
- Multilingual: Supports 90+ languages out of the box.
- Extensible: Easily plug in your own models and build custom workflows.
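The "Unified API" and "Extensible" points above describe a common pattern: one entry point that dispatches to interchangeable backends. The sketch below illustrates that pattern in plain Python. It is not the OmniDocs API; `register_backend`, `extract`, and both toy backends are hypothetical names invented for illustration, so consult the API Reference for the real classes and methods.

```python
from typing import Callable, Dict

# Hypothetical registry: these names are illustrative, not OmniDocs' API.
Extractor = Callable[[str], str]
_BACKENDS: Dict[str, Extractor] = {}


def register_backend(name: str) -> Callable[[Extractor], Extractor]:
    """Decorator that registers an extractor function under a backend name."""
    def decorator(func: Extractor) -> Extractor:
        _BACKENDS[name] = func
        return func
    return decorator


def extract(document: str, backend: str) -> str:
    """Single entry point: look up the named backend and run it."""
    try:
        extractor = _BACKENDS[backend]
    except KeyError:
        raise ValueError(
            f"unknown backend {backend!r}; available: {sorted(_BACKENDS)}"
        ) from None
    return extractor(document)


@register_backend("upper")
def upper_backend(document: str) -> str:
    # Stand-in for a real model: just upper-cases the input.
    return document.upper()


@register_backend("reverse")
def reverse_backend(document: str) -> str:
    # A second stand-in backend, plugged in the same way.
    return document[::-1]
```

Callers only ever use `extract(...)`, so switching backends means changing one string, and adding a backend means writing one decorated function.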
Installation
Install OmniDocs from PyPI with a single command:
```bash
pip install omnidocs
```
For full setup (GPU, conda, poetry, etc.), check the Installation Guide.
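To confirm that the install succeeded, one option is to query the installed distribution's version with the standard library. This is a minimal sketch that assumes the distribution name matches the `pip install` name above; `installed_version` is a helper written for this example, not part of OmniDocs.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("omnidocs")
    print(f"omnidocs {found}" if found else "omnidocs is not installed")
```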
Quick Start
See the Quick Start Guide for a comprehensive introduction to using OmniDocs.
The Arsenal: All Supported Backends
Layout Analysis Models
- DocLayout YOLO
- PPStructure (Paddle OCR)
- RT DETR (Docling)
- Florence-2-DocLayNet
- Surya Layout
Text Extraction Libraries
- PyPDF2
- PyMuPDF
- pdfplumber
- docling_parse
- pdftext
- surya_text
OCR Models
- Paddle OCR
- Tesseract
- EasyOCR
- Surya OCR
Math Expression Extraction Models
- Donut
- Nougat
- Surya Math
- UniMERNet
Table Extraction Models
- PPStructure (Paddle OCR)
- Camelot
- Tabula
- PDFPlumber
- Table Transformer
- TableFormer
- Surya Table
Learn More
- Tutorials: Hands-on notebooks and guides for every task.
- API Reference: The full dictionary of all public methods and classes.
Contributing
Contributions are welcome! If you want to help make OmniDocs even better, see our CONTRIBUTING.md guide.
License
The OmniDocs framework is MIT licensed. The underlying models and libraries may have their own licenses; please verify them before using OmniDocs in production.
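As a starting point for that verification, the license metadata that an installed Python package declares can be read with the standard library. This is a rough sketch: `declared_license` is a helper written for this example, it only sees what a package's own metadata states, and it does not cover model weights, which are usually licensed separately on their model cards.

```python
from importlib.metadata import PackageNotFoundError, metadata
from typing import List, Optional


def declared_license(package: str) -> Optional[List[str]]:
    """Collect license fields from a distribution's metadata.

    Returns None if the distribution is not installed, otherwise a
    (possibly empty) list of declared license strings.
    """
    try:
        meta = metadata(package)
    except PackageNotFoundError:
        return None

    found: List[str] = []
    # Free-form and SPDX-style fields, when the package provides them.
    for field in ("License-Expression", "License"):
        value = meta.get(field)
        if value:
            found.append(value)
    # Trove classifiers such as "License :: OSI Approved :: MIT License".
    for classifier in meta.get_all("Classifier") or []:
        if classifier.startswith("License ::"):
            found.append(classifier)
    return found
```

Calling it for each backend you actually install gives a quick first pass; an empty list means the package declares nothing and needs a manual check.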
Support the Project
If you find OmniDocs helpful, please star the repo on GitHub!
Join the Community
- Issues: Report bugs or suggest features.
- Email: adithyaskolavi@gmail.com or laxmansrivastacc@gmail.com