OmniDocs: Your Complete Toolkit for Intelligent Document Understanding

OmniDocs is a unified Python framework for intelligent document understanding. Extract text, tables, layout, and more from PDFs and images using state-of-the-art models, all with a single, powerful API.


✨ Key Features

  • Unified API: One simple interface for every document AI task.
  • Layout Detection: YOLO, Surya, PaddleOCR, and more.
  • OCR Extraction: PaddleOCR, Tesseract, EasyOCR, Surya OCR.
  • Text Extraction: PyPDF2, PyMuPDF, pdfplumber, docling_parse, pdftext, Surya.
  • Math Expression Extraction:
      • Donut: NAVER CLOVA Donut model for math/LaTeX extraction.
      • Nougat: Facebook's Nougat model for LaTeX from academic documents.
      • Surya Math: Surya-based mathematical expression extraction.
      • UniMERNet: Universal Mathematical Expression Recognition Network.
  • Table Extraction: Camelot, Tabula, PDFPlumber, Table Transformer, TableFormer, Surya Table.
  • Reading Order & Structure: Advanced document parsing that just works.
  • Multilingual: Supports 90+ languages out of the box.
  • Extensible: Easily plug in your own models and build custom workflows.
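
To make the "Unified API" and "Extensible" points concrete, here is a minimal sketch of the general pattern such a framework can follow: every backend implements one shared method, and a registry maps a backend name to its class. This is not the OmniDocs API. `ExtractionResult`, `Extractor`, `register`, `get_extractor`, and `UpperCaseExtractor` are all hypothetical names invented for illustration; see the API Reference for the real classes.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Protocol


@dataclass
class ExtractionResult:
    """Hypothetical container for whatever a backend returns."""
    backend: str
    text: str


class Extractor(Protocol):
    """Hypothetical shared interface: every backend exposes extract()."""

    def extract(self, source: str) -> ExtractionResult: ...


# Hypothetical registry mapping a backend name to a zero-argument factory.
_REGISTRY: Dict[str, Callable[[], Extractor]] = {}


def register(name: str):
    """Decorator that adds a backend class to the registry under `name`."""
    def wrap(cls):
        _REGISTRY[name] = cls
        return cls
    return wrap


def get_extractor(name: str) -> Extractor:
    """Look up a backend by name and instantiate it."""
    try:
        return _REGISTRY[name]()
    except KeyError:
        known = sorted(_REGISTRY)
        raise ValueError(f"unknown backend {name!r}; known: {known}") from None


@register("upper")
class UpperCaseExtractor:
    """Toy backend standing in for a real OCR or text-extraction model."""

    def extract(self, source: str) -> ExtractionResult:
        return ExtractionResult(backend="upper", text=source.upper())


# Callers pick a backend by name and use the same method for all of them.
result = get_extractor("upper").extract("hello")
print(result.text)
```

With this shape, adding a custom model means writing one class with an `extract` method and registering it; calling code does not change.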

⚡ Installation

Install OmniDocs from PyPI with a single command:

```bash
pip install omnidocs
```

For full setup (GPU, conda, poetry, etc.), check the Installation Guide.
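
After installing, one way to confirm that the package is visible to your interpreter is to query its installed version using only the standard library. The helper name `installed_version` is ours, not part of OmniDocs; the only assumption about OmniDocs is the distribution name `omnidocs` used in the install command.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("omnidocs")
print(f"omnidocs {version}" if version else "omnidocs is not installed")
```

This check reads package metadata only, so it does not import OmniDocs or load any models.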


🚀 Quick Start

See the Quick Start Guide for a comprehensive introduction to using OmniDocs.


📚 The Arsenal: All Supported Backends

Layout Analysis Models

  • DocLayout YOLO
  • PPStructure (Paddle OCR)
  • RT DETR (Docling)
  • Florence-2-DocLayNet
  • Surya Layout

Text Extraction Libraries

  • PyPDF2
  • PyMuPDF
  • pdfplumber
  • docling_parse
  • pdftext
  • surya_text

OCR Models

  • Paddle OCR
  • Tesseract
  • EasyOCR
  • Surya OCR

Math Expression Extraction Models

  • Donut
  • Nougat
  • Surya Math
  • UniMERNet

Table Extraction Models

  • PPStructure (Paddle OCR)
  • Camelot
  • Tabula
  • PDFPlumber
  • Table Transformer
  • TableFormer
  • Surya Table

๐Ÿ—บ๏ธ Learn More

  • Tutorials: Hands-on notebooks and guides for every task.
  • API Reference: Complete reference for all public methods and classes.

๐Ÿค Contributing

Contributions are welcome! If you want to help make OmniDocs even better, see our CONTRIBUTING.md guide.


๐Ÿ›ก๏ธ License

The OmniDocs framework is MIT licensed. The underlying models and libraries may have their own licenses; please verify them before use in production.


🌟 Support the Project

If you find OmniDocs helpful, please ⭐ the repo on GitHub!


๐Ÿ—จ๏ธ Join the Community

  • Issues: Report bugs or suggest features through the project's issue tracker.
  • Email: adithyaskolavi@gmail.com or laxmansrivastacc@gmail.com