OmniDocs

adithya-s-k/OmniDocs

PyTorch

PyTorch backend configuration for Dots OCR.

DotsOCRPyTorchConfig

Bases: BaseModel

PyTorch/HuggingFace backend configuration for Dots OCR.

Dots OCR provides layout-aware text extraction with 11 predefined layout categories (Caption, Footnote, Formula, List-item, Page-footer, Page-header, Picture, Section-header, Table, Text, Title).
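For downstream filtering it can help to have the 11 categories listed above as a constant. The sketch below is illustrative only: the names `DOTS_OCR_LAYOUT_CATEGORIES` and `is_known_category` are hypothetical and are not part of the OmniDocs API.

```python
# Hypothetical helper, not part of OmniDocs: the 11 layout categories
# named in the DotsOCRPyTorchConfig description, kept as a tuple.
DOTS_OCR_LAYOUT_CATEGORIES = (
    "Caption",
    "Footnote",
    "Formula",
    "List-item",
    "Page-footer",
    "Page-header",
    "Picture",
    "Section-header",
    "Table",
    "Text",
    "Title",
)


def is_known_category(label: str) -> bool:
    """Return True if `label` is one of the predefined layout categories."""
    return label in DOTS_OCR_LAYOUT_CATEGORIES


print(len(DOTS_OCR_LAYOUT_CATEGORIES))
print(is_known_category("Table"))
```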

Example

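from omnidocs.tasks.text_extraction import DotsOCRTextExtractor
from omnidocs.tasks.text_extraction.dotsocr import DotsOCRPyTorchConfig

# Configure the PyTorch/HuggingFace backend
config = DotsOCRPyTorchConfig(
    model="rednote-hilab/dots.ocr",  # model identifier
    device="cuda",                   # run on a CUDA GPU
    torch_dtype="bfloat16",          # weight precision
)

# Pass the backend configuration to the extractor
extractor = DotsOCRTextExtractor(backend=config)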
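
The example above hardcodes `device="cuda"` and `torch_dtype="bfloat16"`. On machines without a suitable GPU, one option is to choose these strings at runtime. The following is a minimal sketch: `pick_device_and_dtype` is a hypothetical helper, and it is an assumption (not confirmed by this page) that `DotsOCRPyTorchConfig` accepts values such as `"cpu"`, `"float16"`, or `"float32"`.

```python
def pick_device_and_dtype():
    """Return (device, torch_dtype) strings suited to the local hardware.

    Hypothetical helper; the accepted config values are an assumption.
    """
    try:
        import torch  # optional: fall back to CPU defaults if missing
    except ImportError:
        return "cpu", "float32"

    if torch.cuda.is_available():
        # bfloat16 needs hardware support; fall back to float16 otherwise
        dtype = "bfloat16" if torch.cuda.is_bf16_supported() else "float16"
        return "cuda", dtype

    return "cpu", "float32"


device, torch_dtype = pick_device_and_dtype()
print(device, torch_dtype)
```

The resulting strings could then be passed as the `device` and `torch_dtype` arguments when constructing the config, provided the backend supports them.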