VLLM

vLLM backend configuration for Qwen3-VL layout detection.

QwenLayoutVLLMConfig

Bases: BaseModel

vLLM backend configuration for Qwen layout detection.

This backend uses vLLM for high-throughput inference, making it best suited for batch processing and production deployments. Requires: vllm, torch, transformers, qwen-vl-utils

Example

config = QwenLayoutVLLMConfig(
    model="Qwen/Qwen3-VL-8B-Instruct",
    tensor_parallel_size=2,
    gpu_memory_utilization=0.9,
)
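The fields shown above mirror standard vLLM engine arguments (`model`, `tensor_parallel_size`, `gpu_memory_utilization`). As a rough sketch of how such a config maps onto an engine call, here is a hypothetical stand-in written as a plain dataclass (the real class is a Pydantic `BaseModel`, and may have additional fields not shown in this excerpt):

```python
from dataclasses import dataclass


@dataclass
class QwenLayoutVLLMConfigSketch:
    """Hypothetical stand-in for QwenLayoutVLLMConfig.

    Field names and defaults follow the example above; this is an
    illustration, not the library's actual implementation.
    """

    model: str = "Qwen/Qwen3-VL-8B-Instruct"
    tensor_parallel_size: int = 1
    gpu_memory_utilization: float = 0.9

    def to_engine_kwargs(self) -> dict:
        # These keys correspond to real vLLM LLM() constructor arguments,
        # e.g. vllm.LLM(**config.to_engine_kwargs()).
        return {
            "model": self.model,
            "tensor_parallel_size": self.tensor_parallel_size,
            "gpu_memory_utilization": self.gpu_memory_utilization,
        }


config = QwenLayoutVLLMConfigSketch(
    tensor_parallel_size=2,
    gpu_memory_utilization=0.9,
)
kwargs = config.to_engine_kwargs()
```

With a config in hand, the resulting kwargs would typically be splatted into the vLLM engine constructor; the actual wiring inside the backend is not shown in this excerpt.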