MLX

MLX backend configuration for GLM-OCR text extraction.

GLMOCRMLXConfig

Bases: BaseModel

MLX backend configuration for GLM-OCR.

Uses mlx-vlm for Apple Silicon native inference.
GLM-OCR at 0.9B runs comfortably on any M-series Mac with 8GB+ unified memory.
Requires: mlx, mlx-vlm>=0.3.11

Note: Only works on Apple Silicon Macs. Do NOT use for Modal/cloud deployments.

Available models:
    mlx-community/GLM-OCR-bf16   (default — full precision, 2.21 GB)
    mlx-community/GLM-OCR-6bit   (quantized, smaller)

Example:

```python
config = GLMOCRMLXConfig()  # bf16, default
config = GLMOCRMLXConfig(model="mlx-community/GLM-OCR-6bit")  # quantized
```
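Since the class is a pydantic `BaseModel`, the two documented model choices and the bf16 default can be mirrored in a stdlib-only sketch. This is illustrative, not the package's actual implementation: the class name, field set, and validation behavior here are assumptions, and the real config may carry additional fields and pydantic validators.

```python
from dataclasses import dataclass

# Model ids taken from the "Available models" list above.
KNOWN_MODELS = {
    "mlx-community/GLM-OCR-bf16",  # full precision, 2.21 GB (default)
    "mlx-community/GLM-OCR-6bit",  # quantized, smaller
}


@dataclass
class GLMOCRMLXConfigSketch:
    """Hypothetical stand-in for GLMOCRMLXConfig (the real class is a pydantic BaseModel)."""

    model: str = "mlx-community/GLM-OCR-bf16"  # bf16 default, as documented

    def __post_init__(self) -> None:
        # Reject model ids not in the documented list; the real pydantic
        # class may or may not enforce this.
        if self.model not in KNOWN_MODELS:
            raise ValueError(f"unknown GLM-OCR MLX model: {self.model!r}")
```

Usage mirrors the example above: construct with no arguments for the bf16 default, or pass `model="mlx-community/GLM-OCR-6bit"` for the quantized variant.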