VLLM
VLLM backend configuration for Dots OCR.
DotsOCRVLLMConfig
Bases: BaseModel
VLLM provides high-throughput inference with optimizations such as:

- PagedAttention for efficient KV cache management
- Continuous batching for higher throughput
- Optimized CUDA kernels
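
As a rough illustration of why block-based KV cache management helps, the sketch below compares the token slots reserved when each sequence preallocates a fixed maximum length with the slots reserved when memory is handed out in fixed-size blocks on demand, which is the idea behind PagedAttention. The function names, the sequence lengths, the 2048-token maximum, and the block size of 16 are assumptions chosen for illustration; none of them are fields of `DotsOCRVLLMConfig`.

```python
import math


def kv_slots_contiguous(seq_lens, max_len):
    """Slots reserved if every sequence preallocates max_len tokens up front."""
    return len(seq_lens) * max_len


def kv_slots_paged(seq_lens, block_size=16):
    """Slots reserved if each sequence only holds the blocks it has filled.

    block_size=16 is an assumed value for this sketch.
    """
    return sum(math.ceil(n / block_size) * block_size for n in seq_lens)


# Hypothetical batch: three sequences of very different lengths.
seq_lens = [37, 512, 90]
max_len = 2048

used = sum(seq_lens)                                 # tokens actually stored
contiguous = kv_slots_contiguous(seq_lens, max_len)  # worst-case reservation
paged = kv_slots_paged(seq_lens)                     # block-granular reservation

print(f"tokens stored:       {used}")
print(f"contiguous reserved: {contiguous}")
print(f"paged reserved:      {paged}")
```

With paged allocation, the slack per sequence is always less than one block, so the reserved memory tracks the tokens actually stored instead of the configured maximum length.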