Skip to content

PyTorch

PyTorch/HuggingFace backend configuration for LightOn text extraction.

LightOnTextPyTorchConfig

Bases: BaseModel

PyTorch/HuggingFace backend config for LightOn text extraction.

Uses HuggingFace Transformers with LightOnOcrForConditionalGeneration.

Example
from omnidocs.tasks.text_extraction import LightOnTextExtractor
from omnidocs.tasks.text_extraction.lighton import LightOnTextPyTorchConfig

extractor = LightOnTextExtractor(
    backend=LightOnTextPyTorchConfig(device="cuda", torch_dtype="bfloat16")
)
result = extractor.extract(image)