OPEN‑XTRACT

TURN DOCUMENTS INTO
STRUCTURED DATA

Open‑source toolkit for extracting clean, structured data from PDFs, images, and text.
Auto-routing with minimal setup.

INSTALL
uv add open-xtract

FEATURES

AUTO-ROUTING

Single method detects PDFs, images, URLs, or raw text automatically

MULTI-PROVIDER

Supports Claude, GPT, Gemini with automatic provider detection

STRUCTURED OUTPUT

Pydantic schemas ensure clean, typed data extraction

EXAMPLES

PYTHON