Durable Extraction
Fault Tolerant
Workflows survive crashes and restarts with automatic retries.
Multi-Media
Documents, images, audio, and video with smart routing.
Any LLM
OpenAI, Anthropic, Google. One model string, zero config.
Type Safe
Pydantic schemas ensure validated, typed output every time.
How it works
One flag for durability
Add durable=True and Open Xtract provisions Temporal via Docker automatically.
1
Docker Auto-Start
Temporal and PostgreSQL spin up when needed.
2
Workflow Execution
Extractions run as durable workflows with checkpointing.
3
Retry & Recovery
Failures retry automatically. State persists across restarts.
Terminal
workflow.py
from pydantic import BaseModel
from open_xtract import extract, stop_temporal
class Report(BaseModel):
title: str
findings: list[str]
severity: int
result = extract(
schema=Report,
model="openai:gpt-5.2",
url="https://example.com/report.pdf",
instructions="Extract findings",
durable=True,
temporal_ui=True,
)
stop_temporal()
Works with any media
Documents
Images
Audio
Video
PDF, DOCX, PNG, JPG, MP3, MP4, and 20+ formats