OX
v0.2.0 | Durable Execution

Durable Extraction

Extractions that survive failures.

Extract structured data from documents, images, audio, and video with fault-tolerant workflows powered by Temporal.

View Source
Terminal extract.py
from open_xtract import extract

result = extract(
    schema=Invoice,
    model="openai:gpt-5.2",
    url="https://example.com/doc.pdf",
    instructions="Extract invoice data",
    durable=True,
)
PostgreSQL Temporal Docker Pydantic

Fault Tolerant

Workflows survive crashes and restarts with automatic retries.

Multi-Media

Documents, images, audio, and video with smart routing.

Any LLM

OpenAI, Anthropic, Google. One model string, zero config.

Type Safe

Pydantic schemas ensure validated, typed output every time.

How it works

One flag for durability

Add durable=True and Open Xtract provisions Temporal via Docker automatically.

1

Docker Auto-Start

Temporal and PostgreSQL spin up when needed.

2

Workflow Execution

Extractions run as durable workflows with checkpointing.

3

Retry & Recovery

Failures retry automatically. State persists across restarts.

Terminal workflow.py
from pydantic import BaseModel
from open_xtract import extract, stop_temporal

class Report(BaseModel):
    title: str
    findings: list[str]
    severity: int

result = extract(
    schema=Report,
    model="openai:gpt-5.2",
    url="https://example.com/report.pdf",
    instructions="Extract findings",
    durable=True,
    temporal_ui=True,
)

stop_temporal()

Works with any media

Documents Images Audio Video

PDF, DOCX, PNG, JPG, MP3, MP4, and 20+ formats

Get started

Extract structured data with fault-tolerant workflows.

View Source