- Optimize the prompt. Give it a dataset of inputs and correct outputs for one task. It runs prompt optimization (DSPy GEPA) and reports the score change on held-out rows.
- Replace the model. Once you have enough verified data, train a small model to take over from the frontier model. The switch is gated on evals, with instant rollback. This second feature is still being built; pages that describe it are marked Building.
Quickstart
Go from a CSV to an optimized prompt in under ten lines.
JSON extraction
A full run for the first task class, end to end.
Capture from LangChain
Log your production calls with one callback, no code changes.
Python SDK reference
Every method, its parameters, and what it returns.