📓 Blog

Engineering, pricing, and product notes.

What changed under the hood, why it matters for your bills, and how to use the new bits.

Release Notes 2026-05-12 · 6 min read

May 2026 — 23 shipments, zero breaking changes, 30–90% cache savings

Upstream prompt-cache pass-through, semantic cache, smart router, batch API, content moderation, PII redaction, vision token math fix, 7 new models behind 3 new providers, prompt templates, fine-tune lifecycle, reserved throughput, and Prometheus/Grafana observability — all additive. Existing OpenAI SDK code keeps working without a single line changed.

Read post →