
OpenAI Model Catalog

Werker Health proxies every generally available OpenAI model. After pointing the SDK at https://api.werker.health/v1, use the exact OpenAI model string; no prefix is required.

✅ Newly announced models (including GPT-5 variants) become available on Werker Health within minutes of their launch by OpenAI. See the OpenAI Models reference for the authoritative list.

| Model | Recommended Use |
| --- | --- |
| gpt-5 | Frontier flagship for complex coding, agentic reasoning, and cross-domain problem solving. |
| gpt-5-mini | Cost-efficient GPT-5 tier for high-volume tasks with well-defined outputs. |
| gpt-5-nano | Fastest GPT-5 deployment for latency-sensitive automations. |
| gpt-5-pro | Premium variant tuned for more precise responses and deeper reasoning. |

GPT-4 Series

| Model | Recommended Use |
| --- | --- |
| gpt-4.1 | Production-grade reasoning with strong multimodal performance. |
| gpt-4.1-mini | Balanced quality and price for copilots and summarization. |
| gpt-4o | Multimodal flagship optimized for cost, latency, and speech. |
| gpt-4o-mini | Scaled deployments needing strong quality under tight latency budgets. |
| gpt-4o-realtime-preview | Audio-first experiences and real-time assistants. |

o-Series (High Planning)

| Model | Recommended Use |
| --- | --- |
| o4-mini | Long-context, tool-heavy agents benefiting from explicit reasoning traces. |
| o3-mini | Lightweight planning, evaluation, and oversight workflows. |

Specialized & Legacy Models

| Model | Recommended Use |
| --- | --- |
| gpt-4o-audio-preview | Speech-centric assistants that transcribe, reason, and respond vocally. |
| gpt-4o-mini-transcribe | Medical and claims transcription with high accuracy. |
| gpt-4o-code | Code generation, reviews, and refactoring tasks. |
| gpt-3.5-turbo | Legacy workloads not yet migrated to GPT-4o/5. |
| text-embedding-3-large / text-embedding-3-small | Semantic search, retrieval, and analytics pipelines. |
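
For the embedding models listed above, a minimal semantic-search sketch through the proxy might look like the following. It assumes the proxy exposes the standard OpenAI embeddings endpoint at the same base URL and that the key is exported as the environment variable WERKER_HEALTH_API_KEY. The helper names `embed`, `cosine`, and `rank` are illustrative and not part of any SDK.

```python
import math
import os


def embed(texts, model="text-embedding-3-small"):
    """Return one embedding vector per input text (makes a network call)."""
    # Imported lazily so the pure helpers below work without the SDK installed.
    from openai import OpenAI

    client = OpenAI(
        base_url="https://api.werker.health/v1",
        # Assumption: the key is stored in this environment variable.
        api_key=os.environ["WERKER_HEALTH_API_KEY"],
    )
    response = client.embeddings.create(model=model, input=texts)
    return [item.embedding for item in response.data]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank(query_vec, doc_vecs):
    """Return document indices ordered from most to least similar."""
    scores = [cosine(query_vec, d) for d in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)
```

A typical call would embed the query together with the documents, for example `vectors = embed([query, *documents])`, and then order the documents with `rank(vectors[0], vectors[1:])`.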

Managing Model Access

  • Configure per-environment allowlists to control which model families can receive PHI.
  • Use Werker’s billing dashboards to monitor cost per model and set budgets.
  • Combine metadata fields with Werker audit logs to track which application feature invoked each model.
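
The controls above are configured in Werker, but an application can mirror them client-side so that a disallowed request fails before it leaves the process. The sketch below is illustrative only: the `ALLOWED_MODELS` mapping, the environment names, and the `feature` metadata key are hypothetical, and it assumes the proxy records the standard `metadata` request field alongside each call.

```python
# Hypothetical client-side mirror of a per-environment allowlist.
ALLOWED_MODELS = {
    "production": {"gpt-5", "gpt-4.1"},
    "staging": {"gpt-5", "gpt-5-mini", "gpt-4.1", "gpt-4o-mini"},
}


def build_request(environment, model, feature, user_input):
    """Return keyword arguments for client.responses.create, or raise."""
    allowed = ALLOWED_MODELS.get(environment, set())
    if model not in allowed:
        raise PermissionError(f"{model!r} is not allowlisted for {environment!r}")
    return {
        "model": model,
        "input": user_input,
        # Assumption: these string key-value pairs appear next to the
        # request in audit logs, identifying the calling feature.
        "metadata": {"feature": feature, "environment": environment},
    }
```

The result can be passed straight to the SDK, for example `client.responses.create(**build_request("production", "gpt-5", "discharge-plan", prompt))`.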

Rolling Updates

  • Automatic availability: Newly released OpenAI models appear without app changes.
  • Version pinning: Set explicit model strings in production to avoid surprises; use staging to evaluate new versions.
  • Testing new models: Clone workloads in Werker’s sandbox environment and compare output quality before promoting to production.
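
One way to act on the pinning and evaluation advice above is to keep model strings in per-environment configuration and run the same prompts against the pinned and candidate models. In this sketch, `PINNED_MODELS`, `compare_models`, and the `generate` callable are hypothetical names, and `generate` stands in for whatever function wraps the API call.

```python
# Hypothetical per-environment pins: production stays fixed while staging
# evaluates a candidate model.
PINNED_MODELS = {
    "production": "gpt-4.1",
    "staging": "gpt-5",
}


def compare_models(generate, prompts, baseline, candidate):
    """Run each prompt against both models and collect the outputs side by side.

    `generate(model, prompt)` must return the model's text output.
    """
    rows = []
    for prompt in prompts:
        base_out = generate(baseline, prompt)
        cand_out = generate(candidate, prompt)
        rows.append(
            {
                "prompt": prompt,
                "baseline": base_out,
                "candidate": cand_out,
                "identical": base_out == cand_out,
            }
        )
    return rows
```

With a real client, `generate` could be as small as `lambda model, prompt: client.responses.create(model=model, input=prompt).output_text`. Injecting it as a parameter keeps the comparison logic testable without network access.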

Sample Usage

python
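import os

from openai import OpenAI

# Point the SDK at the Werker Health proxy instead of api.openai.com.
client = OpenAI(
    base_url="https://api.werker.health/v1",
    # Assumes the key is exported as the WERKER_HEALTH_API_KEY environment variable.
    api_key=os.environ["WERKER_HEALTH_API_KEY"],
)

# Use the exact OpenAI model string; no prefix is needed.
plan = client.responses.create(
    model="gpt-5",
    input=[
        {"role": "system", "content": "You generate discharge plans for clinicians."},
        {"role": "user", "content": "Create discharge instructions for a mild asthma flare."},
    ],
)

# output_text joins the text parts of the response into a single string.
print(plan.output_text)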
ts
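import OpenAI from "openai";

// Point the SDK at the Werker Health proxy instead of api.openai.com.
const client = new OpenAI({
  baseURL: "https://api.werker.health/v1",
  // Assumes the key is exported as the WERKER_HEALTH_API_KEY environment variable.
  apiKey: process.env.WERKER_HEALTH_API_KEY,
});

// Use the exact OpenAI model string; no prefix is needed.
const plan = await client.responses.create({
  model: "gpt-5",
  input: [
    { role: "system", content: "You generate discharge plans for clinicians." },
    { role: "user", content: "Create discharge instructions for a mild asthma flare." },
  ],
});

// output_text joins the text parts of the response into a single string.
console.log(plan.output_text);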