At a glance

OpenAIpricing, performance & catalog

The citable facts about OpenAI's 31 models — sourced from provider APIs and refreshed continuously.

Lowest price
GPT OSS 120B High at $0.100 per 1M input tokens
Highest throughput
GPT-5 mini at 200 tokens/s
Lowest latency
GPT-4 Turbo at 0.50s
Largest context
GPT-5.5 at 1.1M tokens
Catalog
31 active models from 1 organization

FAQ

Common questions about OpenAI.

What is OpenAI?

OpenAI is an API provider that hosts large language models. Active models: 31; From (input): $0.10 / 1M tok; Avg throughput: 108 tok/s; Avg latency: 2.76 s; Max context: 1.1M.

How many models does OpenAI offer?

OpenAI currently serves 31 active models out of 54 historical offerings on LLM Stats.

What is OpenAI's API pricing?

OpenAI input pricing starts from $0.10 per 1M tokens, with the most expensive offering at $10 per 1M tokens. See the Pricing tab above for the full per-model breakdown.

How fast is OpenAI?

OpenAI averages 108 output tokens per second across its catalog, with average latency of 2.76s. Per-model performance is shown in the Performance tab.

Is OpenAI OpenAI compatible?

Most providers expose an OpenAI-compatible /v1/chat/completions endpoint so you can switch from OpenAI to OpenAI by changing only the base URL and API key. Check https://openai.com for the exact endpoint format and any provider-specific parameters.

Does OpenAI support multimodal models?

Yes. OpenAI's catalog includes 28 vision-capable, 12 image generation, 2 audio, and 4 video models. See the Models and Capabilities tabs for the full per-model breakdown.

Whose models does OpenAI host?

OpenAI hosts models from OpenAI. See the Models tab for the full catalog grouped by creator.

How do I start using OpenAI?

Sign up at https://openai.com to get an API key, then call OpenAI's API directly from your application. Most clients work out of the box by pointing the OpenAI SDK at OpenAI's base URL with your key. Use the Pricing and Performance tabs above to pick the right model for your latency, cost, and context-window requirements.