OpenAIReleased on Aug 5, 2025

GPT OSS 120B: Benchmarks, Pricing & Context Window

Name: GPT OSS 120B
Author: OpenAI

GPT OSS 120B is a language model from OpenAI, released in August 2025.

GPT-OSS-120B is an open-weight, 116.8B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is

Input

Text

Output

Text

$0.13/ 1M · 8:1 in:out

$0.09 in · $0.45 out

GPT OSS 120B pricing

Providers

GPT OSS 120B starts at $0.0900 per million input tokens and $0.450 per million output tokens via DeepInfra. See all 5 providers below with their per-token pricing, latency, throughput, and modality support.

Provider	Input $/M	Output $/M	Max Input	Max Output	Latency s	Throughput	Quant
DeepInfra	$0.0900	$0.450	131.1K	131.1K	—	—	int4
Novita	$0.100	$0.500	131.1K	131.1K	—	—	bf16
OpenAI	$0.100	$0.500	131.1K	131.1K	5.20	115 c/s	—
Fireworks	$0.150	$0.600	131.0K	30.0K	5.76	163 c/s	—
Groq	$0.150	$0.600	131.0K	30.0K	0.50	500 c/s	—