- Organizations
- OpenAI
- GPT OSS 120B
GPT OSS 120B: Benchmarks, Pricing & Context Window
GPT OSS 120B is a language model from OpenAI, released in August 2025.
GPT-OSS-120B is an open-weight, 116.8B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is
GPT OSS 120B pricing
Providers
GPT OSS 120B starts at $0.0900 per million input tokens and $0.450 per million output tokens via DeepInfra. See all 5 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.0900 | $0.450 | 131.1K | 131.1K | — | — | int4 | |||
| $0.100 | $0.500 | 131.1K | 131.1K | — | — | bf16 | |||
| $0.100 | $0.500 | 131.1K | 131.1K | 5.20 | 115 c/s | — | |||
| $0.150 | $0.600 | 131.0K | 30.0K | 5.76 | 163 c/s | — | |||
| $0.150 | $0.600 | 131.0K | 30.0K | 0.50 | 500 c/s | — |
GPT OSS 120B API
API access coming soon
GPT OSS 120B will be available through our gateway shortly.
GPT OSS 120B examples
Recent arena outputs from GPT OSS 120B, picked from the highest-ranked matchups.
GPT OSS 120B license
GPT OSS 120B is released under the Apache 2.0 license, which permits commercial use, has 116.8B parameters.
- License
- Apache 2.0
- Commercial use allowed
- Parameters
- 116.8B
Apache License 2.0 - allows commercial use
FAQ
Common questions about GPT OSS 120B.