API Provider10 active models1 organizationdocs.x.ai

xAI: API pricing, performance & models

xAI hosts 10 active AI models, with input pricing from $0.20 per 1M tokens, averaging 90 tok/s output throughput, with up to 2.0M context window. Compare xAI's API pricing, latency, and feature support against other LLM providers.

VideoFirst-party only

10Active

Pricing

$0.200/MFrom

$0.675/MAvg

Performance

90tok/sThroughput

0.93sLatency

2.0MMax

Catalog

Type

Price

22 models

Model	Input /M	Output /M	Throughput	Context	Capabilities
Grok Imagine Video	—	—	—	1K	Video
Grok Imagine Video	—	—	—	1K	VisionVideo
Grok Imagine Video	—	—	—	1K	Video
Grok Imagine Image	$0.020/img	—	—	10K	VisionImage gen
Grok Imagine Image	$0.020/img	—	—	10K	Image gen
Grok Imagine Video	$0.050/sec	—	—	131K	Video
Grok Imagine Video	$0.050/sec	—	—	131K	VisionVideo
Grok Imagine Video	$0.050/sec	—	—	131K	Video
Grok-4 Fast Non-Reasoning	$0.200	$0.500	90t/s	2.0M	Vision
Grok-4 Fast Non-Reasoning	$0.200	$0.500	90t/s	2.0M	—
Grok-4 Fast Reasoning	$0.200	$0.500	90t/s	2.0M	Vision
Grok-4 Fast Reasoning	$0.200	$0.500	90t/s	2.0M	—
Grok-4.1 Fast Non-Reasoning	$0.200	$0.500	90t/s	2.0M	Vision
Grok-4.1 Fast Non-Reasoning	$0.200	$0.500	90t/s	2.0M	—
Grok-4.1 Fast Reasoning	$0.200	$0.500	90t/s	2.0M	Vision
Grok-4.1 Fast Reasoning	$0.200	$0.500	90t/s	2.0M	—
Grok 4 Fast	$0.200	$0.500	90t/s	2.0M	Vision
Grok 4 Fast	$0.200	$0.500	90t/s	2.0M	—
Grok Code Fast 1	$0.200	$1.50	76t/s	256K	—
Grok 4.3	$1.25	$2.50	—	1.0M	—
Grok-3	$3.00	$15.0	100t/s	128K	Vision
Grok-3	$3.00	$15.0	100t/s	128K	—

At a glance

xAIpricing, performance & catalog

The citable facts about xAI's 10 models — sourced from provider APIs and refreshed continuously.

Lowest price: Grok-4 Fast Non-Reasoning at $0.200 per 1M input tokens
Highest throughput: Grok-3 at 100 tokens/s
Lowest latency: Grok-3 at 0.70s
Largest context: Grok-4 Fast Non-Reasoning at 2.0M tokens
Catalog: 10 active models from 1 organization

Most affordable

Fastest

Largest context

FAQ

Common questions about xAI.

What is xAI?

xAI is an API provider that hosts large language models. Active models: 10; From (input): $0.20 / 1M tok; Avg throughput: 90 tok/s; Avg latency: 0.93 s; Max context: 2.0M.

How many models does xAI offer?

xAI currently serves 10 active models out of 20 historical offerings on LLM Stats.

What is xAI's API pricing?

xAI input pricing starts from $0.20 per 1M tokens, with the most expensive offering at $3 per 1M tokens. See the Pricing tab above for the full per-model breakdown.

How fast is xAI?

xAI averages 90 output tokens per second across its catalog, with average latency of 0.93s. Per-model performance is shown in the Performance tab.

Is xAI OpenAI compatible?

Most providers expose an OpenAI-compatible /v1/chat/completions endpoint so you can switch from OpenAI to xAI by changing only the base URL and API key. Check https://docs.x.ai for the exact endpoint format and any provider-specific parameters.

Does xAI support multimodal models?

Yes. xAI's catalog includes 9 vision-capable, 2 image generation, and 6 video models. See the Models and Capabilities tabs for the full per-model breakdown.

Whose models does xAI host?

xAI hosts models from xAI. See the Models tab for the full catalog grouped by creator.

How do I start using xAI?

Sign up at https://docs.x.ai to get an API key, then call xAI's API directly from your application. Most clients work out of the box by pointing the OpenAI SDK at xAI's base URL with your key. Use the Pricing and Performance tabs above to pick the right model for your latency, cost, and context-window requirements.

Catalog

Type

Price

22 models

Model	Input /M	Output /M	Throughput	Context	Capabilities
Grok Imagine Video	—	—	—	1K	Video
Grok Imagine Video	—	—	—	1K	VisionVideo
Grok Imagine Video	—	—	—	1K	Video
Grok Imagine Image	$0.020/img	—	—	10K	VisionImage gen
Grok Imagine Image	$0.020/img	—	—	10K	Image gen
Grok Imagine Video	$0.050/sec	—	—	131K	Video
Grok Imagine Video	$0.050/sec	—	—	131K	VisionVideo
Grok Imagine Video	$0.050/sec	—	—	131K	Video
Grok-4 Fast Non-Reasoning	$0.200	$0.500	90t/s	2.0M	Vision
Grok-4 Fast Non-Reasoning	$0.200	$0.500	90t/s	2.0M	—
Grok-4 Fast Reasoning	$0.200	$0.500	90t/s	2.0M	Vision
Grok-4 Fast Reasoning	$0.200	$0.500	90t/s	2.0M	—
Grok-4.1 Fast Non-Reasoning	$0.200	$0.500	90t/s	2.0M	Vision
Grok-4.1 Fast Non-Reasoning	$0.200	$0.500	90t/s	2.0M	—
Grok-4.1 Fast Reasoning	$0.200	$0.500	90t/s	2.0M	Vision
Grok-4.1 Fast Reasoning	$0.200	$0.500	90t/s	2.0M	—
Grok 4 Fast	$0.200	$0.500	90t/s	2.0M	Vision
Grok 4 Fast	$0.200	$0.500	90t/s	2.0M	—
Grok Code Fast 1	$0.200	$1.50	76t/s	256K	—
Grok 4.3	$1.25	$2.50	—	1.0M	—
Grok-3	$3.00	$15.0	100t/s	128K	Vision
Grok-3	$3.00	$15.0	100t/s	128K	—