Question 1

What is MiniMax?

Accepted Answer

MiniMax is an API provider that hosts large language models. Active models: 9; From (input): $0.30 / 1M tok; Avg throughput: 93 tok/s; Avg latency: 3.25 s; Max context: 1.0M.

Question 2

How many models does MiniMax offer?

Accepted Answer

MiniMax currently serves 9 active models out of 9 historical offerings on LLM Stats.

Question 3

What is MiniMax's API pricing?

Accepted Answer

MiniMax input pricing starts from $0.30 per 1M tokens, with the most expensive offering at $0.6 per 1M tokens. See the Pricing tab above for the full per-model breakdown.

Question 4

How fast is MiniMax?

Accepted Answer

MiniMax averages 93 output tokens per second across its catalog, with average latency of 3.25s. Per-model performance is shown in the Performance tab.

Question 5

Is MiniMax OpenAI compatible?

Accepted Answer

Most providers expose an OpenAI-compatible /v1/chat/completions endpoint so you can switch from OpenAI to MiniMax by changing only the base URL and API key. Check https://platform.minimax.io for the exact endpoint format and any provider-specific parameters.

Question 6

Does MiniMax support multimodal models?

Accepted Answer

Yes. MiniMax's catalog includes 2 vision-capable and 4 audio models. See the Models and Capabilities tabs for the full per-model breakdown.

Question 7

Whose models does MiniMax host?

Accepted Answer

MiniMax hosts models from MiniMax. See the Models tab for the full catalog grouped by creator.

Question 8

How do I start using MiniMax?

Accepted Answer

Sign up at https://platform.minimax.io to get an API key, then call MiniMax's API directly from your application. Most clients work out of the box by pointing the OpenAI SDK at MiniMax's base URL with your key. Use the Pricing and Performance tabs above to pick the right model for your latency, cost, and context-window requirements.

Model	Input /M	Output /M	Throughput	Context	Capabilities
MiniMax M3	$0.600	$2.40	—	1.0M	—
MiniMax M3	$0.600	$2.40	—	1.0M	Vision
MiniMax M3	$0.600	$2.40	—	1.0M	—
Speech 2.5 Turbo Preview	$10.0	—	—	3K	Audio
Speech 2.5 HD Preview	$20.0	—	—	3K	Audio
Speech 02 Turbo	$7.50	—	—	3K	Audio
Speech 02 HD	$15.0	—	—	3K	Audio
MiniMax M2.7	$0.300	$1.20	—	205K	—
MiniMax M2.5	$0.300	$1.20	100t/s	1.0M	Vision
MiniMax M2.5	$0.300	$1.20	100t/s	1.0M	—
MiniMax M2.1	$0.300	$1.20	100t/s	1.0M	—
MiniMax M2	$0.300	$1.20	70t/s	1.0M	—

Model	Input /M	Output /M	Throughput	Context	Capabilities
MiniMax M3	$0.600	$2.40	—	1.0M	—
MiniMax M3	$0.600	$2.40	—	1.0M	Vision
MiniMax M3	$0.600	$2.40	—	1.0M	—
Speech 2.5 Turbo Preview	$10.0	—	—	3K	Audio
Speech 2.5 HD Preview	$20.0	—	—	3K	Audio
Speech 02 Turbo	$7.50	—	—	3K	Audio
Speech 02 HD	$15.0	—	—	3K	Audio
MiniMax M2.7	$0.300	$1.20	—	205K	—
MiniMax M2.5	$0.300	$1.20	100t/s	1.0M	Vision
MiniMax M2.5	$0.300	$1.20	100t/s	1.0M	—
MiniMax M2.1	$0.300	$1.20	100t/s	1.0M	—
MiniMax M2	$0.300	$1.20	70t/s	1.0M	—

MiniMax: API pricing, performance & models

Catalog

MiniMaxpricing, performance & catalog

Most affordable

Fastest

Largest context

FAQ