MiniMax unveils M3, an open-weights model touting coding-agentic gains and 1M context

In a thread on X, MiniMax cites 59.0% on SWE-Bench Pro and a new sparse attention scheme that scales context to 1M, with an API promo rolling out this week.

By · · updated

Why it matters

If the reported numbers hold, M3 pushes open-weights models further into practical coding and agent workflows while adding 1M context via sparse attention. That combination could let teams self-host or fine-tune a more capable agentic stack without closed-model lock-in. A short-term API discount signals MiniMax is angling for fast developer adoption, turning benchmark wins into early integrations.

A miniature scene depicting an artificial intelligence system actively processing and generating code, with elements symbolizing a vast context window. (Museum-diorama miniature with handcrafted figurines and painted backdrop, resembling pa

MiniMax announced M3 in a thread on X, casting it as the first open-weights model to combine strong coding and agentic performance with a 1M context window via its MiniMax Sparse Attention.

On headline benchmarks, MiniMax reports 59.0% on SWE-Bench Pro, 66.0% on Terminal Bench 2.1, 34.8% on SWE-fficiency, 28.8% on KernelBench Hard, and 74.2% on MCP Atlas. MiniMax says M3 scales context to 1M using a sparse attention approach and points to more details in a link.

MiniMax also outlined launch pricing: 50% off standard usage (<=512K context) for the first 7 days, with priority access available via [email protected] and self-serve access for all users coming in the next few days. The company shared promotion terms alongside the announcement.

For a limited time you can use the new model for free on OpenCode:

Reader comments

Conversation for this story loads after sign-in.