Artificial intelligence is currently entering a new phase of global competition. For several years, the most advanced models were developed primarily by OpenAI, Anthropic, Google, and Meta. But the rise of Chinese players is significantly accelerating the sector’s momentum. Following DeepSeek, Qwen, and Baichuan, MiniMax is now drawing attention with M3, an Open Weight model touted as one of the most ambitious ever developed.
Officially unveiled on June 1, 2026, M3 combines several features rarely found together in a single system: a context window capable of holding up to one million tokens, a native multimodal architecture, advanced programming capabilities, and a design optimized for agent-based AI systems. The goal is clear: to offer an open-source alternative capable of competing with the best proprietary models on the market.
If the announced promises are fulfilled, M3 could mark a significant milestone in the evolution of open models and further accelerate the widespread adoption of advanced artificial intelligence.
Open Weight models are becoming a strategic priority
The year 2026 confirms a trend that has been observed for several years: the rise of open models. Long considered less powerful than their proprietary counterparts, they are gradually closing the gap thanks to improvements in architecture and increased computing power.
This trend addresses specific business needs. Many organizations now want to maintain greater control over their infrastructure, data, and operating costs. According to IDC, global spending on artificial intelligence is expected to exceed $500 billion by the end of the decade1. A growing share of these investments is specifically directed toward Open Weight models and platforms that enable flexible deployment on private or hybrid infrastructures.
In this context, MiniMax aims to position itself as a player capable of delivering top-tier performance while retaining the benefits of an open approach.
Architecture Designed for Large-Scale Settings
One of M3’s most impressive features is its ability to process massive amounts of data. The model supports a context window of up to one million tokens, with a guaranteed minimum of 512,000 tokens2.
This capability immediately places M3 among the most advanced models on the market in this field. By way of comparison, a window of one million tokens allows for the simultaneous analysis of several hundred pages of technical documentation, large codebases, or complex business data sets.
This trend is of particular interest to sectors related to data, data engineering, data analysis, and data management. Companies today are handling ever-increasing volumes of information and are seeking models capable of maintaining a broad context for complex projects.
For agent-based AI systems, this capability is also essential. The more relevant information a model can keep in memory, the better it becomes at effectively coordinating complex tasks over long periods of time.
Technology designed to reduce computing costs
The main change introduced by M3 lies in its internal architecture. MiniMax has reintroduced a technology called MiniMax Sparse Attention (MSA), an approach that allows the model to select only the information that is truly relevant before focusing its computational resources on it.
Unlike traditional architectures that analyze the entire context uniformly, this approach significantly reduces the computational load. According to MiniMax, the computational cost per token is up to twenty times lower when processing a context of one million tokens2.
The company also claims to have significantly improved processing performance. Analysis times are reportedly several times faster than those observed in the previous generation, while the generation speed is said to reach nearly 100 tokens per second.
At a time when inference costs are becoming a major challenge for businesses, these optimizations could provide a significant competitive advantage.
M3 Goes Head-to-Head with Global AI Leaders
The launch of the M3 comes at a time when the market is extremely competitive, with a few major models still dominating the market.
| Model | Maximum Pop-up Window | Type |
|---|---|---|
| M3 (MiniMax) | 1 million tokens | Open Weight |
| GPT-5.5 | 512,000 tokens | Owner |
| Claude Opus 4.7 | 500,000 tokens | Owner |
| Gemini 3.1 Pro | 1 million tokens | Owner |
| Qwen3.7-Max | 1 million tokens | Open Weight |
| Llama 4 Maverick | Up to 10 million tokens (hierarchical architecture) | Open Weight |
This comparison shows that M3 is positioned squarely in the most advanced segment of the market. It joins a select group of models capable of handling massive contexts while maintaining high performance.
The competition is also playing out in the professional sphere. According to GitHub, more than 92% of developers now use artificial intelligence tools in their daily work3. Programming performance is therefore becoming an increasingly critical factor.
Benchmarks that directly target OpenAI and Anthropic
MiniMax particularly highlights the results achieved by M3 in several specialized benchmarks.
On SWE-Bench Pro, a benchmark for evaluating the resolution of real-world software problems, the model reportedly achieved 59%2. This level of performance would put it on par with the best systems currently available.
The company also claims to achieve excellent results on benchmarks focused on autonomous agents, tool usage, web browsing, and advanced programming.
These figures should, of course, be interpreted with caution. As is often the case in the industry, some of the evaluations were conducted on MiniMax’s own infrastructure. Independent validation will be necessary to confirm this performance in a variety of environments.
Nevertheless, initial results suggest that M3 could indeed join the ranks of the best-performing models currently available.
Agent-Based AI at the Heart of the MiniMax Strategy
Beyond raw performance, M3 was designed to power a new generation of autonomous systems.
In particular, the model is integrated into MiniMax Code, an agent specialized in software development that can break down complex projects into several steps, verify its own results, and use various tools to perform advanced tasks.
This direction aligns perfectly with current market trends. Artificial intelligence is no longer limited to generating content. It is gradually becoming capable of taking action, planning, using applications, and coordinating entire workflows.
According to Gartner, nearly one-third of business applications will incorporate agent-based AI mechanisms by 20284. This transformation explains why research labs are investing heavily in models capable of powering these future intelligent agents.
A move that could accelerate adoption
One of the most eagerly anticipated developments is the release of the model’s weight specifications. MiniMax has confirmed that M3 will adopt an Open Weight strategy, with technical documentation and weight specifications being made available gradually on the major development platforms.
This decision could encourage rapid adoption among developers, researchers, and companies looking to customize the model to suit their own needs.
At a time when OpenAI, Anthropic, and Google continue to rely on largely proprietary approaches, this openness represents a significant strategic advantage. It allows organizations to deploy their own infrastructure while retaining greater control over their data and processes.
A New Milestone in the Global AI Competition
With M3, MiniMax isn't just looking to launch a new model. The company wants to demonstrate that Open Weight models can now compete with the best proprietary AI systems on the market.
This trend also illustrates the intensifying global competition in artificial intelligence. Chinese research labs are making rapid progress and gradually closing the gap with the American leaders.
In the coming years, the battle will likely no longer be fought solely on the basis of a model’s raw power. Criteria such as technological sovereignty, operating costs, agent-based AI, data management, and deployment flexibility will become just as important.
If the announced performance figures are confirmed, M3 could become one of the most influential open models of its generation and further strengthen the current momentum surrounding open artificial intelligence.
How does MiniMax's M3 work?
M3 is based on a multimodal language model architecture designed to handle extremely long contexts while optimizing the use of computational resources. Developed by MiniMax, this Open Weight model combines language understanding, advanced reasoning, programming, document analysis, and agent-based capabilities within a single system. Unlike previous generations of models, which see their performance decline as the amount of information increases significantly, M3 was designed to maintain high efficiency on contexts of up to one million tokens.
The system is based on a technology called MiniMax Sparse Attention (MSA). This architecture uses a selective attention mechanism that identifies the most relevant information before performing the complex calculations required for reasoning. Rather than analyzing the entire context with the same intensity, the model focuses its resources on the most important elements. This approach reduces computational costs while maintaining a high level of accuracy on complex tasks.
Thanks to this architecture, M3 can simultaneously process large codebases, voluminous documents, structured data, and multi-step projects. This capability makes it particularly well-suited for applications related to agent-based AI, software development, data, and the automation of complex processes.
- Giant pop-up window: Supports up to 1 million tokens for analyzing massive amounts of data
- Sparse Attention Architecture: Optimizing Computations Through Intelligent Selection of Relevant Information
- Multimodal capabilities: processing text, images, and various data formats
- Advanced Programming: Code Generation, Analysis, and Debugging for Complex Projects
- Multi-step reasoning: problem-solving that requires multiple levels of analysis
- Compatibility with AI agents: native integration with agent-based AI systems capable of using tools and executing workflows
- Open Weight Model: can be adapted, customized, and deployed across different infrastructures
- Independent validation is still limited for certain advanced benchmarks
- Significant material resources are required to fully exploit very large contexts
- Performance may vary depending on the quality of the data and instructions provided
- Potentially high infrastructure costs for large-scale deployments
- Risk of hallucinations remains despite improvements in reasoning
- The Need for Human Oversight for Critical or Regulated Uses
Learn more
The launch of MiniMax’s M3 confirms the rise of open models and alternative strategies in response to major proprietary models. On a related topic, check out our article “Mistral Joins the Ranks of the Giants: 1.7 Billion Euros Raised for Sovereign AI”, which analyzes how European players are seeking to strengthen their position in the global race for artificial intelligence, amid technological sovereignty, massive funding, and competition with American and Chinese giants.
References
1. IDC. (2025). Worldwide Artificial Intelligence Spending Guide.
https://www.idc.com
2. MiniMax. (2026). M3 Technical Release and Benchmark Overview.
https://www.minimax.io
3. GitHub. (2025). State of AI in Software Development.
https://github.blog
4. Gartner. (2025). Agentic AI Forecast.
https://www.gartner.com

