Xiaomi and MiniMax both unleash their ultimate moves, signaling the start of the Agent Pricing War.

By: blockbeats|2026/03/20 13:00:01

vibe

codex

opus-2

On March 18 and 19, two Chinese companies successively released their respective Agent-oriented large models. The domestic AI startup MiniMax launched M2.7, and Xiaomi's large model team MiMo introduced V2-Pro. Both models entered the global top tier in the Agent benchmark, but their API output pricing is 1/21 and 1/8 of Claude Opus 4.6, respectively.

Both companies played their cards in the same week, but with completely different hands. They represent two completely different technical paths, betting on two futures of the Agent era.

Same Exam, 1/17 Tuition Fee

First, let's look at the most intuitive comparison.

Xiaomi and MiniMax both unleash their ultimate moves, signaling the start of the Agent Pricing War.

According to OpenRouter and various company official pricing pages, based on API output price (per million tokens), MiniMax M2.7 is $1.2, and MiMo-V2-Pro is $3. As a reference, the output price for Claude Opus 4.6 is $25, GPT-5.2 is $14, and Claude Sonnet 4.6 is $15.

The price difference is an order of magnitude, but the performance difference is not. In SWE-bench Verified (the current mainstream benchmark for measuring code engineering capability), MiMo-V2-Pro scored 78%, Sonnet 4.6 was 79.6%, a difference of less than two percentage points. M2.7's SWE-Pro score is 56.22%, on par with GPT-5.3-Codex. In VIBE-Pro (end-to-end project delivery capability), M2.7 scored 55.6%, approaching the level of Opus 4.6.

The focus of this chart is not on who is higher or lower—the benchmark systems of various companies are not entirely aligned, so direct comparisons should be cautious. The focus is on the "price-performance scissor difference": domestic Agent models have squeezed into the same performance band, but are in completely different price ranges.

Trillion Parameters vs. Self-evolution

Price is just the surface. The two companies have presented two completely different sets of trump cards.

MiMo-V2-Pro follows the "go big or go home" route. According to Xiaomi's official announcement, V2-Pro has over 1 trillion total parameters, 42B activation parameters, and supports an ultra-long context of 1 million tokens. Its core innovation is the Hybrid Attention mixed attention mechanism, adjusting the ratio of Sliding Window Attention (SWA) to Global Attention (GA) to 7:1—its predecessor V2-Flash was 5:1. This architecture makes the model more stable in scenarios where long documents are processed and multiple tool parallel calls in the Agent scene. In PinchBench (Agent tool invocation capability assessment), MiMo-V2-Pro scored 84%.

M2.7 took a completely different path. According to MiniMax's official tech blog post on March 18, M2.7's parameter count was not disclosed, but it demonstrated a "self-iterative evolution" mechanism: the model autonomously ran over 100 optimization loops, including analyzing failure trajectories, planning modifications, modifying its own code architecture, running evaluations, and looping again, ultimately achieving a 30% performance improvement on an internal evaluation set. In the MLE Bench Lite (Machine Learning Contest Difficulty Assessment), out of 22 challenging problems, M2.7 secured 9 gold, 5 silver, and 1 bronze, with an average medal rate of 66.6%.

From five dimensions, the two paths are aimed in completely different directions: MiMo-V2-Pro clearly dominates in context length and code engineering dimensions, while M2.7 widens the gap in office automation and self-iterative capability. According to MiniMax's same tech blog post, M2.7 scored ELO 1495 on GDPval-AA (Office Document Processing Evaluation), ranking first among open-source models, and maintained a 97% skill compliance rate in the MM-Claw test covering over 40 complex skills.

Four Versions in Five Months

Not only are the technical paths of the two companies different, but their iteration rhythms are also completely different.

According to public release records, from the release of M2 in October 2025 to the release of M2.7 in March 2026, MiniMax iterated four versions within five months, averaging a major version every 49 days. The gap between M2.5 and M2.7 was only about 30 days.

The rhythm of Xiaomi's MiMo is different: MiMo-7B was released in April 2025 (an open-source inference model with 7B parameters), V2-Flash was released in December of the same year (with 309B total parameters), and V2-Pro was released in March 2026 (with 1T total parameters). The parameter scale between each generation is much larger, but the intervals between versions are also longer.

MiniMax chose small, frequent steps, with each iteration not making big leaps but at a very high frequency. M2.7's self-iterative mechanism itself is designed for "continuous evolution." Xiaomi opted for a more impactful approach, with each version featuring significant changes in parameter scale and architecture.

-- Price

Anonymous 8 Days, Summit OpenRouter

In addition to the technical roadmap, Xiaomi's release strategy has also broken industry conventions.

According to Reuters, on March 11, an anonymous model named Hunter Alpha appeared on the world's largest API aggregation platform, OpenRouter. No brand endorsement, no product launch event, no technical blog. Its API pricing was extremely low, yet its performance was surprisingly strong.

The community began to speculate about its origins. According to Republic World and several tech media reports, the most mainstream speculation was DeepSeek V4, as MiMo team leader Luo Fuli had previously worked on research at DeepSeek. The number of API calls quickly skyrocketed, with the total number of calls during the anonymous period exceeding 1 trillion tokens, reaching the top of the OpenRouter weekly rankings.

Early on March 19, Xiaomi revealed: Hunter Alpha is indeed MiMo-V2-Pro. According to the same Reuters report, Xiaomi's Hong Kong stock once surged by 5.8% after the revelation.

This is the first time a domestic large-scale model has proven itself on a global platform through purely blind testing. Not relying on the brand, not relying on publicity, it took 8 days to let developers vote with their feet.

Naval personally stepped in as the chairman of the USVC Investment Committee. This SEC-registered fund launched by AngelList attempts to bring top private tech assets like OpenAI, Anthropic, and xAI to the general public with a $500 entry threshold. It is not just a new fund, but a structural experi...

a16z Crypto: 9 Charts to Understand the Evolution Trends of Stablecoins

Stablecoins are evolving from trading tools into universal payment infrastructure, and this process is quieter and more thorough than most people expected.

Refutation of Yang Haipo's "The End of Cryptocurrency"

This may be the true test of cryptocurrency. It's not about whether the price has reached a new high, nor about who will achieve financial freedom in the next bull market, but rather whether, after all the grand narratives have been washed away by cycles, it can still leave behind some simpler, more...

Can a hairdryer earn $34,000? Interpreting the reflexivity paradox of prediction markets

Prediction markets are essentially betting on reality, and when participants can access or even influence this path earlier, the market no longer just reflects reality but begins to shape it in return.

6MV Founder: In 2026, the "landmark turning point" for crypto investment has arrived

"I will deploy funds in 2026, so I will tell you this is the best year in history."

Abraxas Capital Mints $2.89 Billion USDT: Liquidity Boost or Just More Stablecoin Arbitrage?

Abraxas Capital just received $2.89 billion in freshly minted USDT from Tether. Is this a bullish liquidity injection for crypto markets, or is it business as usual for a stablecoin arbitrage giant? We analyze the data and the likely impact on Bitcoin, altcoins, and DeFi.

A VC from the Crypto world said AI is too crazy, and they are very conservative

Amid the Crypto frenzy and with investors who once missed out on Pinduoduo, a new AI fund called Impa Ventures was established, rejecting bubble narratives and adhering to a conservative "problem-first" strategy to seek real business value.

The Evolutionary History of Contract Algorithms: A Decade of Perpetual Contracts, the Curtain Has Yet to Fall

The ten-year evolution of perpetual contracts: from pulling the plug on 312 to the shocking short squeeze of TRB, a deep dive into the pricing machine that averages $200 billion daily, written with countless liquidations and real money, detailing the blood and tears of risk control theory.

Kicked out by PayPal, Musk aims to make a comeback in the cryptocurrency market

Cashtags generated a trading volume of 1 billion dollars just a few days after its launch, marking a strong start for Musk's super app strategy. For the cryptocurrency market, X's layout may be one of the most anticipated sources of retail growth after the meme coin craze subsides.

Solana ETF News: What Is a Solana ETF and Why Is Goldman Sachs Betting $108 Million on SOL?

Solana ETF news today shows Goldman Sachs disclosed a $108M position while total SOL ETF inflows reached $1.45B. Analysts now expect up to $6B in institutional demand as Solana trades 71% below its all-time high.

Bitcoin ETF News Today: $2.1B Inflows Signal Strong Institutional Demand for BTC

Bitcoin ETFs news recorded $2.1B inflows over 8 consecutive days, marking one of the strongest recent accumulation streaks. Here’s what the latest Bitcoin ETF news means for BTC price and whether the $80K breakout level is next.

Michael Saylor: Winter is Over – Is He Right? 5 Key Data Points (2026)

Michael Saylor tweeted yesterday “Winter‘s Over.” It is short. It is bold. And it has the crypto world talking.

But is he right? Or is this just another CEO pumping his bags?

Let us look at the data. Let us be neutral. Let us see if the ice has really melted.

WEEX Bubbles App Now Live Visualizes the Crypto Market at a Glance

WEEX Bubbles is a standalone app designed to help users quickly understand complex crypto market movements through an intuitive bubble visualization.

Polygon co-founder Sandeep: Writing after the chain bridge chain explosion

In three weeks, Drift, Hyperbridge, and KelpDAO were consecutively hacked, resulting in nearly $900 million in losses. Polygon's CEO wrote that the problem lies not with any single team, but with the "notary" style architecture shared by the entire industry—relying on one or two signers to stamp cro...

Major Upgrade on Web: 10+ Advanced Chart Styles for Deeper Market Insights

To deliver more powerful and professional analysis tools, WEEX has rolled out a major upgrade to its web trading charts—now supporting up to 14 advanced chart styles.

Morning Report | Aethir secures a $260 million enterprise contract with Axe Compute; New Fire Technology acquires Avenir Group's trading team; Polymarket's trading volume surpassed by Kalshi

Overview of Important Market Events on April 23

Why a Million-Follower Crypto KOL Chooses WEEX VIP?

Discover why top crypto KOL Carl Moon partnered with WEEX. Explore the WEEX VIP ecosystem, 1,000 BTC protection fund, and exclusive rewards for serious traders.

CoinEx Founder: The Crypto Endgame in My Eyes

The industry will not disappear, but it will shrink significantly.