BTC $71,807
2026 Bull Run Is Building Start trading with 5% OFF all fees
Sign Up Now
BTC $71,807
Bull Run 2026 | 5% Off Fees Open your Binance account today
Sign Up

Google’s New Diffusion Model Hits 1,000 Tokens/Sec

Google's DiffusionGemma generates 256 token blocks via diffusion enabling blazing speed but current deployment hurdles persist

  • Google released DiffusionGemma today, an open-text AI generating entire 256-token blocks at once via text diffusion, hitting 1,000+ tokens per second on an NVIDIA H100.
  • This Apache 2.0-licensed model is four times faster than standard Gemma but trades quality for speed and currently lacks the drafter module needed for local inference on consumer setups.
  • The 256K context model is preconfigured for just 8,192 tokens on NVIDIA NIM, blocking its use with agentic frameworks like Hermes Agent that require a 64,000-token minimum.

Google launched DiffusionGemma today, a revolutionary open-weight AI model that generates text using the same diffusion process as image generators. The model achieves over 1,000 tokens per second on an NVIDIA H100, marking a speed breakthrough that is detailed in Google’s announcement.

- Advertisement -

Unlike autoregressive models that write token by token, it starts with noise and refines 256-token blocks in parallel. This approach provides bidirectional attention, allowing the beginning of a text to be influenced by its end.

Consequently, it excels at constrained tasks like code infilling and structured output. A fine-tuned version designed to solve Sudoku puzzles achieved an 80% success rate, a massive leap from the base model’s near-zero performance.

However, significant hurdles prevent immediate widespread adoption. Running DiffusionGemma locally requires a special drafter module that isn’t yet available in popular runtimes like mlx-lm or LM Studio.

Furthermore, the model launched on NVIDIA NIM with only 8,192 tokens of context by default. This is below the 64,000-token minimum required by frameworks like Hermes Agent, effectively blocking autonomous workflows.

- Advertisement -

The model is therefore aimed at developers building real-time tools on high-end NVIDIA hardware. Researchers are also intrigued by its potential for generating complex, interdependent sequences like protein structures.

Text diffusion has evolved from academic projects like LLaDA and Dream. DiffusionGemma represents the first major open release from a top-tier lab, building on the strategy of its predecessor, Gemma 4.

✅ Follow BITNEWSBOT on Telegram, Facebook, LinkedIn, X.com, and Google News for instant updates.

Previous Articles:

- Advertisement -
Ad
Altseason Is Loading. Don't watch from the sidelines.
SOL $90.51
DOGE $0.0963
LINK $9.02
SUI $1.00
5% off fees when you sign up
Start Trading
Ad
Pay Less on Every Trade. For Life.
$10K/mo volume Save $60/yr
$50K/mo volume Save $300/yr
$100K/mo volume Save $600/yr
5% off all trading fees when you sign up
Claim Your Discount

Latest News

OpenAI Sets IPO Goal, Preps “5.6” Model Release

OpenAI has submitted its IPO documents with the SEC and CEO Sam Altman told...

Saylor clarifies reposts weren’t endorsements as DeFi tokens slide

Michael Saylor clarified his social media reposts of DeFi projects were "notifications," not endorsements,...

Crypto Campaign Challenges UK Bank Transfer Restrictions

Stand With Crypto UK is mobilizing 286,000 members to protest UK bank restrictions on...

China-Linked JDY Botnet Expands, Infects 1,500 Devices

The JDY botnet, used by Chinese state-sponsored hacking groups like Volt Typhoon, has rapidly...

Mastercard Launches AI Payment Platform for Machine Transactions

Mastercard launched Agent Pay for Machines, a new platform enabling AI agents to autonomously...

Must Read

TOP 12 Day Trading Crypto Books For Beginners

Day trading cryptocurrencies has become an increasingly popular financial activity, offering the potential for huge returns to those who understand the market's complexities and...
Ad
Altseason Is Loading. These 4 coins are trending right now.
SOL $92.12
DOGE $0.0950
LINK $9.02
SUI $1.02
5% off spot fees when you sign up
Start Trading