BTC $71,807
2026 Bull Run Is Building Start trading with 5% OFF all fees
Sign Up Now
BTC $71,807
Bull Run 2026 | 5% Off Fees Open your Binance account today
Sign Up

Google’s New Diffusion Model Hits 1,000 Tokens/Sec

Google's DiffusionGemma generates 256 token blocks via diffusion enabling blazing speed but current deployment hurdles persist

  • Google released DiffusionGemma today, an open-text AI generating entire 256-token blocks at once via text diffusion, hitting 1,000+ tokens per second on an NVIDIA H100.
  • This Apache 2.0-licensed model is four times faster than standard Gemma but trades quality for speed and currently lacks the drafter module needed for local inference on consumer setups.
  • The 256K context model is preconfigured for just 8,192 tokens on NVIDIA NIM, blocking its use with agentic frameworks like Hermes Agent that require a 64,000-token minimum.

Google launched DiffusionGemma today, a revolutionary open-weight AI model that generates text using the same diffusion process as image generators. The model achieves over 1,000 tokens per second on an NVIDIA H100, marking a speed breakthrough that is detailed in Google’s announcement.

- Advertisement -

Unlike autoregressive models that write token by token, it starts with noise and refines 256-token blocks in parallel. This approach provides bidirectional attention, allowing the beginning of a text to be influenced by its end.

Consequently, it excels at constrained tasks like code infilling and structured output. A fine-tuned version designed to solve Sudoku puzzles achieved an 80% success rate, a massive leap from the base model’s near-zero performance.

However, significant hurdles prevent immediate widespread adoption. Running DiffusionGemma locally requires a special drafter module that isn’t yet available in popular runtimes like mlx-lm or LM Studio.

Furthermore, the model launched on NVIDIA NIM with only 8,192 tokens of context by default. This is below the 64,000-token minimum required by frameworks like Hermes Agent, effectively blocking autonomous workflows.

- Advertisement -

The model is therefore aimed at developers building real-time tools on high-end NVIDIA hardware. Researchers are also intrigued by its potential for generating complex, interdependent sequences like protein structures.

Text diffusion has evolved from academic projects like LLaDA and Dream. DiffusionGemma represents the first major open release from a top-tier lab, building on the strategy of its predecessor, Gemma 4.

✅ Follow BITNEWSBOT on Telegram, Facebook, LinkedIn, X.com, and Google News for instant updates.

Previous Articles:

- Advertisement -
Ad
Altseason Is Loading. Don't watch from the sidelines.
SOL $90.51
DOGE $0.0963
LINK $9.02
SUI $1.00
5% off fees when you sign up
Start Trading
Ad
Pay Less on Every Trade. For Life.
$10K/mo volume Save $60/yr
$50K/mo volume Save $300/yr
$100K/mo volume Save $600/yr
5% off all trading fees when you sign up
Claim Your Discount

Latest News

Robinhood AI sets Guinness World Record

Robinhood set a new Guinness World Record for the most items purchased by an...

CISA Flags Actively Exploited Microsoft SharePoint Flaw

The U.S. CISA has flagged a high-severity Microsoft SharePoint flaw, CVE-2026-45659, as actively exploited,...

2026 Stock Outlook Bullish on Strong Earnings, AI Boom

The S&P 500 is up over 7% through late June 2026, with the second-half...

Robinhood expands to Europe with leveraged futures

Robinhood is expanding its European derivatives, offering perpetual futures on traditional assets like commodities...

Unpatched Argo CD flaw risks full Kubernetes takeover

An unpatched flaw in Argo CD's repo-server component allows for unauthenticated remote code execution...

Must Read

What Is Binance Earn?

As someone who is passionate about cryptocurrency, I am always on the lookout for new opportunities to grow my portfolio. That's why I was...
Ad
Altseason Is Loading. These 4 coins are trending right now.
SOL $92.12
DOGE $0.0950
LINK $9.02
SUI $1.02
5% off spot fees when you sign up
Start Trading