AIMPACT News, May 16 (UTC+8), Google disclosed the architecture details of the eighth-generation TPU (TPU 8t) rack-level connection to the Virgo network. The network uses high-radix switches and a flat two-layer non-blocking topology, increasing data center network bandwidth by four times compared to the previous generation, with a single structure capable of connecting over 134k TPU 8t chips, providing 47 Pb/s of non-blocking bidirectional bandwidth and nearly linear scaling performance of over 1.7K ExaFlops. The TPU 8t itself adopts a 3D torus topology, with a single super pod scalable to 9,600 chips, and supports expansion to over one million chips via JAX and Pathways. Key technologies include SparseCore accelerators, VPU/MXU overlap and balanced scaling, native FP4 support, and integrated Arm-based Axion CPUs to eliminate host bottlenecks. This design addresses the evolution of AI models from dense large language models to large-scale mixture-of-experts models and inference-intensive architectures. (Source: InFoQ)

View Original

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.

9 Likes

Reward
9
9
2
Share

Comment

Add a comment

GateUser-bf5d0c14

· 1h ago

Rack interconnection using Virgo, flat two-layer non-blocking, this engineering detail is quite hardcore

View OriginalReply0

BlackVelvetKey

· 1h ago

SparseCore and FP4 native support, how much can training costs be reduced?

View OriginalReply0

AuroraSnowyWildernessSolitary

· 2h ago

47Pb/s non-blocking bandwidth is indeed outrageous; Google's network topology design must be aiming for AGI, right?

View OriginalReply0

GateUser-f78f1f3e

· 4h ago

Million-chip expansion, JAX/Pathways ecosystem binding is too deep in this wave

View OriginalReply0

CatUnderTheNeonBridge

· 5h ago

134,000 chip units single-structure—when AWS saw this scale, it went silent; when Azure saw it, it welled up with tears.

View OriginalReply0

AirdropJanitor

· 5h ago

Arm Axion CPU integration, TPU is also heading towards the SoC route

View OriginalReply0

AprDaydream

· 5h ago

MoE architecture-specific optimization, it looks like Gemini's next generation is going to do something big

View OriginalReply0

AirdropArchivist

· 5h ago

VPU/MXU Overlap Balancing, Google finally figured out how to schedule the compute units properly

View OriginalReply0

QuantitativeButNotPretentious

· 5h ago

Single super pod 9600 chip, how is the heat dissipation handled? Curious.

View OriginalReply0

Trending Topics
View More
#
StockTradingChallengeUpTo17000U
16.23M Popularity
#
TrumpBacksCFTCAuthorityOverPredictionMarkets
816.83K Popularity
#
GatePredictionMarketAddsSmartMoneyTracking
13.2M Popularity
#
MicronMarketCapBreaks1Trillion
36.58K Popularity
#
TradeCFDWinGold
3.08M Popularity

Pinned

Sitemap

Google releases details of the eighth-generation TPU 8t rack-scale network architecture

Trending Topics

StockTradingChallengeUpTo17000U

TrumpBacksCFTCAuthorityOverPredictionMarkets

GatePredictionMarketAddsSmartMoneyTracking

MicronMarketCapBreaks1Trillion

TradeCFDWinGold

Pinned