ML//TPU

2026-02-05

Google's custom ASIC designed for ML training and inference at datacenter scale.

Google's custom ASIC designed for ML training and inference at datacenter scale.

Uses a systolic array architecture to stream matrix operations with minimal memory access.

TPU v4 pods interconnect thousands of chips via custom high-bandwidth networks.

Exposed as cloud instances (Google Cloud TPU)

Represents the extreme end of ML hardware specialization.

Math embedded as silicon; both TPU and NPU are ASICs optimized for tensor multiplication.