TPU

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2020 May 17 9:25
Editor
Edited
Edited
2025 Jul 6 18:29
Refs
Refs
An
ASIC
developed by Google, optimized for large-scale matrix multiplication processing and energy efficiency.
  • Systolic Array + Pipelining to minimize memory access
  • Ahead-of-Time compilation (XLA) to predetermine memory access patterns, utilizing scratchpads instead of caches
TPU Versions
 
 
 
https://jax-ml.github.io/scaling-book/
https://jax-ml.github.io/scaling-book/tpus/
https://jax-ml.github.io/scaling-book/tpus/
https://jax-ml.github.io/scaling-book/tpus/
Compute primitives and memory layouts for different hardware backends. Image by Chen et al., 2018.
Compute primitives and memory layouts for different hardware backends. Image by Chen et al., 2018.
https://jax-ml.github.io/scaling-book/tpus/
 
 
 
 
 

Recommendations