GPU
A massively parallel processor — optimized for throughput over latency.
Thousands of simple cores executing the same instruction on different data (SIMD/SIMT)
Originally designed for rendering, now critical for ML training and inference.
Available as discrete add-in cards or integrated units sharing the SoC die with the CPU
Two main uses: (1) rendering and (2) AI training and inference.
NVIDIA RTX 3090/4090: discrete GPUs for desktop PCs (add-in card with its own PCB and fans, fits inside a tower)
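
Sketch of the "same instruction, different data" idea from the SIMD/SIMT line above. This is a plain-Python toy model, not GPU code: the names saxpy_kernel and launch are hypothetical, and the loop only stands in for threads that real hardware would run in parallel.

```python
# Toy model of SIMT: one kernel body, executed once per thread index.
# Assumption: a sequential loop stands in for the GPU's parallel threads.

def saxpy_kernel(thread_id, a, x, y, out):
    """Work done by a single thread: one element of out = a * x + y."""
    # Bounds guard: more threads may be launched than there are elements.
    if thread_id < len(out):
        out[thread_id] = a * x[thread_id] + y[thread_id]

def launch(kernel, num_threads, *args):
    """Hypothetical launcher: runs the same kernel for every thread id."""
    for thread_id in range(num_threads):
        kernel(thread_id, *args)

x = [1.0, 2.0, 3.0, 4.0]
y = [10.0, 20.0, 30.0, 40.0]
out = [0.0] * len(x)

# 8 threads for 4 elements: the extra threads fail the guard and do nothing.
launch(saxpy_kernel, 8, 2.0, x, y, out)
print(out)  # [12.0, 24.0, 36.0, 48.0]
```

Every thread runs identical code and differs only in its index, which is why this style of workload maps well onto thousands of simple cores.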