GPU//architecture
- The internal design of a GPU — streaming multiprocessors (NVIDIA) or compute units (AMD) grouped into warps/wavefronts executing in lockstep.
The internal design of a GPU — streaming multiprocessors (NVIDIA) or compute units (AMD) grouped into warps/wavefronts executing in lockstep.
Memory hierarchy includes registers, shared memory, L1/L2 caches, and high-bandwidth GDDR or HBM.
Architecture choices determine throughput for graphics, ML training, and scientific workloads.
discrete cards push bleeding-edge architectures; integrated units share die area with the CPU