GDDR7

GDDR7 uses PAM3 signaling to push per-pin rates to 28–32 Gbps in first-gen products (2025), with a path to 40+ Gbps. RTX 50 series adopts it: RTX 5090 hits 1.79 TB/s, RTX 5080 at 960 GB/s, RTX 5070 at 672 GB/s.

For local inference, GDDR7 is the largest single-generation bandwidth jump consumer cards have seen — 78% over the RTX 4090. That translates almost linearly into decode tok/s on memory-bandwidth-bound workloads.

Still not HBM territory (an H100 SXM does 3.35 TB/s) but closes the gap meaningfully for prosumer setups.

Related terms

See also