B200 Can Run Llama 3.2 3B

The B200 has 192 GB of VRAM. Llama 3.2 3B needs about 1.7 GB at Q4 quantization, so it fits with room to spare.

✅ Plenty of headroom

Hardware Specs

GPU: B200
VRAM: 192 GB
Bandwidth: 3350 GB/s
MSRP: N/A

Model Requirements

Model: Llama 3.2 3B
Parameters: 3B
VRAM (Q4): ~1.7 GB
Context: N/A
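
The page does not say how the ~1.7 GB figure was derived. As a rough sketch, the weight footprint of a quantized model can be approximated as parameters times bits per weight. The 4.5 bits per weight used below is an assumption (4-bit values plus per-block scale overhead), and the function names are hypothetical. The estimate covers weights only and excludes the KV cache and activations.

```python
def q4_weight_gb(params_billion: float, bits_per_weight: float = 4.5) -> float:
    """Approximate weight footprint in GB (1 GB = 1e9 bytes).

    bits_per_weight=4.5 is an assumed effective rate for 4-bit
    quantization including scale overhead, not a figure from the page.
    """
    total_bits = params_billion * 1e9 * bits_per_weight
    return total_bits / 8 / 1e9


def headroom_gb(vram_gb: float, model_gb: float) -> float:
    """VRAM left over after loading the weights."""
    return vram_gb - model_gb


weights = q4_weight_gb(3.0)           # Llama 3.2 3B, listed as 3B parameters
print(round(weights, 1))              # about 1.7 GB, in line with the listed value
print(round(headroom_gb(192, 1.7), 1))  # GB free on a 192 GB card
```

Under these assumptions the weights occupy under 1% of the card's VRAM, which is why the fit check passes comfortably.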

Estimated Performance

Estimated inference speed (Q4 quantization): 1675 tok/s
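
The page does not state how this estimate is computed. One common rule of thumb treats single-stream token generation as memory-bandwidth bound: each generated token requires reading the full set of weights once, so throughput is roughly bandwidth divided by model size, scaled by an efficiency factor. The sketch below uses that rule. The 0.85 efficiency factor is an assumption, chosen because it reproduces the listed figure from the listed bandwidth and model size, and the function name is hypothetical.

```python
def estimate_tok_per_s(bandwidth_gb_s: float, model_gb: float,
                       efficiency: float = 0.85) -> float:
    """Bandwidth-bound decode estimate: one full weight read per token.

    efficiency=0.85 is an assumed fraction of peak bandwidth actually
    achieved, not a value documented by the page.
    """
    return bandwidth_gb_s / model_gb * efficiency


# Listed values: 3350 GB/s bandwidth, ~1.7 GB model at Q4
print(round(estimate_tok_per_s(3350, 1.7)))  # 1675
```

Real throughput depends on the inference engine, batch size, and context length, so a number from this rule is best read as an upper-range estimate rather than a benchmark.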