✅ — GB200 NVL72 Can Run Llama 3.1 8B
The GB200 NVL72 packs 13824GB VRAM. Llama 3.1 8B needs ~4.4GB. You're good to go.
✅ Plenty of headroom
Hardware Specs
- GPU
- GB200 NVL72
- VRAM
- 13824 GB
- Bandwidth
- 3350 GB/s
- MSRP
- $0
Model Requirements
- Model
- Llama 3.1 8B
- Parameters
- 8B
- VRAM (Q4)
- ~4.4 GB
- Context
- N/A
Estimated Performance
628 tok/s
Estimated inference speed (Q4 quantization)