← Back to Models
nvidiatransformer
Llama 3.1 Nemotron 70B Instruct
70B parameters • 38.5GB VRAM (Q4) • 8,192 context
Specifications
Parameters
70B
VRAM (Q4)
38.5 GB
VRAM (FP16)
140 GB
Context Window
8,192
Architecture
transformer
License
Apache-2.0
Find the cheapest GPU that can run Llama 3.1 Nemotron 70B Instruct
Compatibility Lab — check every GPU × quantization combination