Calculate exactly how much GPU memory (VRAM) you need to run any AI model locally. Supports 280+ models at FP16, Q8, Q4, and other quantization levels.
Calculate how much GPU memory you need to run AI models locally. Supports all quantization levels.
VRAM estimates are approximate. Actual usage varies by model architecture, batch size, and runtime.
For MoE models (Mixtral, DeepSeek), only active parameters are loaded — actual VRAM may be lower than total parameter count suggests.