AWS EC2 P5 Instance (8x H100) Can Run Gemma 2 2B

The AWS EC2 P5 Instance (8x H100) packs 640GB VRAM. Gemma 2 2B needs ~1.1GB. You're good to go.

✅ Plenty of headroom

Hardware Specs

GPU
AWS EC2 P5 Instance (8x H100)
VRAM
640 GB
Bandwidth
26400 GB/s
MSRP
$0

Model Requirements

Model
Gemma 2 2B
Parameters
2B
VRAM (Q4)
~1.1 GB
Context
N/A

Estimated Performance

19800 tok/s
Estimated inference speed (Q4 quantization)
0