Google Cloud A3 (8x H100) Can Run Phi-3 Small 7B

The Google Cloud A3 (8x H100) provides 640 GB of VRAM in total (8 x 80 GB). Phi-3 Small 7B needs about 3.9 GB at Q4 quantization, so it fits with room to spare.

✅ Plenty of headroom
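
The fit check above is simple arithmetic. A minimal sketch, using the page's two figures (640 GB available, ~3.9 GB required); the function name `vram_headroom` is hypothetical and not part of any tool referenced here:

```python
def vram_headroom(available_gb: float, required_gb: float) -> tuple[bool, float]:
    """Return (fits, headroom_ratio) for a model on a given accelerator.

    headroom_ratio is how many times over the requirement the available
    VRAM covers; values above 1.0 mean the model fits.
    """
    if required_gb <= 0:
        raise ValueError("required_gb must be positive")
    return required_gb <= available_gb, available_gb / required_gb


# Figures taken from this page: 640 GB total VRAM, ~3.9 GB needed at Q4.
fits, ratio = vram_headroom(available_gb=640, required_gb=3.9)
print(fits, round(ratio))  # True 164
```

This only compares weight memory against total VRAM. It ignores KV cache and activation memory, which grow with context length and batch size.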

Hardware Specs

GPU: Google Cloud A3 (8x H100)
VRAM: 640 GB
Bandwidth: 26,400 GB/s
MSRP: N/A (cloud instance; listed as $0)

Model Requirements

Model: Phi-3 Small 7B
Parameters: 7B
VRAM (Q4): ~3.9 GB
Context: N/A
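
The page does not say how the ~3.9 GB figure was computed. One common rule of thumb, sketched here as an assumption, is parameters times bits per weight, plus a small overhead for quantization scales and runtime buffers. The 10% overhead factor and the function name `estimate_weight_vram_gb` are illustrative choices, not values from the source:

```python
def estimate_weight_vram_gb(
    params: float,
    bits_per_weight: float,
    overhead: float = 1.1,  # assumed ~10% overhead; not stated on the page
) -> float:
    """Rough VRAM estimate (in GB, 1 GB = 1e9 bytes) for model weights."""
    weight_bytes = params * bits_per_weight / 8
    return weight_bytes * overhead / 1e9


# 7B parameters at 4 bits per weight (Q4), as listed on this page.
est = estimate_weight_vram_gb(params=7e9, bits_per_weight=4)
print(f"{est:.2f} GB")  # 3.85 GB
```

Under these assumptions the estimate lands at about 3.85 GB, in line with the ~3.9 GB listed above.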

Estimated Performance

5,657 tok/s estimated inference speed (Q4 quantization)
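
The page does not explain how the throughput estimate was derived. A common back-of-the-envelope approach, assumed here, treats single-stream decoding as memory-bandwidth bound: each generated token requires reading all the weights once, so the ceiling is bandwidth divided by model size. The sketch below applies that to the page's figures and reports what fraction of the ceiling the listed 5657 tok/s represents. The function name `decode_ceiling_tok_s` is hypothetical, and using the full 26,400 GB/s assumes the bandwidth of all eight GPUs can be applied to one model, which is optimistic:

```python
def decode_ceiling_tok_s(bandwidth_gb_s: float, model_gb: float) -> float:
    """Upper bound on decode speed if every token reads all weights once."""
    if model_gb <= 0:
        raise ValueError("model_gb must be positive")
    return bandwidth_gb_s / model_gb


# Figures from this page: 26,400 GB/s bandwidth, ~3.9 GB of Q4 weights.
ceiling = decode_ceiling_tok_s(bandwidth_gb_s=26400, model_gb=3.9)

# Fraction of the ceiling implied by the page's listed 5657 tok/s.
implied_efficiency = 5657 / ceiling

print(f"ceiling: {ceiling:.0f} tok/s, implied efficiency: {implied_efficiency:.2f}")
```

Under this assumption the listed figure sits somewhat below the theoretical ceiling, which would be consistent with an efficiency factor being applied, though the source does not confirm that.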