Specifications
Parameters
175B
VRAM (Q4)
96.3 GB
VRAM (FP16)
350 GB
Context Window
128,000
Architecture
transformer
License
Commercial API
Find the cheapest GPU that can run GPT-3.5 Turbo
Compatibility Lab — check every GPU × quantization combination