← Back to Models
Zhang Peiyuantransformer
TinyLlama 1.1B
1.1B parameters • 0.6GB VRAM (Q4) • 2,048 context
Specifications
Parameters
1.1B
VRAM (Q4)
0.6 GB
VRAM (FP16)
2.2 GB
Context Window
2,048
Architecture
transformer
License
Apache-2.0
Find the cheapest GPU that can run TinyLlama 1.1B
Compatibility Lab — check every GPU × quantization combination