RAG Studio

Design retrieval-augmented generation systems

Sponsored byPinecone
Try Free

Embedding Model

Generation LLM

Document Count

Retrieval Settings

GPU

System Analysis

Total VRAM Needed
5.1 GB
Embed: 0.7GB + LLM: 4.4GB
Index Size0.14 GB
Total Chunks50K
Context Used2560 tokens
Context Free125440 tokens