The A800 PCIe 40 GB has 40 GB of VRAM, and Llama 3.1 70B needs ~38.5 GB. It'll work, but that leaves only about 1.5 GB for the KV cache and runtime buffers, so don't expect to load much context.
ollama run llama3.1:70b
Requires Ollama to be installed on your machine.
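To put a rough number on "not much context", the headroom arithmetic can be sketched as below. This is a back-of-the-envelope estimate, not a measurement. The 40 GB and 38.5 GB figures come from the text above. The attention shape (80 layers, 8 KV heads, head dimension 128), the fp16 KV cache, and treating GB as 10^9 bytes are assumptions added here for illustration. Real runtimes also reserve memory for compute buffers, so the usable context is likely smaller.

```python
# Back-of-the-envelope context estimate. Architecture numbers are assumptions.
VRAM_GB = 40.0    # A800 PCIe 40 GB (from the text)
MODEL_GB = 38.5   # Llama 3.1 70B footprint (from the text)

# Assumed Llama 3.1 70B attention shape and cache precision.
N_LAYERS = 80
N_KV_HEADS = 8
HEAD_DIM = 128
BYTES_PER_VALUE = 2  # fp16 KV cache (assumption)


def headroom_gb(vram_gb: float, model_gb: float) -> float:
    """Memory left after the weights are loaded, in GB."""
    return vram_gb - model_gb


def kv_bytes_per_token(layers: int, kv_heads: int, head_dim: int, bytes_per_value: int) -> int:
    """Bytes of KV cache per token: one K and one V vector per layer per KV head."""
    return 2 * layers * kv_heads * head_dim * bytes_per_value


def max_context_tokens(free_gb: float, per_token_bytes: int) -> int:
    """Upper bound on cached tokens if all free memory went to the KV cache."""
    return int(free_gb * 1e9 // per_token_bytes)


free = headroom_gb(VRAM_GB, MODEL_GB)
per_token = kv_bytes_per_token(N_LAYERS, N_KV_HEADS, HEAD_DIM, BYTES_PER_VALUE)
print(f"headroom: {free:.1f} GB")
print(f"KV cache per token: {per_token} bytes")
print(f"upper bound on context: {max_context_tokens(free, per_token)} tokens")
```

Under these assumptions the ceiling is a few thousand tokens, which is consistent with the verdict above. A quantized KV cache or a smaller context setting would change the numbers.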