Accurate VRAM calculator for Local LLMs (Llama 4, DeepSeek V3, Qwen 2.5). Calculates GGUF quantization, GQA context overhead, and offloading limits
-
Updated
Nov 27, 2025 - HTML
Accurate VRAM calculator for Local LLMs (Llama 4, DeepSeek V3, Qwen 2.5). Calculates GGUF quantization, GQA context overhead, and offloading limits
Add a description, image, and links to the gpu-calculator topic page so that developers can more easily learn about it.
To associate your repository with the gpu-calculator topic, visit your repo's landing page and select "manage topics."