We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
0 parents commit 450c44fCopy full SHA for 450c44f
README.md
@@ -0,0 +1,12 @@
1
+# llama-cpp-python GPU Wheel for Python 3.14 (CUDA 13.1)
2
+
3
+Fully working GPU-accelerated wheel for `llama-cpp-python==0.3.16` on **Python 3.14** (Windows amd64).
4
5
+Built December 17, 2025 with:
6
+- CUDA Toolkit 13.1 (latest)
7
+- Full CUDA graph support
8
+- Tested: ~85 tokens/second on Llama 3 8B Q4_K_M (RTX 3090)
9
10
+### Install
11
+```bash
12
+pip install llama_cpp_python-0.3.16-cp314-cp314-win_amd64.whl
0 commit comments