Revise README for llama-cpp-python GPU wheel

rookiemann · web-flow · commit 450c44f55f30 · 2025-12-17T18:21:28.000-07:00
Updated README to reflect accurate package name and details.
diff --git a/README.md b/README.md
@@ -0,0 +1,12 @@
+# llama-cpp-python GPU Wheel for Python 3.14 (CUDA 13.1)
+
+Fully working GPU-accelerated wheel for `llama-cpp-python==0.3.16` on **Python 3.14** (Windows amd64).
+
+Built December 17, 2025 with:
+- CUDA Toolkit 13.1 (latest)
+- Full CUDA graph support
+- Tested: ~85 tokens/second on Llama 3 8B Q4_K_M (RTX 3090)
+
+### Install
+```bash
+pip install llama_cpp_python-0.3.16-cp314-cp314-win_amd64.whl