Skip to content
@gpustack

GPUStack

Simple, scalable AI model deployment on GPU clusters

Pinned Loading

  1. gpustack gpustack Public

    GPU cluster manager for optimized AI model deployment

    Python 4.1k 419

  2. runner runner Public

    Collection of Dockerfiles to build images for various inference services across different accelerated backends.

    Dockerfile 3 6

  3. runtime runtime Public

    Provides a unified interface to detect GPU resources and manages GPU workloads.

    Python 3 5

  4. gguf-parser-go gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go 220 21

  5. vox-box vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python 178 26

Repositories

Showing 10 of 13 repositories