Skip to content
@gpustack

GPUStack

Open-source GPU cluster manager for running large language models(LLMs)

Pinned Loading

  1. gpustack gpustack Public

    Manage GPU clusters for running AI models

    Python 2.8k 283

  2. gguf-parser-go gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go 174 16

  3. llama-box llama-box Public

    LM inference server implementation based on *.cpp.

    C++ 204 16

  4. vox-box vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python 118 15

Repositories

Showing 10 of 10 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…