    Repositories list

    • gpustack-higress-plugin

      Public
      Go
      3100 Updated Apr 17, 2026
    • gpustack-ui

      Public
      TypeScript
      Apache License 2.0
      567925 Updated Apr 17, 2026
    • gpustack

      Public
      A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.
      Python
      Apache License 2.0
      5024.9k52426 Updated Apr 17, 2026
    • Community Inference Backends for GPUStack V2
      Python
      Apache License 2.0
      81200 Updated Apr 17, 2026
    • runtime

      Public
      Provides a unified interface to detect GPU resources and manages GPU workloads.
      Python
      Apache License 2.0
      151403 Updated Apr 8, 2026
    • gpustack.github.io

      Public
      HTML
      2100 Updated Apr 3, 2026
    • .github

      Public
      Meta-GitHub repository for all GPUStack repositories.
      Apache License 2.0
      4100 Updated Apr 1, 2026
    • gguf-parser-go

      Public
      Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
      Go
      MIT License
      2426310 Updated Mar 25, 2026
    • runner

      Public
      Collection of Dockerfiles to build images for various inference services across different accelerated backends.
      Dockerfile
      Apache License 2.0
      91100 Updated Mar 13, 2026
    • Python
      Apache License 2.0
      2310 Updated Mar 6, 2026
    • vox-box

      Public
      A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
      Python
      Apache License 2.0
      33206162 Updated Dec 23, 2025
    • llama-box

      Public archive
      LM inference server implementation based on *.cpp.
      C++
      MIT License
      2829620 Updated Nov 24, 2025
    • Python
      Apache License 2.0
      2110 Updated Aug 26, 2025
    • fastfetch

      Public
      Like neofetch, but much faster because written mostly in C.
      C
      MIT License
      741200 Updated Oct 24, 2024
    • Deliver LLMs of GGUF format via Dockerfile.
      Go
      MIT License
      51500 Updated Oct 24, 2024