LLMKube
★ 110Kubernetes operator for llama.cpp-native LLM inference with GPU scheduling, Apple Silicon Metal support, and OpenAI-compatible API.
Community Ratings
Ease of Setup
—
Documentation
—
Resource Usage
—
Feature Completeness
—
Paywall Fairness
—
Maintenance
—
Community
—
Details
- Last Updated
- May 26, 2026
- Created
- Nov 12, 2025
- Runtime / Stack
- GoDockerK8S
Tags
Track your self-hosted stack
Bookmark software to try, rate tools you've used, and keep your collection in one place.
Related Software
AutoGPT
MIT License
184.6k
DevelopersAI & ML PractitionersArtificial IntelligenceAutomation Tools+4
AutoGPT is a platform for creating, deploying, and managing continuous AI agents that automate complex workflows.
Open CoreMulti-UserDocker
DetailsOllama
MIT License
172.3k
DevelopersAI & ML PractitionersGenerative AIArtificial Intelligence+3
Ollama is a local runtime for running open models and interacting with them through a CLI, REST API, and integrations.
OfflineDocker
Detailshermes-agent
MIT License
167.4k
DevelopersAI & ML PractitionersGenerative AIAI Agents+2
Hermes Agent is a self-improving AI agent with a terminal UI and messaging gateway that can run across CLI and chat platforms.
Multi-User
DetailsDify.ai
Apache License 2.0, commons-clause
142.6k
DevelopersStartupsLow Code DevelopmentGenerative AI+6
Dify is an open-source LLM app development platform for building, testing, and operating AI applications with workflows, RAG, agents, and model management.
Open CoreMulti-UserDocker
DetailsOpen-WebUI
MIT License
138.6k
AI & ML PractitionersGenerative AIArtificial Intelligence+6
Open WebUI is a self-hosted AI platform for chatting with LLMs, managing RAG, and building custom model workflows, designed to run entirely offline.
Open CoreOfflineMulti-User
DetailsClaude Code
proprietary
126.5k
DevelopersArtificial IntelligenceAI Coding Assistant+5
Claude Code is an agentic coding tool that runs in the terminal and helps with coding tasks, code explanation, and git workflows using natural language commands.
Binary
DetailsComfyUI
GNU General Public License v3.0
114.5k
Content CreatorsAI & ML PractitionersDesignersArtificial IntelligenceAI Media Generation+4
ComfyUI is a graph-based visual engine for designing and running advanced Stable Diffusion workflows.
Open CoreOfflineDocker
DetailsNextChatAI
MIT License
88.1k
DevelopersAI & ML PractitionersSmall TeamsArtificial IntelligenceAI Interfaces+3
NextChat is a lightweight, fast AI assistant web app and desktop client with support for multiple LLM providers and self-hosted models.
Open CoreMulti-UserDocker
DetailsCodex
Apache License 2.0
85.7k
DevelopersVibe CodersArtificial IntelligenceAI Coding Assistant+5
Codex CLI is a local coding agent from OpenAI for running coding tasks on your computer.
Multi-UserPackage
DetailsLobe Chat
Apache License 2.0
77.7k
Small TeamsArtificial IntelligenceAI Interfaces+5
LobeHub is an AI agent workspace for creating, collaborating with, and managing agent teammates.
Multi-UserDocker
DetailsLobeHub
proprietary
77.7k
Small TeamsGenerative AIAI Agents+4
LobeHub is a self-hostable AI agent workspace for organizing, creating, scheduling, and collaborating with agents.
Multi-UserDocker
DetailsGPT4All
MIT License
77.4k
Privacy AdvocatesAI & ML PractitionersArtificial IntelligenceLocal LLMs+4
GPT4All runs large language models privately on everyday desktops and laptops, and provides a Python client around llama.cpp implementations.
OfflineDocker
Details