Each model is tuned for infrastructure or code-heavy workflows. Pull the weights for local use, or chat in-browser using our hosted inference.
Ultra-lightweight Kubernetes helper for manifests and object scaffolding, ideal for fast iteration, CI jobs, and resource-constrained environments. Finetuned to outperform base Qwen on core K8s authoring tasks.
Larger Qwen-based Kubernetes model for multi-step reasoning, production workflows, and richer manifest generation. Finetuned to exceed baseline Qwen on troubleshooting, CI/CD, and rollout planning.
Mid-sized Llama model deeply finetuned for controllers, RBAC, operators, and production-grade deployment patterns. Opinionated best-practice defaults for real-world cluster architectures.
Gemma-based Kubernetes model tuned for long-form explanations, migration plans, debugging sessions, and architecture reviews. Finetuned to outperform base Gemma on complex K8s design tasks.