Skip to content
10 Ready Templates

AI Templates

No CUDA setup. Pick a template, choose your GPU — running in minutes.

Most Popular
🦙

Ollama

LLM Serve

Run any open-source LLM with one command

Min 8GB VRAM

vLLM

LLM Serve

High-throughput OpenAI-compatible inference server

Min 16GB VRAM
🤗

Text Generation Inference

LLM Serve

HuggingFace production LLM serving

Min 16GB VRAM
🚀

SGLang

LLM Serve

Structured generation & fast batching

Min 16GB VRAM
Most Popular
🌐

Open WebUI

UI

ChatGPT-style interface for Ollama / OpenAI

Min 8GB VRAM
💬

LibreChat

UI

Multi-provider AI chat with plugins

Min 8GB VRAM
🔄

Flowise

UI

Drag-and-drop LLM flow builder

Min 4GB VRAM
🪓

Axolotl

Training

Easy LLM fine-tuning: LoRA, QLoRA, FSDP

Min 24GB VRAM
Fast
🦥

Unsloth

Training

2× faster fine-tuning, 70% less VRAM

Min 16GB VRAM
🔗

Langflow

Agent

Visual multi-agent & RAG pipeline builder

Min 8GB VRAM