Skip to main content

vAquila Documentation

vAquila is an open-source orchestrator for vLLM + Docker designed for local and production-oriented workflows.

The project focuses on:

Fast model launch from CLI
Safer GPU memory usage with runtime estimation
CPU fallback support
A local Web UI for operations and observability

Start with Getting Started.