Skip to main content

vAquila Documentation

vAquila is an open-source orchestrator for vLLM + Docker designed for local and production-oriented workflows.

The project focuses on:

  • Fast model launch from CLI
  • Safer GPU memory usage with runtime estimation
  • CPU fallback support
  • A local Web UI for operations and observability

Start with Getting Started.