vAquila Documentation
vAquila is an open-source orchestrator for vLLM + Docker designed for local and production-oriented workflows.
The project focuses on:
- Fast model launch from CLI
- Safer GPU memory usage with runtime estimation
- CPU fallback support
- A local Web UI for operations and observability
Start with Getting Started.