mirror of
https://github.com/docker/inference-engine-vllm
synced 2026-04-05 19:45:13 +00:00
No description
- Shell 93.8%
- Dockerfile 6.2%
|
|
||
|---|---|---|
| .github/workflows | ||
| scripts | ||
| cuda.Dockerfile | ||
| generic.Dockerfile | ||
| LICENSE | ||
| README.md | ||
vLLM inference runtime
This repo contains implementations of the vLLM inference runtime.