Last active
June 25, 2025 06:18
-
-
Save ndeloof/34c5ad1db34a6b1d41124c0c770cf420 to your computer and use it in GitHub Desktop.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
models: | |
llama: | |
model: ai/example-model | |
context-size: 1024 | |
runtime-flags: "--no-prefill-assistant" | |
services: | |
# Option 1 : by host alias | |
app: | |
models: | |
- llama # container will run with `llama` alias in /etc/hosts | |
another_app: | |
models: | |
# mimic https://github.com/compose-spec/compose-spec/blob/main/05-services.md#links to let user choose a network alias for Model | |
- llama:model-runner | |
# Option 2 : by env variable | |
app_with_env: | |
models: | |
- llama # container will run with `LLAMA_URL` set | |
another_app_with_env: | |
models: | |
llama: | |
variable: MODEL_URL |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment