Commit 2da5cfa
committed
feat: make Llama.cpp context size configurable via environment
- Add LLAMA_ARG_CTX_SIZE environment variable support in llamacpp service
- Increase default context size from 4096 to 8192 tokens for better performance
- Allow overriding via .env file for different deployment scenarios
- Maintains backward compatibility with existing setups1 parent c8aadbc commit 2da5cfa
1 file changed
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
412 | 412 | | |
413 | 413 | | |
414 | 414 | | |
415 | | - | |
| 415 | + | |
416 | 416 | | |
417 | 417 | | |
418 | 418 | | |
| |||
0 commit comments