config: fallback 모델 qwen3.5:35b → qwen3.5:9b-q8_0 (GPU VRAM 제한)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -13,7 +13,7 @@ ai:
|
||||
|
||||
fallback:
|
||||
endpoint: "http://ollama:11434/v1/chat/completions"
|
||||
model: "qwen3.5:35b-a3b"
|
||||
model: "qwen3.5:9b-q8_0"
|
||||
max_tokens: 4096
|
||||
timeout: 120
|
||||
|
||||
|
||||
Reference in New Issue
Block a user