config: fallback 모델 qwen3.5:35b → qwen3.5:9b-q8_0 (GPU VRAM 제한)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
Hyungi Ahn
2026-04-03 13:40:25 +09:00
parent 63f75de89d
commit 9b0705b79f

View File

@@ -13,7 +13,7 @@ ai:
fallback:
endpoint: "http://ollama:11434/v1/chat/completions"
model: "qwen3.5:35b-a3b"
model: "qwen3.5:9b-q8_0"
max_tokens: 4096
timeout: 120