fix(deploy): primary endpoint -> mlx-proxy 8801
100.76.254.116:8800 -> :8801 to route through mlx-proxy and gain /status observability (active_jobs / total_requests).
This commit is contained in:
+1
-1
@@ -6,7 +6,7 @@ ai:
|
||||
|
||||
models:
|
||||
primary:
|
||||
endpoint: "http://100.76.254.116:8800/v1/chat/completions"
|
||||
endpoint: "http://100.76.254.116:8801/v1/chat/completions"
|
||||
model: "mlx-community/Qwen3.5-35B-A3B-4bit"
|
||||
max_tokens: 4096
|
||||
timeout: 60
|
||||
|
||||
Reference in New Issue
Block a user