Community fixes 20260601 by GO1984 · Pull Request #15 · outsourc-e/bench-loop

GO1984 · 2026-06-01T09:05:33Z

Summary

This PR fixes several runtime issues found while using BenchLoop with local OpenAI-compatible servers and newer NVIDIA hardware.

Detect OpenAI-compatible endpoints by common ports and hosts.
Use /v1/chat/completions for OpenAI-compatible preflight checks.
Skip Ollama version checks for OpenAI-compatible endpoints.
Add endpoint-specific API key support via BENCHLOOP_OPENAI_KEYS.
Forward endpoint-specific auth headers through model listing, chat, and streaming calls.
Tolerate [N/A] values from nvidia-smi.
Omit internal asyncio task objects from active run API responses.

Some OpenAI-compatible servers, such as llama.cpp, expose:

but do not expose Ollama routes like:

Before this change, BenchLoop could treat those endpoints as Ollama and fail with errors like:

Health check failed (404): {"error":{"message":"File Not Found","type":"not_found_error","code":404}}

GO1984 added 2 commits June 1, 2026 10:19

fix: handle OpenAI-compatible endpoint preflight

02bb8c4

fix: improve runtime robustness

107c5a6