Add Hopper LLM provider by pavanyellow · Pull Request #6132 · livekit/agents

pavanyellow · 2026-06-17T00:36:39Z

Summary

Adds a LLM.with_hopper() factory to the OpenAI plugin, alongside the other
OpenAI-compatible providers (Cerebras, Together, Nebius, Telnyx, …).

Hopper serves open-source models optimized for low
time-to-first-token, aimed at voice agents. The API is OpenAI-compatible, so
this follows the existing with_* provider pattern — no new plugin, just a
factory method plus a HopperChatModels type.

Usage

from livekit.plugins import openai

llm = openai.LLM.with_hopper(
    # api_key defaults to HOPPER_API_KEY env var
    model="Qwen/Qwen3.6-35B-A3B",
)

Get an API key at https://withhopper.com.

Changes

with_hopper() classmethod on openai.LLM (defaults: base_url=https://api.withhopper.com/v1, HOPPER_API_KEY).
HopperChatModels literal in models.py.
_strict_tool_schema=False, matching the other open-model providers served on vLLM-style backends (same as Cerebras, use non-strict tool schema for cerebras llm #3134).

Latency

TTFT (time to first token) over a warm connection, voice-agent-shaped context
(~2k-token system prompt + short user turn), 10-run p50:

From us-west-2 (same region as the model server): p50 62ms (min 56, max 77)
From a residential laptop in SF: ~170ms — the difference is network round-trip

First-token latency is dominated by where your agent runs relative to the model,
not the serving itself; colocated, it's ~60ms.

Testing

Verified against the live endpoint through the plugin:

factory builds with the correct base URL and model
streaming chat() returns a valid completion
function calling emits a correct tool call (validates _strict_tool_schema=False)
ruff check passes on the changed files

CLAassistant · 2026-06-17T00:36:47Z

All committers have signed the CLA.

Hopper serves open-source models optimized for low time-to-first-token, aimed at voice agents. The API is OpenAI-compatible, so this adds a LLM.with_hopper() factory in the livekit-plugins-openai package alongside the other OpenAI-compatible providers (Cerebras, Together, Nebius, etc.), plus a HopperChatModels type.

devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no bugs or issues to report.

pavanyellow force-pushed the feat/hopper-llm branch from 06fe901 to ce4f12a Compare June 17, 2026 00:45

pavanyellow changed the title ~~feat(openai): add Hopper LLM provider (with_hopper)~~ Add Hopper LLM provider Jun 17, 2026

pavanyellow marked this pull request as ready for review June 17, 2026 00:52

pavanyellow force-pushed the feat/hopper-llm branch from ce4f12a to ff006fd Compare June 17, 2026 00:52

pavanyellow requested a review from a team as a code owner June 17, 2026 00:52

devin-ai-integration Bot reviewed Jun 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Hopper LLM provider#6132

Add Hopper LLM provider#6132
pavanyellow wants to merge 1 commit into
livekit:mainfrom
pavanyellow:feat/hopper-llm

pavanyellow commented Jun 17, 2026 •

edited

Loading

Uh oh!

CLAassistant commented Jun 17, 2026 •

edited

Loading

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

pavanyellow commented Jun 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Usage

Changes

Latency

Testing

Uh oh!

CLAassistant commented Jun 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

✅ Devin Review: No Issues Found

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pavanyellow commented Jun 17, 2026 •

edited

Loading

CLAassistant commented Jun 17, 2026 •

edited

Loading