feat(plugins): Ollama support by Dialvive · Pull Request #352 · mpfaffenberger/code_puppy

Dialvive · 2026-05-21T00:22:57Z

Summary

Adds a new plugin that lets Code Puppy connect to local inference servers — Ollama, LM Studio, vLLM, llama.cpp, and any other OpenAI Chat Completions-compatible endpoint — without sending requests to remote model providers.

The plugin is auto-discovered by the existing plugin loader. Zero existing files were modified.

User Story

As a user, I want to run Code Puppy with local models through Ollama (or any OpenAI Chat Completions-compatible endpoint), so that I don't connect with remote model providers, use the hardware on my machine to process requests, and reduce costs.

Acceptance Criteria

Test Plan

#	Test Case	Expected Result
1	Handler with `custom_endpoint` config	Uses provided URL, api_key, headers via `get_custom_config()`
2	Handler without `custom_endpoint`	Defaults to `http://localhost:11434/v1`, api_key `"ollama"`
3	`OLLAMA_HOST` env var set	Overrides default URL, appends `/v1` if missing
4	Handler returns `OpenAIChatModel`	NOT `OpenAIResponsesModel` — verified via isinstance check
5	Handler failure (e.g., bad config)	Returns `None`, does not raise
6	`get_ollama_model_types()` return structure	Returns `[{"type": "ollama", "handler": callable}]`
7	`ModelFactory.get_model` integration	Type `"ollama"` routes to the handler via callback
8	Plugin auto-discovery	`register_callbacks.py` is found by plugin loader

How to Use

1 — Install Ollama

brew install ollama

2 — Pull a model with tool calling support

ollama pull qwen3:8b         # ~6 GB VRAM  — good for testing
ollama pull qwen3:14b        # ~10 GB VRAM — better quality
ollama pull qwen3:30b        # ~20 GB VRAM — recommended

3 — Start the Ollama server

brew services start ollama

curl http://localhost:11434/api/tags

4 — Configure the model in Code Puppy

Create or edit ~/.code_puppy/extra_models.json:

{
  "ollama-qwen3": {
    "type": "ollama",
    "name": "qwen3:8b",
    "context_length": 32768
  }
}

Multiple models can be registered at once:

{
  "ollama-qwen3-8b": {
    "type": "ollama",
    "name": "qwen3:8b",
    "context_length": 32768
  },
  "ollama-qwen3-30b": {
    "type": "ollama",
    "name": "qwen3:30b",
    "context_length": 131072
  }
}

Custom host (different port, remote machine, LM Studio):

{
  "lmstudio-codellama": {
    "type": "ollama",
    "name": "codellama:34b",
    "context_length": 16384,
    "custom_endpoint": {
      "url": "http://192.168.1.50:1234/v1",
      "api_key": "lm-studio"
    }
  }
}

Alternatively, set OLLAMA_HOST to override the default endpoint without editing the config:

export OLLAMA_HOST=http://myserver:11434

5 — Run Code Puppy and switch to the local model

./code-puppy-dev --interactive

Inside the session:

/model ollama-qwen3

Or start directly on the model:

./code-puppy-dev --interactive --model ollama-qwen3

Testing

Unit tests (no Ollama required — fully mocked)

pytest tests/test_ollama_plugin.py -v

Coverage: 100% on plugin files across 15 test cases covering:

custom_endpoint path (uses get_custom_config())
Default localhost path
OLLAMA_HOST env var — append /v1, already ends with /v1, trailing slash, empty string
Returns OpenAIChatModel not OpenAIResponsesModel
model_config["name"] resolution with fallback to config key
Graceful None return on exception
Handler registration structure

Verified End-to-End

ollama serve + qwen3:8b on localhost
Model appears in /model picker after configuring extra_models.json
File read/write tool calls work
Multi-step agentic tasks complete successfully
No remote API calls made during local model session

Adds docstring for the Ollama plugin.

Add unit tests for the Ollama plugin model type handler, covering various scenarios including custom endpoints, environment variables, and model creation.

Dialvive added 3 commits May 20, 2026 18:12

Add Ollama model type handler for OpenAI integration

9da633c

Add docstring to Ollama plugin initialization

c971ba3

Adds docstring for the Ollama plugin.

Implement tests for Ollama plugin model handler

ab8aa6c

Add unit tests for the Ollama plugin model type handler, covering various scenarios including custom endpoints, environment variables, and model creation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(plugins): Ollama support#352

feat(plugins): Ollama support#352
Dialvive wants to merge 3 commits into
mpfaffenberger:mainfrom
Dialvive:feature/ollama_support

Dialvive commented May 21, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Dialvive commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

User Story

Acceptance Criteria

Test Plan

How to Use

1 — Install Ollama

2 — Pull a model with tool calling support

3 — Start the Ollama server

4 — Configure the model in Code Puppy

5 — Run Code Puppy and switch to the local model

Testing

Unit tests (no Ollama required — fully mocked)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Dialvive commented May 21, 2026 •

edited

Loading