Add configurable default model support by Qard · Pull Request #161 · braintrustdata/autoevals

Stephen Belanger (Qard) · 2026-01-12T23:54:31Z

This change allows users to configure which model to use as the default for all evaluations, replacing the hardcoded gpt-4o default.

Changes:

Add defaultModel parameter to init() in both JS and Python
Add getDefaultModel() function to retrieve configured default model
Update LLMClassifier and RAGAS scorers to use configurable default model
Update documentation with examples for different use cases

This enables:

Using different OpenAI models (gpt-4-turbo, o1, gpt-3.5-turbo, etc.)
Using non-OpenAI models via Braintrust proxy (Claude, Gemini, Llama, etc.)
Configuring once and having all evaluators use the preferred model

Example usage:

init({
  client: new OpenAI({
    apiKey: process.env.BRAINTRUST_API_KEY,
    baseURL: "https://api.braintrust.dev/v1/proxy",
  }),
  defaultModel: "claude-3-5-sonnet-20241022",
});

Fixes #136

github-actions · 2026-01-12T23:57:17Z

Braintrust eval report

Autoevals (model-flexibility-1768324125)

Score	Average	Improvements	Regressions
NumericDiff	73.4% (+1pp)	3 🟢	1 🔴
Time_to_first_token	1.33tok (-0.06tok)	85 🟢	33 🔴
Llm_calls	1.55 (+0)	-	-
Tool_calls	0 (+0)	-	-
Errors	0 (+0)	-	-
Llm_errors	0 (+0)	-	-
Tool_errors	0 (+0)	-	-
Prompt_tokens	279.25tok (+0tok)	-	-
Prompt_cached_tokens	0tok (+0tok)	-	-
Prompt_cache_creation_tokens	0tok (+0tok)	-	-
Completion_tokens	19.3tok (+0tok)	-	-
Completion_reasoning_tokens	0tok (+0tok)	-	-
Total_tokens	298.54tok (+0tok)	-	-
Estimated_cost	0$ (+0$)	-	-
Duration	3.14s (-0.42s)	140 🟢	79 🔴
Llm_duration	2.58s (-0.22s)	106 🟢	13 🔴

Olmo Maldonado (ibolmo)

LGTM!

This change allows users to configure which model to use as the default for all evaluations, replacing the hardcoded gpt-4o default. Changes: - Add `defaultModel` parameter to `init()` in both JS and Python - Add `getDefaultModel()` function to retrieve configured default model - Update LLMClassifier and RAGAS scorers to use configurable default model - Update documentation with examples for different use cases This enables: - Using different OpenAI models (gpt-4-turbo, o1, gpt-3.5-turbo, etc.) - Using non-OpenAI models via Braintrust proxy (Claude, Gemini, Llama, etc.) - Configuring once and having all evaluators use the preferred model Example usage: ```javascript init({ client: new OpenAI({ apiKey: process.env.BRAINTRUST_API_KEY, baseURL: "https://api.braintrust.dev/v1/proxy", }), defaultModel: "claude-3-5-sonnet-20241022", }); ``` Fixes #136 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

github-actions · 2026-01-13T17:10:45Z

Braintrust eval report

Autoevals (main-1768324249)

Score	Average	Improvements	Regressions
NumericDiff	72.5% (-1pp)	1 🟢	3 🔴
Time_to_first_token	1.34tok (+0.01tok)	50 🟢	68 🔴
Llm_calls	1.55 (+0)	-	-
Tool_calls	0 (+0)	-	-
Errors	0 (+0)	-	-
Llm_errors	0 (+0)	-	-
Tool_errors	0 (+0)	-	-
Prompt_tokens	279.25tok (+0tok)	-	-
Prompt_cached_tokens	0tok (+0tok)	-	-
Prompt_cache_creation_tokens	0tok (+0tok)	-	-
Completion_tokens	19.3tok (+0tok)	-	-
Completion_reasoning_tokens	0tok (+0tok)	-	-
Total_tokens	298.54tok (+0tok)	-	-
Estimated_cost	0$ (+0$)	-	-
Duration	2.86s (-0.28s)	105 🟢	111 🔴
Llm_duration	2.72s (+0.14s)	30 🟢	89 🔴

Stephen Belanger (Qard) requested a review from Olmo Maldonado (ibolmo) January 12, 2026 23:54

Stephen Belanger (Qard) self-assigned this Jan 12, 2026

Stephen Belanger (Qard) added enhancement New feature or request lang:python lang:typescript labels Jan 12, 2026

Stephen Belanger (Qard) requested a review from Ankur Goyal (ankrgyl) January 12, 2026 23:54

Stephen Belanger (Qard) force-pushed the model-flexibility branch 3 times, most recently from 48320a5 to 91f19f8 Compare January 13, 2026 00:00

Olmo Maldonado (ibolmo) approved these changes Jan 13, 2026

View reviewed changes

Stephen Belanger (Qard) force-pushed the model-flexibility branch 3 times, most recently from 7d7b9da to 7f8c1fd Compare January 13, 2026 00:33

Stephen Belanger (Qard) requested a review from Olmo Maldonado (ibolmo) January 13, 2026 00:35

Olmo Maldonado (ibolmo) approved these changes Jan 13, 2026

View reviewed changes

Stephen Belanger (Qard) force-pushed the model-flexibility branch from 7f8c1fd to d616f67 Compare January 13, 2026 17:08

Stephen Belanger (Qard) merged commit 1ff945d into main Jan 13, 2026
7 checks passed

Stephen Belanger (Qard) deleted the model-flexibility branch January 13, 2026 17:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add configurable default model support#161

Add configurable default model support#161
Stephen Belanger (Qard) merged 1 commit into
mainfrom
model-flexibility

Stephen Belanger (Qard) commented Jan 12, 2026

Uh oh!

github-actions Bot commented Jan 12, 2026 •

edited

Loading

Uh oh!

Olmo Maldonado (ibolmo) left a comment

Uh oh!

Uh oh!

github-actions Bot commented Jan 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Stephen Belanger (Qard) commented Jan 12, 2026

Uh oh!

github-actions Bot commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Braintrust eval report

Uh oh!

Olmo Maldonado (ibolmo) left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions Bot commented Jan 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Braintrust eval report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions Bot commented Jan 12, 2026 •

edited

Loading

github-actions Bot commented Jan 13, 2026 •

edited

Loading