Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions packages/proxy/schema/index.ts
Original file line number Diff line number Diff line change
Expand Up @@ -590,6 +590,7 @@ export const AvailableEndpointTypes: { [name: string]: ModelEndpointType[] } = {
"grok-2-1212": ["xAI"],
"grok-vision-beta": ["xAI"],
"grok-beta": ["xAI"],
"claude-opus-4-8": ["anthropic"],
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P1 Badge Strip unsupported sampling params for Opus 4.8

Exposing claude-opus-4-8 as an Anthropic model means Chat Completions requests are translated through openaiParamsToAnthropicMesssageParams, but omitUnsupportedAnthropicParams only special-cases the 4.7 model names. Anthropic documents that Opus 4.8 returns 400 for non-default temperature, top_p, or top_k, so common requests that include a temperature (for example temperature: 0.7) will be forwarded and fail unless 4.8 is added to the unsupported-params handling.

Useful? React with 👍 / 👎.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this is referencing legacy proxy code, so not relevant

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lingua does the same thing though, so need to put up a PR to fix it there

"grok-4.20-0309-non-reasoning": ["xAI"],
"accounts/fireworks/models/deepseek-v4-flash": ["fireworks"],
"accounts/fireworks/models/glm-5": ["fireworks"],
Expand Down
16 changes: 16 additions & 0 deletions packages/proxy/schema/model_list.json
Original file line number Diff line number Diff line change
Expand Up @@ -2604,6 +2604,22 @@
"max_input_tokens": 100000,
"max_output_tokens": 8191
},
"claude-opus-4-8": {
"format": "anthropic",
"flavor": "chat",
"multimodal": true,
"input_cost_per_mil_tokens": 5,
"output_cost_per_mil_tokens": 25,
"input_cache_read_cost_per_mil_tokens": 0.5,
"input_cache_write_cost_per_mil_tokens": 6.25,
"displayName": "Claude 4.8 Opus",
"reasoning": true,
"max_input_tokens": 1000000,
"max_output_tokens": 128000,
"available_providers": [
"anthropic"
]
},
"meta/llama-2-70b-chat": {
"format": "openai",
"flavor": "chat",
Expand Down
Loading