You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Determine how much the model should think by setting the `thinking_budget`
173
173
174
174
- **OpenAI**: use effort levels — `minimal`, `low`, `medium`, `high`
175
-
- **Anthropic**: set an integer token budget. Minimum is 1024; range is 1024–32768; must be strictly less than `max_tokens`. When set, cagent uses Anthropic's Beta Messages API with interleaved thinking enabled.
175
+
- **Anthropic**: set an integer token budget. Range is 1024–32768; must be strictly less than `max_tokens`.
Anthropic's interleaved thinking feature uses the Beta Messages API to provide tool calling during model reasoning. You can control this behavior using the `interleaved_thinking` provider option:
@@ -220,11 +246,14 @@ models:
220
246
221
247
Notes:
222
248
223
-
- If an invalid OpenAI effort value is set, the request will fail with a clear error
224
-
- For Anthropic, values < 1024 or ≥ `max_tokens` are ignored (warning logged)
225
-
- When `interleaved_thinking` is enabled, cagent uses Anthropic's Beta Messages API with a default thinking budget of 16384 tokens if not specified
249
+
- **OpenAI**: If an invalid effort value is set, the request will fail with a clear error
250
+
- **Anthropic**: Values < 1024 or ≥ `max_tokens` are ignored (warning logged). When `interleaved_thinking` is enabled, cagent uses Anthropic's Beta Messages API with a default thinking budget of 16384 tokens if not specified
251
+
- **Google**:
252
+
- Most models support values between -1 and 24576 tokens. Set to `0` to disable, `-1` for dynamic thinking
0 commit comments