You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Determine how much the model should think by setting the `thinking_budget`
173
173
174
174
- **OpenAI**: use effort levels — `minimal`, `low`, `medium`, `high`
175
-
- **Anthropic, Google (Gemini), DMR, others**: coming soon
175
+
- **Anthropic**: set an integer token budget. Minimum is 1024; range is 1024–32768; must be strictly less than `max_tokens`. When set, cagent uses Anthropic's Beta Messages API with interleaved thinking enabled.
176
176
177
177
Examples (OpenAI):
178
178
@@ -189,11 +189,44 @@ agents:
189
189
instruction: you are a helpful assistant
190
190
```
191
191
192
+
Examples (Anthropic):
193
+
194
+
```yaml
195
+
models:
196
+
claude:
197
+
provider: anthropic
198
+
model: claude-sonnet-4-5-20250929
199
+
thinking_budget: 1024
200
+
201
+
agents:
202
+
root:
203
+
model: claude
204
+
instruction: you are a helpful assistant that doesn't think very much
205
+
```
206
+
207
+
#### Interleaved Thinking (Anthropic)
208
+
209
+
Anthropic's interleaved thinking feature uses the Beta Messages API to provide tool calling during model reasoning. You can control this behavior using the `interleaved_thinking` provider option:
210
+
211
+
```yaml
212
+
models:
213
+
claude:
214
+
provider: anthropic
215
+
model: claude-sonnet-4-5-20250929
216
+
thinking_budget: 8192 # Optional: defaults to 16384 when interleaved thinking is enabled
0 commit comments