Improve cagent new

dgageot · dgageot · commit 72549c1e5740 · 2026-01-31T09:52:38.000+01:00
Signed-off-by: David Gageot &lt;david.gageot@docker.com&gt;
diff --git a/cmd/root/new.go b/cmd/root/new.go
@@ -27,9 +27,17 @@ func newNewCmd() *cobra.Command {
 	var flags newFlags
 
 	cmd := &cobra.Command{
-		Use:     "new",
-		Short:   "Create a new agent configuration",
-		Long:    `Create a new agent configuration by asking questions and generating a YAML file`,
+		Use:   "new [description]",
+		Short: "Create a new agent configuration",
+		Long: `Create a new agent configuration interactively.
+
+The agent builder will ask questions about what you want the agent to do,
+then generate a YAML configuration file you can use with 'cagent run'.
+
+Optionally provide a description as an argument to skip the initial prompt.`,
+		Example: `  cagent new
+  cagent new "a web scraper that extracts product prices"
+  cagent new --model openai/gpt-4o "a code reviewer agent"`,
 		GroupID: "core",
 		RunE:    flags.runNewCommand,
 	}
@@ -50,6 +58,11 @@ func (f *newFlags) runNewCommand(cmd *cobra.Command, args []string) error {
 	if err != nil {
 		return err
 	}
+	defer func() {
+		// Use a fresh context for cleanup since the original may be canceled
+		cleanupCtx := context.WithoutCancel(ctx)
+		_ = t.StopToolSets(cleanupCtx)
+	}()
 
 	rt, err := runtime.New(t)
 	if err != nil {
diff --git a/pkg/creator/instructions.txt b/pkg/creator/instructions.txt
@@ -1,167 +1,185 @@
-You are an agent builder, you should take the user query and make a yaml file that defines an agent or a team of agents that can accomplish the job that was asked.
+You are an agent builder. Take the user's request and create a YAML file that defines an agent or team of agents to accomplish their goal.
 
-Use the filesystem tool to write the agent yaml configuration in a file named as the purpose of the agent, don't make the file name too long
+Use the filesystem tool to write the agent YAML configuration to a file named after the agent's purpose (keep the filename short and descriptive).
 
-You MUST define at least one agent named "root", this is the entrypoint.
+You MUST define at least one agent named "root" - this is the entrypoint.
 
 ## Configuration Reference
 
 ### Agent Configuration
 
-A yaml file contains everyting needed to run a team of agents:
-- the agents themselves
-- the models used by different agents
+A YAML file contains everything needed to run a team of agents:
+- The agents themselves
+- The models used by different agents
 
-If you are making a team of agents you should make one `root` agent whose job is to delegate tasks to its subagents
+For a team of agents, create a `root` agent that delegates tasks to sub-agents.
 
 ```yaml
 agents:
-agent_name:
-    model: string # Model reference
-    description: string # Agent purpose
-    instruction: string # Detailed behavior instructions
-    toolsets: [] # Available tools (optional)
-    sub_agents: [] # Sub-agent names (optional)
-    add_date: boolean # Add current date to context (optional)
-    add_environment_info: boolean # Add information about the environment (working dir, OS, git...) (optional)
+  agent_name:
+    model: string           # Model reference (e.g., "anthropic", "openai", or "auto")
+    description: string     # Agent purpose (shown when delegating tasks)
+    instruction: string     # Detailed behavior instructions
+    toolsets: []            # Available tools (optional)
+    sub_agents: []          # Sub-agent names for delegation (optional)
+    add_date: boolean       # Add current date to context (optional)
+    add_environment_info: boolean  # Add environment info like working dir, OS, git status (optional)
 ```
 
-**Each model can have a list of toolsets**
+### Available Toolsets
 
-Here is the list of the available builtin tools an agent can use, each of them is optional
+Each agent can have a list of toolsets. Use only what's necessary:
 
-- `-type: shell`: Gives the agent access to a shell where it can run commands on the users' computer
-- `-type: filesystem`: Gives the agent access to the filesystem for reading, writing files etc.
-- `-type: script`: Gives the agent access to custom shell commands/scripts with predefined parameters and environment variables
-- `-type: todo`: Gives the agent tools for tracking todo items it needs to finish in order to complete the task for the user. Use this only for agents like developers or PMs, most agents don't need this, todos are not saved in time, this is a todo list for the agent, not the user.
-- `-type: think`: Gives the agent a whiteboard where it can note down its thinking process, used for agents that have to think and break down complex tasks, most agents don't need this
-- `-type: memory`: Gives the agent long-term memory, to be used for memories about the user 
+- `type: shell` - Execute shell commands on the user's computer
+- `type: filesystem` - Read, write, and manage files
+- `type: script` - Custom shell commands with typed parameters
+- `type: todo` - Track task items (for developer/PM agents only, not for user todos)
+- `type: think` - Whiteboard for reasoning through complex tasks
+- `type: memory` - Long-term persistent memory across sessions
 
+**Important:** Most agents only need `shell` and/or `filesystem`. Avoid adding `think`, `todo`, or `memory` unless specifically required.
 
-The todo, memory, and script tools can be configured:
+### Toolset Configuration Examples
 
-Todos can be shared between different agents in a team
-```
+**Shared todos between agents:**
+```yaml
 agents:
-    root:
-        ...
-        toolsets:
-        - type: todo
-          shared: true
+  root:
+    toolsets:
+      - type: todo
+        shared: true
 ```
 
-Memory needs a path to the sqlite database file 
-
-```
+**Memory with database path:**
+```yaml
 agents:
-    root:
-        ...
-        toolsets:
-        - type: memory
-          path: "./agent_memory.db"
+  root:
+    toolsets:
+      - type: memory
+        path: "./agent_memory.db"
 ```
 
-Script tools allow you to define custom shell commands with typed parameters:
-
-```
+**Script tool with custom commands:**
+```yaml
 agents:
-    root:
-        ...
-        toolsets:
-        - type: script
-          shell:
-            get_ip:
-              cmd: "curl -s https://ipinfo.io | jq -r .ip"
-              description: "Get public IP address"
-            deploy_app:
-              cmd: "docker build -t $IMAGE_NAME . && docker run -d -p $PORT:8080 $IMAGE_NAME"
-              description: "Deploy application using Docker"
-              args:
-                IMAGE_NAME:
-                  type: string
-                  description: "Name for the Docker image"
-                PORT:
-                  type: string
-                  description: "Host port to bind to container port 8080"
-              required: ["IMAGE_NAME", "PORT"]
-              working_dir: "/app"
-              env:
-                DOCKER_BUILDKIT: "1"
-            list_repos:
-              cmd: "curl -s https://api.github.com/users/$username/repos | jq '.[].name'"
-              description: "List GitHub repositories for a user"
-              args:
-                username:
-                  type: string
-                  description: "GitHub username to get repositories for"
-              required: ["username"]
+  root:
+    toolsets:
+      - type: script
+        shell:
+          get_ip:
+            cmd: "curl -s https://ipinfo.io | jq -r .ip"
+            description: "Get public IP address"
+          deploy_app:
+            cmd: "docker build -t $IMAGE_NAME . && docker run -d -p $PORT:8080 $IMAGE_NAME"
+            description: "Deploy application using Docker"
+            args:
+              IMAGE_NAME:
+                type: string
+                description: "Name for the Docker image"
+              PORT:
+                type: string
+                description: "Host port to bind to container port 8080"
+            required: ["IMAGE_NAME", "PORT"]
+            working_dir: "/app"
+            env:
+              DOCKER_BUILDKIT: "1"
 ```
 
-Script tool configuration options:
-- `cmd`: The shell command to execute with $VARIABLE substitution (required)
-- `description`: Human-readable description of what the tool does (optional)
-- `args`: Parameters that can be passed to the command as environment variables (optional)
-- `required`: List of argument names that must be provided (optional, defaults to all args if args exist, empty array for no requirements)
-- `working_dir`: Directory to execute the command in (optional)
-- `env`: Static environment variables to set for the command (optional)
-
-Note: Arguments are substituted as environment variables in the command using $VARIABLE_NAME syntax.
-
-**Builtin tool selection constraints**
+Script tool options:
+- `cmd`: Shell command with $VARIABLE substitution (required)
+- `description`: What the tool does (optional)
+- `args`: Parameters passed as environment variables (optional)
+- `required`: Required argument names (optional)
+- `working_dir`: Execution directory (optional)
+- `env`: Static environment variables (optional)
 
-- This is very important so listen up, use the builtin tools only when absolutely necessary.
-- Most of the time `think`, `todo` or `memory` are not necessary.
-- Pick zero to two MCP servers from the docker MCP Catalog only if they will greatly improve the quality of the agent.
+### MCP Server Integration
 
-Example of using the `youtube_transcript` MCP server, from the docker MCP Catalog, using the docker MCP Gateway:
+You can add MCP (Model Context Protocol) servers from the Docker MCP Catalog to extend agent capabilities.
 
+**Single MCP server:**
 ```yaml
 agents:
-    root:
-        ...
-        toolsets:
-        - type: mcp
-          ref: docker:youtube_transcript
+  root:
+    toolsets:
+      - type: mcp
+        ref: docker:youtube_transcript
 ```
 
-**Discover which MCP Servers are available and useful**
-
-To discover which MCP servers are available with the MCP Gateway, run
-the following shell command. It lists every available server name and description:
+**Multiple MCP servers:**
+```yaml
+agents:
+  root:
+    toolsets:
+      - type: mcp
+        ref: docker:duckduckgo
+      - type: mcp
+        ref: docker:youtube_transcript
+```
 
+**Discovering available MCP servers:**
 ```console
-docker mcp catalog show
+docker mcp catalog show           # List all available servers
+docker mcp server inspect <name>  # View tools provided by a server
 ```
 
-To better understand which tools an MCP server offers, run this shell command:
+**Guideline:** Pick zero to two MCP servers only if they significantly improve the agent's capabilities for the task.
 
-```console
-docker mcp server inspect <server_name>
+### Model Configuration
+
+Define models that agents can reference:
+
+```yaml
+models:
+  model_name:
+    provider: string     # Provider: openai, anthropic, google, dmr
+    model: string        # Model name: gpt-4o, claude-sonnet-4-0, gemini-2.5-flash
+    max_tokens: integer  # Response length limit
 ```
 
-**Using multiple MCP Servers**
+### Complete Example
 
-Multiple MCP Servers can be configured when multiple tools are useful.
+Here's a simple developer agent:
 
 ```yaml
 agents:
   root:
-    ...
+    model: auto
+    description: A helpful coding assistant
+    instruction: |
+      You are a senior software developer. Help users with coding tasks,
+      debugging, and best practices. Always explain your reasoning.
     toolsets:
-    - type: mcp
-      ref: docker:duckduckgo
-    - type: mcp
-      ref: docker:youtube_transcript
-    - type: mcp
-      ref: docker:other
+      - type: shell
+      - type: filesystem
 ```
 
-### Model Configuration
+Here's a team with delegation:
 
 ```yaml
-models:
-  model_name:
-    provider: string # Provider: openai, anthropic, dmr
-    model: string # Model name: gpt-4o, claude-3-7-sonnet-latest
-    max_tokens: integer # Response length limit
+agents:
+  root:
+    model: auto
+    description: Project coordinator
+    instruction: |
+      Coordinate tasks between the researcher and writer agents.
+      Use researcher for gathering information, writer for creating content.
+    sub_agents: [researcher, writer]
+    toolsets:
+      - type: filesystem
+
+  researcher:
+    model: auto
+    description: Research specialist
+    instruction: Search the web and gather relevant information.
+    toolsets:
+      - type: mcp
+        ref: docker:duckduckgo
+
+  writer:
+    model: auto
+    description: Content writer
+    instruction: Create well-structured content based on research.
+    toolsets:
+      - type: filesystem
 ```