You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: python_agent/README.md
+2-14Lines changed: 2 additions & 14 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,9 +4,6 @@
4
4
5
5
This agent provides a general-purpose, sandboxed environment for executing Python code to accomplish user-defined tasks. It leverages a Large Language Model (LLM) to interpret a natural language task, generate Python code, and execute it within a Docker container. The agent operates by creating an interactive session with a [Jupyter kernel](https://docs.jupyter.org/en/latest/projects/kernels.html) running inside the container, allowing it to iteratively write code, execute it, and use the output to inform its next steps until the task is complete.
6
6
7
-
## Intended Use
8
-
9
-
The agent is designed for a wide range of tasks that can be solved programmatically with Python.
10
7
11
8
## Environment
12
9
@@ -18,16 +15,7 @@ To run this agent, a Docker daemon must be available and running on the host mac
18
15
-`restart_kernel`
19
16
-`complete_task`
20
17
21
-
## Features
22
-
23
-
-**Sandboxed Execution**: All code is executed within a secure and isolated Docker container, preventing unintended side effects on the host machine.
24
-
-**Customizable Environment**: Users can specify any Docker image for the execution environment and mount local directories as volumes into the container.
25
-
-**LLM-Powered Task Resolution**: The agent takes a high-level, natural language task and intelligently generates and executes the code needed to complete it.
26
-
-**Interactive Code Execution**: Provides tools for the LLM to `execute_code` and `restart_kernel`, allowing for an interactive and stateful problem-solving process.
27
-
-**Task Completion Reporting**: The agent can explicitly mark a task as complete with a success or failure status and a final summary.
28
-
-**Step-by-Step Iteration**: The agent operates within a defined loop with a maximum number of steps (max_steps) to ensure termination.
29
-
-**Artifact Logging**: Upon completion, the agent can log the entire working directory as an artifact to Dreadnode, preserving any generated files.
30
18
31
-
## References
19
+
## Examples
32
20
33
-
- None
21
+
`uv run python_agent/main.py --model "anthropic/claude-haiku-4-5-20251001" --task "please generate an interesting dataset, and visualize it"`
0 commit comments