Skip to content

OpenAgentsInc/probe

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

335 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Probe

Probe is being reset.

This repository used to contain a Rust implementation of Probe. That tree is now deprecated and removed from main. The old implementation remains available in Git history as source material for protocol ideas, command ergonomics, tests, runtime failure cases, and deployment lessons, but it is not the new runtime contract and should not be revived as a compatibility layer by default.

The intended product surface is still probe: the first-party OpenAgents coding agent runtime. It should be one coherent surface for coding agents that can run with API-key model providers, local inference, swarm inference, Codex-style hosted execution, and future OpenAgents inference routes without exposing those implementation choices as product names. Probe should be opinionated around the OpenAgents stack: Omega dispatch, OpenAgents Sync events, SHC boxes, Pylons, forum-thread workrooms, artifact receipts, and explicit product acceptance.

OpenCode is an important inspiration point, especially for the shape of a modern coding-agent runtime: durable sessions, a clean tool registry, permission flow, local control plane, and a focused CLI. Probe will not wrap, fork, or vendor OpenCode as its base. The goal is a first-party runtime that learns from OpenCode while fitting OpenAgents infrastructure more tightly than a generic coding agent can.

Pylons should eventually be able to spawn Probes as local or remote work executors. A Pylon-hosted Probe may combine local machine capability, swarm compute, and API-backed model calls behind one assignment contract. The user or OpenAgents product surface should ask for a coding outcome, not choose between a pile of separate agent products.

The same runtime should also be deployable into sandboxes from openagents.com. Forum threads, workroom requests, and operator actions should be able to create bounded Probe assignments that run in SHC boxes or other approved sandboxes, emit redacted events back to Omega, produce artifacts, and wait for explicit acceptance before any product or writeback authority is treated as complete.

Current Branch State

main is intentionally clean while the refactor begins.

Tracked files should stay minimal until the new architecture lands:

  • README.md explains the reset and target direction.
  • package.json defines a Bun workspace/catalog layout modeled after the OpenCode workspace style while pinning the Effect line used by Omega.
  • packages/runtime/src/contracts/provider-account.ts defines the first Probe/Omega ChatGPT/Codex account contract.
  • packages/runtime/src/contracts/assignment.ts defines the Probe run assignment contract for Omega refs and grants.
  • packages/runtime/src/omega/grant-client.ts resolves Omega auth grants into Probe materialization plans.
  • packages/runtime/src/auth/materializer.ts materializes brokered auth only inside per-run env/file targets and scrubs it on closeout.
  • packages/runtime/src/runner/identity.ts gates Omega grant use on linked Probe runner identity before resolving and materializing auth.
  • packages/runtime/src/runtime/backend-assignment.ts routes no-auth Apple FM assignments through live backend health and emits backend run events.
  • packages/runtime/src/cli.ts exposes the first Probe CLI commands for Omega linking, account management, and Gemini chat through local keys or the Omega broker fallback.
  • packages/runtime/src/fleet/telemetry.ts reports auth/account health signals and requests Omega failover without locally iterating raw account tokens.
  • packages/runtime/src/fleet/token-usage.ts emits redacted Probe inference token-usage events to Omega for global Stats totals and opt-out-aware leaderboards.
  • packages/runtime/src/fleet/backend-capability.ts reports Apple FM backend capability for Pylon/SHC/sandbox routing from live health.
  • packages/runtime/src/backends/registry.ts registers the first Apple FM backend profile.
  • packages/runtime/src/backends/apple-fm/contract.ts defines the Effect v4 Apple FM backend contract.
  • packages/runtime/src/backends/apple-fm/client.ts attaches to Apple FM bridge health and preserves typed availability.
  • packages/runtime/src/backends/apple-fm/receipts.ts defines redacted Apple FM availability, failure, and transcript receipts.
  • packages/runtime/src/backends/apple-fm/tools.ts defines the Probe-owned Apple FM tool-callback session runtime.
  • packages/runtime/src/backends/apple-fm/acceptance.ts defines retained Apple FM acceptance cases and comparison receipts.
  • packages/runtime/src/benchmark/ defines the public-safe benchmark closeout bundle writer and retained Terminal-Bench fixture package for GEPA Stage 0/1 optimization handles.
  • packages/runtime/src/blueprint/ defines the narrowed Blueprint consumer contracts, static registry fixtures, and registry source client Probe needs before live Omega Blueprint routes exist, plus the typed signature lookup service, backend-independent tool menu planner, and local Program Run evidence contracts used for backend preflight.
  • docs/probe-omega-auth-contract.md records the implemented account-contract slice.
  • docs/probe-omega-run-assignment.md records the implemented assignment and grant-resolution slice, including Blueprint-scoped assignment refs for Pylon/Omega dispatch.
  • docs/probe-auth-materialization.md records the implemented per-run auth materialization slice.
  • docs/probe-runner-identity.md records the SHC/Pylon/sandbox runner identity slice.
  • docs/probe-cli-omega-auth.md records the CLI account-management slice.
  • docs/probe-fleet-telemetry.md records the fleet telemetry and failover slice.
  • docs/probe-token-usage-telemetry.md records the Probe-to-Omega token usage producer, privacy boundaries, opt-out controls, and fake-endpoint test path.
  • docs/2026-06-07-apple-fm-first-backend-audit.md audits the previous Apple FM implementations and records the plan for making Apple FM the first supported backend in the new Probe runtime.
  • docs/probe-apple-fm-backend.md records the implemented Apple FM backend contract slice.
  • docs/apple-fm-admitted-mac-acceptance.md records the live admitted-Mac Apple FM acceptance runbook.
  • docs/2026-06-07-blueprint-signature-lookup-apple-fm-tool-use-audit.md audits Guidance Module and Blueprint prior art and records the roadmap for shared Blueprint signature lookup feeding Apple FM and other Probe backends.
  • docs/probe-blueprint-consumer-contracts.md records the local narrowed Blueprint mirror and safety posture.
  • docs/probe-blueprint-signature-lookup-service.md records the typed lookup service and no-keyword-routing selector posture.
  • docs/probe-blueprint-tool-menu-planner.md records the backend-independent Probe tool menu planner.
  • docs/probe-blueprint-program-run-evidence.md records the evidence-only Program Run shape emitted by Apple FM tool streams.
  • docs/probe-blueprint-backend-capability-routing.md records the Blueprint-aware backend capability facts used by Pylon/SHC routing.
  • docs/probe-blueprint-action-submission-boundary.md records the proposal-only write boundary for external effects.
  • docs/probe-blueprint-contribution-release-gates.md records the Probe-side contribution draft and release-gate model.
  • docs/probe-benchmark-contracts.md records the Probe benchmark assignment, closeout, candidate, promotion, and closeout bundle writer contracts.
  • docs/probe-retained-terminal-bench-fixtures.md records the public-ref-only retained Terminal-Bench fixture package for GEPA Stage 0/1.
  • docs/2026-06-08-probe-gepa-live-network-system-audit.md records the current cross-repo Probe GEPA live-network system state after the SHC Harbor smoke, Benchmark Cloud runner work, Psionic live import path, Omega route scorecard/product evidence path, and Artanis projection updates.
  • .gitignore keeps local build/cache noise out of the repo.
  • Add a license file before publishing or distributing a new runtime artifact.

Do not add compatibility scaffolding only because the old repo had it. Search for real callers first. If a deprecated command, protocol, adapter, or runtime mode has no current caller and no clear final-product role, harvest its useful idea into tests or notes and leave the surface deleted.

Direction

The next Probe should start from the final shape:

  • probe as the only runtime/product name.
  • A typed assignment and event protocol shared deliberately with Omega.
  • A durable session and event log that can resume, replay, and close out runs.
  • A single tool and approval model across local, swarm, API-key, and Codex-style inference routes.
  • SHC-first deployment, with later Pylon and other sandbox hosts using the same assignment contract.
  • Redacted artifacts and events suitable for OpenAgents Sync and openagents.com workrooms.
  • Explicit separation between runtime completion and product acceptance.

The refactor should optimize for the code that should exist, not the smallest diff from the deprecated implementation.

About

An opinionated coding agent

Resources

Stars

Watchers

Forks

Contributors