Probe is being reset.
This repository used to contain a Rust implementation of Probe. That tree is now
deprecated and removed from main. The old implementation remains available in
Git history as source material for protocol ideas, command ergonomics, tests,
runtime failure cases, and deployment lessons, but it is not the new runtime
contract and should not be revived as a compatibility layer by default.
The intended product surface is still probe: the first-party OpenAgents coding
agent runtime. It should be one coherent surface for coding agents that can run
with API-key model providers, local inference, swarm inference, Codex-style
hosted execution, and future OpenAgents inference routes without exposing those
implementation choices as product names. Probe should be opinionated around the
OpenAgents stack: Omega dispatch, OpenAgents Sync events, SHC boxes, Pylons,
forum-thread workrooms, artifact receipts, and explicit product acceptance.
OpenCode is an important inspiration point, especially for the shape of a modern coding-agent runtime: durable sessions, a clean tool registry, permission flow, local control plane, and a focused CLI. Probe will not wrap, fork, or vendor OpenCode as its base. The goal is a first-party runtime that learns from OpenCode while fitting OpenAgents infrastructure more tightly than a generic coding agent can.
Pylons should eventually be able to spawn Probes as local or remote work executors. A Pylon-hosted Probe may combine local machine capability, swarm compute, and API-backed model calls behind one assignment contract. The user or OpenAgents product surface should ask for a coding outcome, not choose between a pile of separate agent products.
The same runtime should also be deployable into sandboxes from
openagents.com. Forum threads, workroom requests, and operator actions should
be able to create bounded Probe assignments that run in SHC boxes or other
approved sandboxes, emit redacted events back to Omega, produce artifacts, and
wait for explicit acceptance before any product or writeback authority is
treated as complete.
main is intentionally clean while the refactor begins.
Tracked files should stay minimal until the new architecture lands:
README.mdexplains the reset and target direction.package.jsondefines a Bun workspace/catalog layout modeled after the OpenCode workspace style while pinning the Effect line used by Omega.packages/runtime/src/contracts/provider-account.tsdefines the first Probe/Omega ChatGPT/Codex account contract.packages/runtime/src/contracts/assignment.tsdefines the Probe run assignment contract for Omega refs and grants.packages/runtime/src/omega/grant-client.tsresolves Omega auth grants into Probe materialization plans.packages/runtime/src/auth/materializer.tsmaterializes brokered auth only inside per-run env/file targets and scrubs it on closeout.packages/runtime/src/runner/identity.tsgates Omega grant use on linked Probe runner identity before resolving and materializing auth.packages/runtime/src/runtime/backend-assignment.tsroutes no-auth Apple FM assignments through live backend health and emits backend run events.packages/runtime/src/cli.tsexposes the first Probe CLI commands for Omega linking, account management, and Gemini chat through local keys or the Omega broker fallback.packages/runtime/src/fleet/telemetry.tsreports auth/account health signals and requests Omega failover without locally iterating raw account tokens.packages/runtime/src/fleet/token-usage.tsemits redacted Probe inference token-usage events to Omega for global Stats totals and opt-out-aware leaderboards.packages/runtime/src/fleet/backend-capability.tsreports Apple FM backend capability for Pylon/SHC/sandbox routing from live health.packages/runtime/src/backends/registry.tsregisters the first Apple FM backend profile.packages/runtime/src/backends/apple-fm/contract.tsdefines the Effect v4 Apple FM backend contract.packages/runtime/src/backends/apple-fm/client.tsattaches to Apple FM bridge health and preserves typed availability.packages/runtime/src/backends/apple-fm/receipts.tsdefines redacted Apple FM availability, failure, and transcript receipts.packages/runtime/src/backends/apple-fm/tools.tsdefines the Probe-owned Apple FM tool-callback session runtime.packages/runtime/src/backends/apple-fm/acceptance.tsdefines retained Apple FM acceptance cases and comparison receipts.packages/runtime/src/benchmark/defines the public-safe benchmark closeout bundle writer and retained Terminal-Bench fixture package for GEPA Stage 0/1 optimization handles.packages/runtime/src/blueprint/defines the narrowed Blueprint consumer contracts, static registry fixtures, and registry source client Probe needs before live Omega Blueprint routes exist, plus the typed signature lookup service, backend-independent tool menu planner, and local Program Run evidence contracts used for backend preflight.docs/probe-omega-auth-contract.mdrecords the implemented account-contract slice.docs/probe-omega-run-assignment.mdrecords the implemented assignment and grant-resolution slice, including Blueprint-scoped assignment refs for Pylon/Omega dispatch.docs/probe-auth-materialization.mdrecords the implemented per-run auth materialization slice.docs/probe-runner-identity.mdrecords the SHC/Pylon/sandbox runner identity slice.docs/probe-cli-omega-auth.mdrecords the CLI account-management slice.docs/probe-fleet-telemetry.mdrecords the fleet telemetry and failover slice.docs/probe-token-usage-telemetry.mdrecords the Probe-to-Omega token usage producer, privacy boundaries, opt-out controls, and fake-endpoint test path.docs/2026-06-07-apple-fm-first-backend-audit.mdaudits the previous Apple FM implementations and records the plan for making Apple FM the first supported backend in the new Probe runtime.docs/probe-apple-fm-backend.mdrecords the implemented Apple FM backend contract slice.docs/apple-fm-admitted-mac-acceptance.mdrecords the live admitted-Mac Apple FM acceptance runbook.docs/2026-06-07-blueprint-signature-lookup-apple-fm-tool-use-audit.mdaudits Guidance Module and Blueprint prior art and records the roadmap for shared Blueprint signature lookup feeding Apple FM and other Probe backends.docs/probe-blueprint-consumer-contracts.mdrecords the local narrowed Blueprint mirror and safety posture.docs/probe-blueprint-signature-lookup-service.mdrecords the typed lookup service and no-keyword-routing selector posture.docs/probe-blueprint-tool-menu-planner.mdrecords the backend-independent Probe tool menu planner.docs/probe-blueprint-program-run-evidence.mdrecords the evidence-only Program Run shape emitted by Apple FM tool streams.docs/probe-blueprint-backend-capability-routing.mdrecords the Blueprint-aware backend capability facts used by Pylon/SHC routing.docs/probe-blueprint-action-submission-boundary.mdrecords the proposal-only write boundary for external effects.docs/probe-blueprint-contribution-release-gates.mdrecords the Probe-side contribution draft and release-gate model.docs/probe-benchmark-contracts.mdrecords the Probe benchmark assignment, closeout, candidate, promotion, and closeout bundle writer contracts.docs/probe-retained-terminal-bench-fixtures.mdrecords the public-ref-only retained Terminal-Bench fixture package for GEPA Stage 0/1.docs/2026-06-08-probe-gepa-live-network-system-audit.mdrecords the current cross-repo Probe GEPA live-network system state after the SHC Harbor smoke, Benchmark Cloud runner work, Psionic live import path, Omega route scorecard/product evidence path, and Artanis projection updates..gitignorekeeps local build/cache noise out of the repo.- Add a license file before publishing or distributing a new runtime artifact.
Do not add compatibility scaffolding only because the old repo had it. Search for real callers first. If a deprecated command, protocol, adapter, or runtime mode has no current caller and no clear final-product role, harvest its useful idea into tests or notes and leave the surface deleted.
The next Probe should start from the final shape:
probeas the only runtime/product name.- A typed assignment and event protocol shared deliberately with Omega.
- A durable session and event log that can resume, replay, and close out runs.
- A single tool and approval model across local, swarm, API-key, and Codex-style inference routes.
- SHC-first deployment, with later Pylon and other sandbox hosts using the same assignment contract.
- Redacted artifacts and events suitable for OpenAgents Sync and
openagents.comworkrooms. - Explicit separation between runtime completion and product acceptance.
The refactor should optimize for the code that should exist, not the smallest diff from the deprecated implementation.