Skip to content

Commit 8f02dd4

Browse files
abrichrclaude
andcommitted
fix: correct OpenCUA attribution to macOS a11y code reuse
OpenCUA reused OpenAdapt's macOS accessibility tree capture code (AX API traversal functions + oa_atomacos dependency), not the full capture-to-deployment pipeline. The recorder architecture came from DuckTrack. Updated README, landing page strategy, competitor table, and proof points to reflect this accurately. Evidence: arxiv.org/html/2508.09123v3 Section 2.2, OpenCUA README "Acknowledge" section. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 6fa74e0 commit 8f02dd4

2 files changed

Lines changed: 5 additions & 5 deletions

File tree

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -298,7 +298,7 @@ The bottom-right cell is OpenAdapt's unique value: training models to **use** de
298298

299299
**Validated result**: On a controlled macOS benchmark (45 System Settings tasks sharing a common navigation entry point), demo-conditioned prompting improved first-action accuracy from 46.7% to 100%. A length-matched control (+11.1 pp only) confirms the benefit is semantic, not token-length. See the [research thesis](https://github.com/OpenAdaptAI/openadapt-ml/blob/main/docs/research_thesis.md) for methodology and the [publication roadmap](docs/publication-roadmap.md) for limitations.
300300

301-
**Industry validation**: [OpenCUA](https://github.com/xlang-ai/OpenCUA) (NeurIPS 2025 Spotlight, XLANG Lab) built their cross-platform capture tool on OpenAdapt, but uses demos only for training — not runtime conditioning. No open-source CUA framework currently does demo-conditioned inference, which remains OpenAdapt's architectural differentiator.
301+
**Industry validation**: [OpenCUA](https://github.com/xlang-ai/OpenCUA) (NeurIPS 2025 Spotlight, XLANG Lab) [reused OpenAdapt's macOS accessibility capture code](https://arxiv.org/html/2508.09123v3) in their AgentNetTool, but uses demos only for model training — not runtime conditioning. No open-source CUA framework currently does demo-conditioned inference, which remains OpenAdapt's architectural differentiator.
302302

303303
### Key Concepts
304304

docs/design/landing-page-strategy.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -42,7 +42,7 @@ OpenAdapt has evolved from a monolithic application (v0.46.0) to a **modular met
4242
**Key Innovation**:
4343
- **Trajectory-conditioned disambiguation of UI affordances** — the only open-source CUA framework that conditions agents on recorded demonstrations at runtime (validated: 46.7% → 100% first-action accuracy)
4444
- **Specialization over scale** — fine-tuned Qwen3-VL-2B outperforms Claude Sonnet 4.5 and GPT-5.1 on action accuracy (42.9% vs 11.2% vs 23.2%) on an internal benchmark
45-
- **Capture-to-deployment pipeline** — record → retrieve → train → deploy, used by [OpenCUA](https://github.com/xlang-ai/OpenCUA) (NeurIPS 2025 Spotlight) as foundation for their capture tooling
45+
- **Capture-to-deployment pipeline** — record → retrieve → train → deploy. [OpenCUA](https://github.com/xlang-ai/OpenCUA) (NeurIPS 2025 Spotlight) [reused OpenAdapt's macOS accessibility capture code](https://arxiv.org/html/2508.09123v3) in their AgentNetTool
4646
- **Set-of-Marks (SoM) mode**: 100% accuracy on synthetic benchmarks using element IDs instead of coordinates
4747

4848
### 1.2 Current Landing Page Assessment
@@ -223,7 +223,7 @@ Why: Clear 3-step process, action-oriented
223223
1. **Capture-to-Deployment Pipeline**
224224
- Not: "Prompt the AI to do your task"
225225
- But: "Record the task once. OpenAdapt handles the rest — retrieval, training, deployment."
226-
- Proof: 7 modular packages (capture, ML, evals, grounding, retrieval, privacy, viewer); OpenCUA (NeurIPS 2025) built on OpenAdapt's capture infrastructure
226+
- Proof: 7 modular packages (capture, ML, evals, grounding, retrieval, privacy, viewer); OpenCUA (NeurIPS 2025) [reused OpenAdapt's macOS a11y capture code](https://arxiv.org/html/2508.09123v3)
227227

228228
2. **Demonstration-Conditioned Agents**
229229
- Not: "Zero-shot reasoning about what to click"
@@ -270,7 +270,7 @@ Why: Clear 3-step process, action-oriented
270270
|------------|-----------|------------|---------------|
271271
| **Anthropic Computer Use** | 72.5% OSWorld (near-human), simple API | Proprietary, cloud-only, no customization, per-action cost | Open source, model-agnostic, trainable, runs locally |
272272
| **Agent S3 (Simular)** | 72.6% OSWorld (superhuman), open source | Zero-shot only, no demo conditioning, no fine-tuning pipeline | Demo-conditioned agents, capture-to-train pipeline |
273-
| **OpenCUA (XLANG Lab)** | NeurIPS Spotlight, 45% OSWorld, open models (7B-72B) | Zero-shot at inference — demos used only for training, not runtime | Runtime demo conditioning (unique), OpenCUA built on our capture tool |
273+
| **OpenCUA (XLANG Lab)** | NeurIPS Spotlight, 45% OSWorld, open models (7B-72B) | Zero-shot at inference — demos used only for training, not runtime | Runtime demo conditioning (unique); OpenCUA reused our macOS a11y code |
274274
| **Browser Use** | 50k+ GitHub stars, 89% WebVoyager | Browser-only, no desktop, no training pipeline | Full desktop support, fine-tuning, demo library |
275275
| **UI-TARS (ByteDance)** | Local models (2B-72B), Apache 2.0 | No demo conditioning, no capture pipeline | End-to-end record→train→deploy, demo retrieval |
276276
| **CUA / Bytebot** | Container infra, YC-backed | Infrastructure-only, no ML training pipeline | Full pipeline: capture + train + eval + deploy |
@@ -497,7 +497,7 @@ Example: Onboarding guides for complex internal tools.
497497

498498
- "46.7% → 100% first-action accuracy with demo conditioning (n=45, same model, no training)"
499499
- "Fine-tuned 2B model outperforms Claude Sonnet 4.5 on action accuracy (42.9% vs 11.2%, internal benchmark)"
500-
- "OpenCUA (NeurIPS 2025 Spotlight) built their capture tool on OpenAdapt"
500+
- "OpenCUA (NeurIPS 2025 Spotlight) reused OpenAdapt's macOS accessibility capture code in AgentNetTool"
501501
- "Only open-source CUA framework with runtime demo-conditioned inference"
502502
- "[X,XXX] PyPI downloads this month" (dynamic)
503503
- "[XXX] GitHub stars" (dynamic)

0 commit comments

Comments
 (0)