Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
127 commits
Select commit Hold shift + click to select a range
1862e2a
docs: design beta readiness docs
tky0065 May 18, 2026
5c66088
docs: plan beta readiness docs
tky0065 May 18, 2026
e59f243
docs: add beta guide
tky0065 May 18, 2026
90b2bf2
docs: add failed run issue template
tky0065 May 18, 2026
0cec185
docs: add provider guide
tky0065 May 18, 2026
f887c55
docs: link beta resources
tky0065 May 18, 2026
4b9845f
docs: track beta readiness lacunes
tky0065 May 18, 2026
0fae29b
docs: refine lacune tracking status
tky0065 May 18, 2026
5f50744
docs: design dev quality evals
tky0065 May 18, 2026
2522cce
docs: plan dev quality evals
tky0065 May 18, 2026
3e62f06
docs: add dev quality gate
tky0065 May 18, 2026
1ae939e
test: add dev eval fixtures
tky0065 May 18, 2026
15e668f
test: strengthen http api eval fixture
tky0065 May 18, 2026
8c6984e
test: add dev output checker
tky0065 May 18, 2026
61633bf
test: restrict dev eval scenario commands
tky0065 May 18, 2026
462a5cd
test: harden dev eval checker
tky0065 May 18, 2026
3af561f
test: constrain dev eval commands
tky0065 May 18, 2026
caa4a79
test: preserve empty eval fixture entries
tky0065 May 18, 2026
0debe2a
test: isolate dev eval command runs
tky0065 May 18, 2026
a27254b
test: reject symlinked eval project trees
tky0065 May 18, 2026
8b96674
test: check eval symlinks in generic mode
tky0065 May 18, 2026
e4e437a
docs: update quality eval lacunes
tky0065 May 18, 2026
950fe2b
feat: close lacunes 3,11,12,13,14,16,17,18,21,24
tky0065 May 18, 2026
e897c95
docs: design security secrets hardening
tky0065 May 18, 2026
fbe6b72
docs: plan security secrets hardening
tky0065 May 18, 2026
a7e390d
feat: add secret redactor
tky0065 May 18, 2026
2e792ce
test: harden secret env redaction test
tky0065 May 18, 2026
2dc60a4
feat: redact secrets in run artifacts
tky0065 May 19, 2026
1d3c1b3
feat: redact email tool output
tky0065 May 19, 2026
a45eb8c
feat: redact web search context
tky0065 May 19, 2026
9d6d90f
test: add adversarial tool coverage
tky0065 May 19, 2026
694d349
docs: add security threat model
tky0065 May 19, 2026
1c4f76d
docs: add custom validation design
tky0065 May 19, 2026
66c0be5
docs: add custom validation implementation plan
tky0065 May 19, 2026
af06610
chore: ignore local worktrees
tky0065 May 19, 2026
df841b6
feat: add custom validation report types
tky0065 May 19, 2026
9a76733
fix: allow staged validation module
tky0065 May 19, 2026
a8f3b7a
feat: validate custom agent definitions
tky0065 May 19, 2026
684923e
fix: report missing custom agent fields
tky0065 May 19, 2026
d2f5868
fix: isolate custom validation tests
tky0065 May 19, 2026
07834b2
fix: align custom agent validation with generated formats
tky0065 May 19, 2026
69fc335
fix: canonicalize custom agent tools
tky0065 May 19, 2026
e683ffe
feat: validate custom workflow definitions
tky0065 May 19, 2026
d268bdf
fix: report missing workflow step fields
tky0065 May 19, 2026
4530b2e
fix: harden custom workflow validation
tky0065 May 19, 2026
fec481d
feat: block invalid custom workflows
tky0065 May 19, 2026
ef0675b
fix: avoid cwd mutation in workflow validation test
tky0065 May 19, 2026
3da31b5
feat: add custom validation cli
tky0065 May 19, 2026
9c45fbd
fix: remove staged validation dead code allow
tky0065 May 19, 2026
4309491
feat: add validate repl command
tky0065 May 19, 2026
9a96539
fix: add validate to command palette
tky0065 May 19, 2026
a5c44f1
docs: document custom validation
tky0065 May 19, 2026
c4c263d
docs: remove stale custom validation next step
tky0065 May 19, 2026
1304d9f
docs: align lacunes with custom validation completion
tky0065 May 19, 2026
6316fb9
merge: custom validation
tky0065 May 19, 2026
72f72ce
docs: design run observability
tky0065 May 20, 2026
f336f0b
docs: plan run observability
tky0065 May 20, 2026
9026aff
feat: add run report model
tky0065 May 20, 2026
7f4a307
fix: clean run report model warnings
tky0065 May 20, 2026
53ac4eb
feat: collect run report events
tky0065 May 20, 2026
1429393
fix: harden run report aggregation
tky0065 May 20, 2026
dd32635
feat: persist redacted run reports
tky0065 May 20, 2026
fa15187
feat: write run reports from orchestrator
tky0065 May 20, 2026
b720e5c
fix: flush run report events before finalizing
tky0065 May 20, 2026
4f9c0b3
fix: acknowledge run report and log flushes
tky0065 May 20, 2026
3754774
docs: document run reports
tky0065 May 20, 2026
92d8a47
merge: run observability
tky0065 May 20, 2026
1474150
docs: design resume checkpoints
tky0065 May 20, 2026
8498a15
docs: plan resume checkpoints
tky0065 May 20, 2026
5df6aa8
feat: add checkpoint model
tky0065 May 20, 2026
14ceaf0
test: cover checkpoint conflict API
tky0065 May 20, 2026
cd217bc
feat: persist resume checkpoints
tky0065 May 20, 2026
ee02a18
fix: validate checkpoint file paths
tky0065 May 20, 2026
5ecf06d
feat: validate checkpoint resume
tky0065 May 20, 2026
3cefb05
fix: polish checkpoint resume errors
tky0065 May 20, 2026
2cd2fec
feat: write dev workflow checkpoints
tky0065 May 20, 2026
1845122
fix: refresh checkpoint hashes after dev fixes
tky0065 May 20, 2026
4cceb22
feat: resume dev workflow phases
tky0065 May 20, 2026
8589435
fix: validate dev resume checkpoint consistency
tky0065 May 21, 2026
d755a00
feat: persist checkpoint terminal status
tky0065 May 21, 2026
e1f75b5
docs: document resume checkpoints
tky0065 May 21, 2026
7e6e74b
fix: harden checkpoint resume finalization
tky0065 May 21, 2026
227d592
fix: require tracked artifacts for resumed phases
tky0065 May 21, 2026
b41d479
docs: design adversarial security coverage
tky0065 May 21, 2026
3fbcb69
docs: plan adversarial security coverage
tky0065 May 21, 2026
a63e340
test: cover adversarial web search context
tky0065 May 21, 2026
66d30da
test: cover adversarial custom definitions
tky0065 May 21, 2026
6030313
fix: validate custom workflow role names
tky0065 May 21, 2026
ca02cbe
test: cover composed tool boundary attacks
tky0065 May 21, 2026
d946e35
test: cover email safety defaults
tky0065 May 21, 2026
c77288b
test: cover updater suspicious inputs
tky0065 May 21, 2026
6204f1c
docs: close adversarial security gaps
tky0065 May 21, 2026
617d80f
docs: clarify remaining security limits
tky0065 May 21, 2026
706dc82
docs: refresh custom validation threat control
tky0065 May 21, 2026
f06d616
style: format adversarial security coverage
tky0065 May 21, 2026
b3488bc
docs: design concurrency cancellation stress tests
tky0065 May 23, 2026
69192ef
docs: plan concurrency cancellation stress tests
tky0065 May 23, 2026
68dd239
test: add orchestrator stress helpers
tky0065 May 23, 2026
3d3e763
test: cover slow workflow cancellation
tky0065 May 23, 2026
2abdc64
test: make slow cancellation handshake deterministic
tky0065 May 23, 2026
f485038
test: cover workflow failure event drain
tky0065 May 23, 2026
6642ca4
fix: emit final tasks before watcher shutdown
tky0065 May 23, 2026
9043a0d
fix: order final tasks snapshot after watcher shutdown
tky0065 May 23, 2026
e79f9dc
test: cover dropped event receiver
tky0065 May 23, 2026
187c902
test: assert dropped receiver report events
tky0065 May 23, 2026
a3e143d
test: cover parallel worker failure
tky0065 May 23, 2026
992d972
test: strengthen parallel worker failure coverage
tky0065 May 23, 2026
3fd3a90
test: cover parallel event burst reporting
tky0065 May 23, 2026
fbee219
test: strengthen parallel event burst assertions
tky0065 May 23, 2026
a43169d
test: cover cancelled run artifacts
tky0065 May 23, 2026
ca09ecf
docs: mark concurrency stress coverage complete
tky0065 May 23, 2026
223bf52
docs: design budget and tui smoke coverage
tky0065 May 23, 2026
ad06aad
docs: plan budget and tui smoke coverage
tky0065 May 23, 2026
1eb3ffe
test: add tui smoke coverage
tky0065 May 24, 2026
0d5d691
feat: add run budget enforcement
tky0065 May 24, 2026
8048197
test: harden budget and tui coverage
tky0065 May 24, 2026
13f5cda
docs: mark budget and tui smoke lacunes complete
tky0065 May 24, 2026
c4ea46d
docs: design lacunes tracking consolidation
tky0065 May 24, 2026
209241f
docs: plan lacunes tracking consolidation
tky0065 May 24, 2026
462715f
docs: consolidate lacunes tracking
tky0065 May 24, 2026
0d6d304
docs: design local release smoke
tky0065 May 24, 2026
cf0f7d4
docs: plan local release smoke
tky0065 May 24, 2026
3437fdf
chore: add local release smoke script
tky0065 May 24, 2026
c38c720
docs: document local release smoke
tky0065 May 24, 2026
cfa6c26
docs: mark local release smoke complete
tky0065 May 24, 2026
55e9826
chore: add gstack skill routing rules to CLAUDE.md
tky0065 Jun 19, 2026
100c319
cockpit report TUI + markdown rendering improvements
tky0065 Jun 19, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
78 changes: 78 additions & 0 deletions .github/ISSUE_TEMPLATE/failed_run.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,78 @@
---
name: Failed Cortex run
about: Report a workflow run that failed, stalled, or produced unusable output
labels: bug, run-failure
assignees: ''
---

## Summary

What were you trying to generate or review?

## Command

```bash
cortex start "..." --auto
```

Or paste the REPL slash command you used.

## Environment

- Cortex version:
- OS:
- Install method: installer / cargo / release binary
- Workflow: dev / code-review / marketing / prospecting / custom
- Provider:
- Model:
- Web search enabled: yes / no

## Expected result

What did you expect Cortex to create or report?

## Actual result

What happened instead?

## Failure point

- [ ] Provider/auth error
- [ ] Workflow stalled
- [ ] Tool execution failed
- [ ] Build/test failed in generated project
- [ ] Generated files were missing
- [ ] Generated files were low quality or inconsistent
- [ ] TUI/input/resume issue
- [ ] Other

## Logs and artifacts

Paste the smallest useful excerpt. Redact secrets before posting.

Safe to include:

- Error messages.
- Final summary.
- Generated project tree.
- Non-sensitive command output.
- `cortex.run.json` after reviewing it for private project details.

Do not include:

- API keys.
- OAuth tokens.
- SMTP credentials.
- Private customer data.
- Proprietary source code unless you are allowed to share it.
- Full `cortex.log` output unless you have reviewed and minimized it.

## Reproduction steps

1. Configure provider:
2. Run command:
3. Observe:

## Additional context

Any provider limits, unusual project files, custom agents, custom workflows, or resume steps involved?
70 changes: 70 additions & 0 deletions .github/ISSUE_TEMPLATE/quality_report.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,70 @@
---
name: Generated project quality report
about: Report poor quality, inconsistent, or incomplete output from a Cortex workflow
labels: quality, generated-output
assignees: ''
---

## Summary

What was wrong with the generated project?

## Command used

```bash
cortex start "..." --auto --workflow dev
```

## Environment

- Cortex version: (`cortex --version`)
- OS:
- Provider:
- Model:
- Workflow: dev / code-review / marketing / prospecting / custom

## Expected quality

What would a good output look like? (e.g. "a working Rust CLI with tests and a Dockerfile")

## Actual quality

What did Cortex produce instead? Describe the specific problem:

- [ ] Missing files (list them)
- [ ] Build fails in generated project
- [ ] Tests fail or are missing
- [ ] Dockerfile invalid or missing
- [ ] README missing or wrong instructions
- [ ] Specs / architecture don't match generated code
- [ ] Repeated or contradictory content
- [ ] TODO / placeholder code left in output
- [ ] Other

## Generated project structure

Paste the output of `find <project-dir> -type f` or a tree listing.

```
<paste here>
```

## Error output (if any)

```
<paste build/test/lint output here>
```

## Eval checker result (if you ran it)

```bash
evals/check_dev_output.sh <project-dir>
```

```
<paste output here>
```

## Additional context

Any custom agents, custom workflows, or unusual config involved?
48 changes: 48 additions & 0 deletions .github/ISSUE_TEMPLATE/security_report.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,48 @@
---
name: Security report
about: Report a security vulnerability, secret exposure, or unsafe behavior
labels: security
assignees: ''
---

<!-- IMPORTANT: For critical vulnerabilities (RCE, secret exfiltration, supply chain), consider using GitHub's private Security Advisory feature instead of a public issue. -->

## Summary

A one-sentence description of the security issue.

## Category

- [ ] Prompt injection (LLM output used to construct dangerous commands or paths)
- [ ] Path traversal (file access outside the project output directory)
- [ ] Secret exposure (API key, token, or credential leaked in logs, outputs, or generated files)
- [ ] Unsafe command execution (terminal tool bypass or non-allowlisted command)
- [ ] Supply chain (dependency vulnerability, binary tampering)
- [ ] Other

## Environment

- Cortex version: (`cortex --version`)
- OS:
- Install method: installer / cargo / release binary
- Provider used:

## Steps to reproduce

Describe how to trigger the issue. Include the minimum input needed:

1. Configure / install:
2. Run command:
3. Observe:

## Impact

What can an attacker do? What data is exposed or what action can be triggered?

## Suggested fix

If you have an idea for a fix, describe it here. Otherwise leave blank.

## Evidence

Paste logs, generated file snippets, or tool outputs that demonstrate the issue. **Redact any real secrets before posting.**
20 changes: 20 additions & 0 deletions .github/workflows/ci.yml
Original file line number Diff line number Diff line change
Expand Up @@ -60,3 +60,23 @@ jobs:
- uses: dtolnay/rust-toolchain@stable
- uses: Swatinem/rust-cache@v2
- run: cargo build --release

audit:
name: Security audit
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- uses: dtolnay/rust-toolchain@stable
- uses: Swatinem/rust-cache@v2
- run: cargo install cargo-audit --locked
- run: cargo audit

deny:
name: License & dependency check
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- uses: dtolnay/rust-toolchain@stable
- uses: Swatinem/rust-cache@v2
- run: cargo install cargo-deny --locked
- run: cargo deny check
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -1,4 +1,5 @@
/target
.worktrees/
.claude/worktrees/
cortex-output/
cortex.log
19 changes: 19 additions & 0 deletions CLAUDE.md
Original file line number Diff line number Diff line change
Expand Up @@ -209,3 +209,22 @@ web_search_enabled = false # enable with /websearch enable in the REPL

## DOCUMENTATION
- Updated `README.md` and `site` if we added new features or changed usage instructions.

## Skill routing

When the user's request matches an available skill, invoke it via the Skill tool. When in doubt, invoke the skill.

Key routing rules:
- Product ideas/brainstorming → invoke /office-hours
- Strategy/scope → invoke /plan-ceo-review
- Architecture → invoke /plan-eng-review
- Design system/plan review → invoke /design-consultation or /plan-design-review
- Full review pipeline → invoke /autoplan
- Bugs/errors → invoke /investigate
- QA/testing site behavior → invoke /qa or /qa-only
- Code review/diff check → invoke /review
- Visual polish → invoke /design-review
- Ship/deploy/PR → invoke /ship or /land-and-deploy
- Save progress → invoke /context-save
- Resume context → invoke /context-restore
- Author a backlog-ready spec/issue → invoke /spec
Loading
Loading