feat(harness): add memoize sub-action for memory hygiene by datashaman · Pull Request #8 · datashaman/code-skills

datashaman · 2026-05-02T05:24:42Z

Closes #5.

Issue

Memory under ~/.claude/projects/<slug>/memory/ is reactive — entries get written when the agent notices something worth saving, but nothing prunes or consolidates. Over time, project memories cite renamed files, feedback memories accumulate near-duplicates, and MEMORY.md drifts out of sync with the actual files. The current "verify before recommending from memory" rule in CLAUDE.md is a read-time check, not a maintenance pass.

Effect on the user

Stale memory leaks into responses. The agent recalls a function that no longer exists, or repeats the same feedback rule three different ways because nobody noticed the duplicate. Trust in memory erodes, and the user ends up suppressing the system rather than maintaining it.

Root cause

There's no maintenance surface. The harness has install / update / doctor for hooks and commands, snapshot + audit for the broader setup, but nothing that touches memory. The only existing safeguard runs at read-time, per-recall, and can't fix structural drift like a missing index entry or a frontmatter typo.

The fix

A new memoize sub-action — deterministic, idempotent, report-only.

skills/harness/scripts/memoize.sh runs four checks against ~/.claude/projects/<slug>/memory/:

Index sync — every memory/*.md is listed in MEMORY.md; every entry points at a real file.
Frontmatter hygiene — every memory has name, description, type.
Stale citations — path-shaped tokens (~/, /Users/, ./, or ending in a known source extension) that resolve nowhere across ~/.claude/projects/ and ~/Projects/. Conservative on purpose — false positives cost more than misses; slash-commands, brace expansions, and regex-shaped strings are filtered out.
Possible duplicates — pairs within a type whose name or description are lexically similar (Jaccard ≥ 0.5). Flag, don't merge.

Output goes to <memory>/_memoize-report.md (leading underscore so it's never indexed by MEMORY.md). The report is byte-stable on equal runs — running twice produces an identical file. Flags: --dry-run, --target=PATH. Env knobs mirror snapshot.sh: CLAUDE_DIR, USER_PROJECT_KEY, MEMOIZE_SEARCH_ROOTS.

skills/harness/scripts/memoize-prompt.md is the /schedule recipe for a weekly remote routine that runs the script, layers in the semantic checks the lexical pass can't catch (conceptual duplicates, outdated facts, conflicting guidance), and PRs audits/memory/YYYY-MM-DD.md against the snapshot repo. Suggested cron 0 6 * * 0 (Sunday 06:00 UTC), model claude-opus-4-7. The remote agent never modifies memory directly — the user reviews and applies.

skills/harness/SKILL.md picks up memoize in the frontmatter description (so the skill auto-routes on "memoize" / "consolidate memory" / "prune memory"), a row in the dispatch table, a new ## memoize section, and an entry in the Files-in-this-skill table.

Validation

./scripts/harness-check.sh passes — shellcheck clean (resolved three SC2005 stylistic warnings on echo $(dim …) patterns), ruff, mypy, pytest.
bash skills/harness/scripts/memoize.sh --dry-run against the real ~/.claude/projects/-Users-marlinf/memory/ reports zero findings.
Two consecutive memoize runs produce a byte-identical report (diff empty) — idempotency confirmed.
The path-resolution heuristic was tightened mid-implementation after the initial run flagged six false positives (slash commands like /grade, /harness-check, brace expansion stacks/{typescript,python,go,laravel}, regex /MFC/class-based). Restricting to hard path prefixes + known source extensions dropped the noise to zero against the same input.

Copilot

Pull request overview

This PR adds a new memoize sub-action to the harness skill so users can run a deterministic memory-hygiene pass over Claude Code memory and optionally wire a weekly remote review workflow around it. It extends the existing harness surfaces (snapshot, audit, doctor, etc.) with a maintenance path for keeping memory indexes, frontmatter, and stale references from drifting over time.

Changes:

Adds skills/harness/scripts/memoize.sh, a new report-generating script that checks memory index sync, frontmatter hygiene, stale citations, and possible duplicates.
Adds skills/harness/scripts/memoize-prompt.md, a /schedule prompt template for a weekly remote memory-review routine.
Updates skills/harness/SKILL.md to advertise, route, and document the new memoize action and its related files.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 14 comments.

File	Description
`skills/harness/scripts/memoize.sh`	Implements the new memory-hygiene scan and report generation logic.
`skills/harness/scripts/memoize-prompt.md`	Defines the weekly remote workflow for reviewing memory and opening a PR with findings.
`skills/harness/SKILL.md`	Documents the new `memoize` action, routing keywords, and supporting files.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

- CLAUDE_DIR override now cascades to default search roots - MEMOIZE_SEARCH_ROOTS uses PATH-style colon separator (paths with spaces) - ./ and ../ tokens resolve under search roots, not CWD - prune .git, node_modules, vendor, etc. during fallback walk - drop unread SUMMARY stderr emission - prompt drops the local-script branch (snapshot has no scripts) - prompt index-check explicitly skips MEMORY.md and _*.md - prompt clarifies stale-citation is local-only - SKILL.md clarifies report-vs-memory contract

feat(harness): add memoize sub-action for memory hygiene

eff27af

datashaman added the enhancement New feature or request label May 2, 2026

datashaman self-assigned this May 2, 2026

datashaman requested a review from Copilot May 2, 2026 05:27

Copilot started reviewing on behalf of datashaman May 2, 2026 05:27 View session

Copilot AI reviewed May 2, 2026

View reviewed changes

datashaman added 2 commits May 2, 2026 07:39

chore: drop stray .mcp.json from PR

151e722

datashaman marked this pull request as ready for review May 2, 2026 05:42

datashaman merged commit f68ed97 into main May 2, 2026
3 checks passed

This was referenced May 2, 2026

investigate: harness-layer ideas from @yyz81681981 (PTC, spatiotemporal context, dreaming cycle) #4

Closed

docs(harness): add spatial-vs-temporal context taxonomy #10

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(harness): add memoize sub-action for memory hygiene#8

feat(harness): add memoize sub-action for memory hygiene#8
datashaman merged 3 commits into
mainfrom
feat/harness-memoize

datashaman commented May 2, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

datashaman commented May 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Issue

Effect on the user

Root cause

The fix

Validation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

datashaman commented May 2, 2026 •

edited

Loading