feat(test): unify e2e harness across local and kubernetes by kalbasit · Pull Request #1381 · kalbasit/ncps

kalbasit · 2026-06-09T07:36:49Z

Summary

Consolidates the three overlapping e2e harnesses — nix/k8s-tests and the standalone dev-scripts/test-cdc-lifecycle-e2e.py / test-inflight-staging-contention-e2e.py drivers — into one scenario-driven harness at nix/e2e-tests, exposed as task test:e2e / nix run .#e2e.

--mode local|kubernetes selects the substrate. local drives ncps through dev-scripts/run.py against the fixed-port nix run .#deps backends; kubernetes provisions a Kind cluster and installs the Helm chart.
Single catalog: scenarios stay declared in config.nix (kept as the source of truth, materialized via nix eval), with harness-only phase / modes keys derived per entry. Run nix run .#e2e -- --list.
Shared phase drivers: a Deployment adapter (LocalDeployment) plus shared Client/DBAccess run the serve, cdc-lifecycle, and staging-contention phases unchanged across backends; kubernetes mode reuses the in-cluster NCPSTester validation.
Removals: the two former drivers + their *-auto.sh wrappers are deleted (parity verified live); k8s_tests.py / k8s_tests_tester.py / config.nix move into nix/e2e-tests, folding the k8s backend into the unified package and deleting nix/k8s-tests (devshell now ships packages.e2e).
Docs (Contributing, Testing, CLAUDE.md, README) point at the new entrypoint.

The harness is manual / opt-in and intentionally stays out of nix flake check; no scenario is promoted unless proven under three minutes.

Test plan

task fmt (0 changed), task lint (0 issues), task test (all ok)
openspec validate — valid; change archived, specs synced
Local mode verified live: serve, cdc-lifecycle (5 phases + cross-cutting, 20 chunked NARs drained), staging-contention (download + chunking windows, staging activated, byte-identical) all PASS
Kubernetes mode verified live: single-local-sqlite on Kind — cluster + infra + image build/push + Helm install + NCPSTester 6/6 checks PASS

Consolidate the three overlapping e2e harnesses — nix/k8s-tests and the standalone dev-scripts/test-cdc-lifecycle-e2e.py / test-inflight-staging-contention-e2e.py drivers — into one scenario-driven harness at nix/e2e-tests, exposed as `task test:e2e` / `nix run .#e2e`. A `--mode local|kubernetes` flag selects the substrate: local drives ncps through dev-scripts/run.py against the fixed-port `nix run .#deps` backends; kubernetes provisions a Kind cluster and installs the Helm chart. Scenarios are declared once in config.nix (the existing catalog, kept as the single source of truth and materialized via `nix eval`), with harness-only `phase` and `modes` keys derived or overridden per entry. A Deployment adapter (LocalDeployment) plus shared Client/DBAccess let the serve, cdc-lifecycle, and staging-contention phase drivers run unchanged across backends; kubernetes mode reuses the in-cluster NCPSTester validation. The two former drivers and their *-auto.sh wrappers are removed (parity verified live: cdc-lifecycle and staging-contention both pass in local mode); k8s_tests.py, k8s_tests_tester.py and config.nix move into nix/e2e-tests so the k8s backend is folded into the unified package and nix/k8s-tests is deleted (devshell now ships packages.e2e). Docs (Contributing, Testing, CLAUDE.md, README) point at the new entrypoint. The harness is manual/opt-in and intentionally stays out of `nix flake check`; no scenario is promoted unless proven under three minutes. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

kalbasit · 2026-06-09T07:36:50Z

This change is part of the following stack:

feat(test): unify e2e harness across local and kubernetes #1381 ◀

_{Change managed by git-spice.}

coderabbitai · 2026-06-09T07:37:11Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 2be601d3-bc3f-47ef-81b4-c78b9728a67e

📥 Commits

Reviewing files that changed from the base of the PR and between eadec20 and 399f9ad.

📒 Files selected for processing (8)

nix/e2e-tests/README.md
nix/e2e-tests/src/catalog.py
nix/e2e-tests/src/db.py
nix/e2e-tests/src/deployment.py
nix/e2e-tests/src/local.py
nix/e2e-tests/src/phases/cdc_lifecycle.py
openspec/changes/archive/2026-06-09-consolidate-test-harnesses/specs/unified-e2e-harness/spec.md
openspec/specs/unified-e2e-harness/spec.md

✅ Files skipped from review due to trivial changes (3)

nix/e2e-tests/README.md
openspec/changes/archive/2026-06-09-consolidate-test-harnesses/specs/unified-e2e-harness/spec.md
openspec/specs/unified-e2e-harness/spec.md

🚧 Files skipped from review as they are similar to previous changes (5)

nix/e2e-tests/src/deployment.py
nix/e2e-tests/src/db.py
nix/e2e-tests/src/phases/cdc_lifecycle.py
nix/e2e-tests/src/local.py
nix/e2e-tests/src/catalog.py

📝 Walkthrough

Summary by CodeRabbit

New Features
- Unified, scenario-driven end-to-end test harness with CLI/app entrypoints and task integration; supports --mode (local|kubernetes), --scenario and listing.
Documentation
- Developer guides and READMEs updated with scenario usage, modes, and catalog/extension guidance; notes that E2E is manual/opt-in and not run by default.
Chores
- Removed legacy standalone E2E wrappers and deprecated Kubernetes-specific test package; added new task entrypoint for running the harness.

Walkthrough

A comprehensive consolidation of three overlapping E2E test harnesses into a single scenario-driven Python framework. The new unified harness supports both local (dev-scripts/run.py) and Kubernetes (Kind/Helm) execution modes via --mode, runs declarative scenarios from config.nix, validates behavior through phase drivers (serve, cdc-lifecycle, staging-contention), and exposes a single entrypoint via task test:e2e and nix run .#e2e.

Changes

Unified E2E Harness

Layer / File(s)	Summary
Flake module wiring and package exposure `flake.nix`, `nix/e2e-tests/flake-module.nix`, `nix/devshells/flake-module.nix`, `nix/checks/flake-module.nix`, `nix/e2e-tests/src/__init__.py`	Swap `k8s-tests` module import for `e2e` in flake; new `nix/e2e-tests/flake-module.nix` exports `packages.e2e` and `apps.e2e`, which run `python3 ${./src/cli.py}` with `CONFIG_FILE` and `PYTHONPATH` set; devshell and checks documentation updated to reference the unified harness.
Scenario catalog definition and discovery `nix/e2e-tests/config.nix`, `nix/e2e-tests/src/catalog.py`	Add `cdc-lifecycle` and `staging-contention` permutations to `config.nix` with harness-only `phase` and `modes` keys; implement runtime catalog loading via `nix eval`, normalize scenarios into `Scenario` dataclass, provide `load_catalog`, `find_scenario`, and `format_catalog_listing` utilities for CLI discovery and validation.
CLI argument parsing and dispatch `nix/e2e-tests/src/cli.py`	Implement main CLI entrypoint that parses `--mode`, `--scenario`, `--list`, and `--verbose`; handle `--list` output via catalog helpers; validate required `--mode` and `--scenario` arguments; dispatch to `runner.run_scenario`.
Scenario runner and deployment protocol `nix/e2e-tests/src/deployment.py`, `nix/e2e-tests/src/runner.py`	Define `Deployment` protocol specifying lifecycle (provision/teardown), replica access, restart/subcommand execution, and observability (db/logs); implement `run_scenario` that resolves scenarios, reports `SKIP` for unsupported topologies, dispatches to mode-specific runners, reports `PASS/FAIL/ERROR`, and guarantees cleanup.
Local deployment orchestration `nix/e2e-tests/src/local.py`, `nix/e2e-tests/src/deps.py`	Implement `LocalDeployment` that drives `dev-scripts/run.py` with scenario flags, polls replica readiness on `/nix-cache-info`, provides lifecycle methods (provision/start/stop/restart/clean_restart) with process group signaling, and exposes APIs for client access, subcommand execution, db queries, and logs. Implement `Deps` to manage fixed-port backend services (S3/PostgreSQL/MariaDB/Redis) via `nix run .#deps` with conditional Redis startup.
Kubernetes deployment delegation `nix/e2e-tests/src/kubernetes_mode.py`	Implement `run_kubernetes_scenario` that reuses `K8sTestsCLI` machinery, rebuilds/loads ncps OCI image, generates Helm values per scenario, installs release, runs in-cluster validation, and performs best-effort cleanup in a finally block.
Shared test utilities `nix/e2e-tests/src/harness_config.py`, `nix/e2e-tests/src/client.py`, `nix/e2e-tests/src/db.py`	Implement `harness_config` with fixed ports, repo root discovery, filesystem paths, database URLs, S3 config, logging/assertion helpers, and CLI argument generators; `client` with HTTP requests, narinfo parsing, NAR decompression (xz/zstd/none), digest computation, and Nix-based package realization; `db` with dialect-aware SQL execution (SQLite/Postgres/MySQL).
Phase implementations `nix/e2e-tests/src/phases/__init__.py`, `nix/e2e-tests/src/phases/serve.py`, `nix/e2e-tests/src/phases/cdc_lifecycle.py`, `nix/e2e-tests/src/phases/staging_contention.py`	Implement phase dispatcher with lazy imports; `serve.py` validates byte-identical NAR serving across replicas; `cdc_lifecycle.py` exercises full CDC lifecycle (baseline, eager, lazy, drain, restart) with narinfo/DB/digest assertions; `staging_contention.py` verifies in-flight staging activation under concurrent contention with byte-identical delivery and dual-window (CDC off/on) coverage.
Documentation and task integration `Taskfile.yml`, `docs/docs/Developer Guide/Contributing.md`, `docs/docs/Developer Guide/Testing.md`, `nix/e2e-tests/README.md`, `CLAUDE.md`	Add `task test:e2e` that forwards CLI args to `nix run .#e2e`; update developer guides with unified harness usage (modes, scenarios, examples); add comprehensive `nix/e2e-tests/README.md` explaining purpose, catalog, phases, CI behavior (manual/opt-in); update `CLAUDE.md` reference.
Removal of deprecated harnesses `dev-scripts/test-cdc-lifecycle-.py/.sh`, `dev-scripts/test-inflight-staging-contention-.py/.sh`, `nix/k8s-tests/`, `dev-scripts/profile-flake-checks.py`	Remove old CDC and staging-contention E2E drivers; remove `nix/k8s-tests/flake-module.nix` and `README.md`; update `profile-flake-checks.py` to exclude unified `e2e` harness instead of old `k8s-tests`.

Sequence Diagram

sequenceDiagram
  participant CLI
  participant Runner
  participant Catalog
  participant Local
  participant K8s
  CLI->>Runner: run_scenario(--mode, --scenario, --verbose)
  Runner->>Catalog: find_scenario(name)
  Runner->>Runner: supports(mode)?
  alt mode=local
    Runner->>Local: provision + run phase
    Local->>Runner: PASS/FAIL/ERROR
  else mode=kubernetes
    Runner->>K8s: run_kubernetes_scenario(scenario)
    K8s->>Runner: rc (0/1)
  end
  Runner-->>CLI: exit code

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

kalbasit/ncps#1373: Replaces the standalone inflight-staging contention driver with the staging-contention phase in the unified harness.
kalbasit/ncps#1371: Overlaps removal/changes to the CDC lifecycle driver that this PR consolidates into the cdc-lifecycle scenario.
kalbasit/ncps#1249: Related flake-check profiling changes affecting dev-scripts/profile-flake-checks.py.

Suggested labels

documentation, merge-stack

Poem

🐰 I hopped through scripts both far and near,
Collected scenarios, made them all appear.
One harness now runs local and on Kind,
With phases, catalog and docs aligned.
A carrot for tests — consolidated cheer!

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

github-actions · 2026-06-09T07:39:07Z

Pages deployed to https://2b092783.ncps-docs.pages.dev

gemini-code-assist

Code Review

This pull request consolidates multiple end-to-end test harnesses into a single, unified scenario-driven e2e harness under nix/e2e-tests/ that supports both local and Kubernetes execution modes. The review feedback highlights several critical correctness and robustness improvements, including fixing an incorrect relative path calculation for the repository root, defensively handling potentially null values from the Nix configuration, decoding URL-encoded database credentials, ensuring the Deployment protocol declares all invoked methods, and explicitly specifying UTF-8 encoding when opening log files.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

coderabbitai

Actionable comments posted: 4

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

nix/e2e-tests/README.md (1)
81-99: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Add language specifier to fenced code block.

The fenced code block at line 81 is missing a language specifier, which is flagged by markdownlint (MD040). Add a language identifier for proper syntax highlighting and markdown compliance.
📝 Proposed fix
-```
+```text
 nix/e2e-tests/
   flake-module.nix       packages.e2e + apps.e2e (writeShellApplication)
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@nix/e2e-tests/README.md` around lines 81 - 99, Add a language specifier to
the fenced code block that shows the project tree (the block beginning with
"nix/e2e-tests/ ..."), e.g., change the opening triple backticks to include a
language like text or bash; update the README.md fenced block so markdownlint
MD040 is satisfied and syntax highlighting is explicit.
Source: Linters/SAST tools

🧹 Nitpick comments (2)

nix/e2e-tests/src/client.py (1)

78-110: ⚡ Quick win

Consider extracting duplicate decompression logic.

The decompression logic in served_nar_digest (lines 87-96) is duplicated in the standalone decode_nar function (lines 102-110). Consider refactoring served_nar_digest to call decode_nar instead.

♻️ Proposed refactor

     def served_nar_digest(self, narinfo_fields: Dict[str, str]) -> Tuple[str, int]:
         """(sha256 of DECOMPRESSED NAR, served byte length).
 
         Comparing the decompressed-content digest proves byte-identity across
         compression representations (xz whole-file vs none-from-chunks), which
         catches same-size corruption a length check would miss.
         """
         raw = self.fetch_nar_bytes(narinfo_fields)
         comp = narinfo_fields.get("Compression", "none")
-        if comp in ("none", ""):
-            data = raw
-        elif comp == "xz":
-            data = lzma.decompress(raw)
-        elif comp in ("zst", "zstd"):
-            import zstandard
-
-            data = zstandard.ZstdDecompressor().stream_reader(io.BytesIO(raw)).read()
-        else:
-            raise RuntimeError(f"served_nar_digest: unsupported compression {comp!r}")
+        data = decode_nar(raw, comp)
         return hashlib.sha256(data).hexdigest(), len(raw)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@nix/e2e-tests/src/client.py` around lines 78 - 110, The decompression logic
is duplicated between served_nar_digest and decode_nar; update served_nar_digest
to call decode_nar(raw, comp) instead of reimplementing branching logic: obtain
comp via narinfo_fields.get(...), call data = decode_nar(raw, comp), then
compute and return hashlib.sha256(data).hexdigest() and len(raw); preserve the
existing RuntimeError behavior by relying on decode_nar for unsupported
compressions and ensure any required imports used by decode_nar (lzma,
zstandard, io) remain available.

nix/e2e-tests/src/phases/cdc_lifecycle.py (1)

97-97: ⚡ Quick win

Clarify the log message format.

The message format ({before} -> after) is misleading because "after" appears to be a variable placeholder but is actually literal text. Either compute and include the actual after-count, or use clearer wording.

📝 Suggested improvements

Option 1 - Show the actual count:

-    check(_chunked_nar_count(db) > before, f"new chunked NAR recorded ({before} -> after)")
+    after = _chunked_nar_count(db)
+    check(after > before, f"new chunked NAR recorded ({before} -> {after})")

Option 2 - Use clearer wording:

-    check(_chunked_nar_count(db) > before, f"new chunked NAR recorded ({before} -> after)")
+    check(_chunked_nar_count(db) > before, f"new chunked NAR recorded (was {before})")

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@nix/e2e-tests/src/phases/cdc_lifecycle.py` at line 97, The log message passed
to check is misleading because it uses the literal "after" instead of the actual
post-count; compute the current count by calling _chunked_nar_count(db) into a
variable (e.g., after) and use it in the message or change the wording to
something like "new chunked NAR recorded (was {before})" to avoid implying a
variable. Update the call in the check that currently references
_chunked_nar_count(db) and f"new chunked NAR recorded ({before} -> after)" to
include the computed after value or clearer wording so the message accurately
reflects the state.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@nix/e2e-tests/src/catalog.py`:
- Around line 36-44: The repo_root calculation uses os.path.dirname three times
on __file__, which yields the package directory instead of the repository root;
update the repo_root expression to apply one more os.path.dirname (i.e., four
dirname() calls) so that repo_root points to the repository root before
constructing candidate paths (look for the repo_root variable assignment where
__file__ is wrapped with os.path.abspath and nested os.path.dirname calls).

In `@nix/e2e-tests/src/deployment.py`:
- Around line 36-38: Add a proper type annotation for the `extra` parameter on
the `run_subcommand` method: change its type to Optional[List[str]] (or
Sequence[str] if you prefer immutability) and import the needed typing symbols
(Optional, List or Sequence) at the top of the module; update the signature of
run_subcommand to reflect the new annotation so callers and static checkers know
`extra` is an optional list of CLI argument strings.

In `@nix/e2e-tests/src/phases/cdc_lifecycle.py`:
- Line 135: The second return value from
deployment.run_subcommand("migrate-chunks-to-nar", ["--force-reclaim"]) is
intentionally unused; replace the unused variable name `out` with a throwaway
underscore variable (e.g., `_`) so the call becomes `rc, _ =
deployment.run_subcommand(...)`, updating any references to `out` if present;
this makes intent explicit while keeping `rc` as the meaningful result.

In
`@openspec/changes/archive/2026-06-09-consolidate-test-harnesses/specs/unified-e2e-harness/spec.md`:
- Around line 24-25: The example phrase "a multi-replica scenario requested in a
single-process local mode" contradicts the PR (local mode can be used for
multi-replica scenarios); update the wording in the WHEN/THEN block to a
non-conflicting topology example such as "a multi-datacenter or
network-partitioned topology requested in single-process local mode" or remove
the example entirely; edit the quoted lines ("**WHEN** a scenario requires a
topology the selected mode cannot express ..." and the following example) so the
example accurately reflects a topology local mode cannot express (e.g.,
multi-datacenter or network-partitioned) and ensure the THEN clause remains
unchanged (report SKIPPED with reason, not PASSED).

---

Outside diff comments:
In `@nix/e2e-tests/README.md`:
- Around line 81-99: Add a language specifier to the fenced code block that
shows the project tree (the block beginning with "nix/e2e-tests/ ..."), e.g.,
change the opening triple backticks to include a language like text or bash;
update the README.md fenced block so markdownlint MD040 is satisfied and syntax
highlighting is explicit.

---

Nitpick comments:
In `@nix/e2e-tests/src/client.py`:
- Around line 78-110: The decompression logic is duplicated between
served_nar_digest and decode_nar; update served_nar_digest to call
decode_nar(raw, comp) instead of reimplementing branching logic: obtain comp via
narinfo_fields.get(...), call data = decode_nar(raw, comp), then compute and
return hashlib.sha256(data).hexdigest() and len(raw); preserve the existing
RuntimeError behavior by relying on decode_nar for unsupported compressions and
ensure any required imports used by decode_nar (lzma, zstandard, io) remain
available.

In `@nix/e2e-tests/src/phases/cdc_lifecycle.py`:
- Line 97: The log message passed to check is misleading because it uses the
literal "after" instead of the actual post-count; compute the current count by
calling _chunked_nar_count(db) into a variable (e.g., after) and use it in the
message or change the wording to something like "new chunked NAR recorded (was
{before})" to avoid implying a variable. Update the call in the check that
currently references _chunked_nar_count(db) and f"new chunked NAR recorded
({before} -> after)" to include the computed after value or clearer wording so
the message accurately reflects the state.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 7ab40503-4bdc-41b9-84f5-e20d8d187f62

📥 Commits

Reviewing files that changed from the base of the PR and between 3c11a3b and eadec20.

📒 Files selected for processing (46)

CLAUDE.md
Taskfile.yml
dev-scripts/profile-flake-checks.py
dev-scripts/test-cdc-lifecycle-auto.sh
dev-scripts/test-cdc-lifecycle-e2e.py
dev-scripts/test-inflight-staging-contention-auto.sh
dev-scripts/test-inflight-staging-contention-e2e.py
docs/docs/Developer Guide/Contributing.md
docs/docs/Developer Guide/Testing.md
flake.nix
nix/checks/flake-module.nix
nix/devshells/flake-module.nix
nix/e2e-tests/README.md
nix/e2e-tests/config.nix
nix/e2e-tests/flake-module.nix
nix/e2e-tests/src/__init__.py
nix/e2e-tests/src/catalog.py
nix/e2e-tests/src/cli.py
nix/e2e-tests/src/client.py
nix/e2e-tests/src/db.py
nix/e2e-tests/src/deployment.py
nix/e2e-tests/src/deps.py
nix/e2e-tests/src/harness_config.py
nix/e2e-tests/src/k8s_tests.py
nix/e2e-tests/src/k8s_tests_tester.py
nix/e2e-tests/src/kubernetes_mode.py
nix/e2e-tests/src/local.py
nix/e2e-tests/src/phases/__init__.py
nix/e2e-tests/src/phases/cdc_lifecycle.py
nix/e2e-tests/src/phases/serve.py
nix/e2e-tests/src/phases/staging_contention.py
nix/e2e-tests/src/runner.py
nix/k8s-tests/README.md
nix/k8s-tests/flake-module.nix
openspec/changes/archive/2026-06-09-consolidate-test-harnesses/.openspec.yaml
openspec/changes/archive/2026-06-09-consolidate-test-harnesses/design.md
openspec/changes/archive/2026-06-09-consolidate-test-harnesses/proposal.md
openspec/changes/archive/2026-06-09-consolidate-test-harnesses/specs/cdc-lifecycle-e2e/spec.md
openspec/changes/archive/2026-06-09-consolidate-test-harnesses/specs/cdc-lifecycle-k8s-test/spec.md
openspec/changes/archive/2026-06-09-consolidate-test-harnesses/specs/inflight-staging-contention-e2e/spec.md
openspec/changes/archive/2026-06-09-consolidate-test-harnesses/specs/unified-e2e-harness/spec.md
openspec/changes/archive/2026-06-09-consolidate-test-harnesses/tasks.md
openspec/specs/cdc-lifecycle-e2e/spec.md
openspec/specs/cdc-lifecycle-k8s-test/spec.md
openspec/specs/inflight-staging-contention-e2e/spec.md
openspec/specs/unified-e2e-harness/spec.md

💤 Files with no reviewable changes (9)

dev-scripts/test-inflight-staging-contention-auto.sh
nix/k8s-tests/README.md
openspec/specs/inflight-staging-contention-e2e/spec.md
nix/k8s-tests/flake-module.nix
dev-scripts/test-cdc-lifecycle-auto.sh
openspec/specs/cdc-lifecycle-e2e/spec.md
dev-scripts/test-cdc-lifecycle-e2e.py
openspec/specs/cdc-lifecycle-k8s-test/spec.md
dev-scripts/test-inflight-staging-contention-e2e.py

Apply the valid CodeRabbit/Gemini review findings on the unified e2e harness: - catalog: fix the CONFIG_FILE-unset fallback, which resolved the repo root one directory short (nix/ instead of the repo) and so looked for config.nix under nix/nix/...; reuse harness_config.find_repo_root, which walks up to flake.nix. Also harden the Nix-permutation accessors with `or {}` / `or []` so an explicit null in config.nix cannot raise. - db: url-decode percent-encoded MySQL/MariaDB credentials before passing them to pymysql.connect. - deployment: declare stop() and start() on the Deployment protocol (the cdc-lifecycle phase calls them) and type run_subcommand's `extra` as Optional[List[str]]. - local: open the harness and replica log files with encoding="utf-8". - cdc_lifecycle: discard the unused migrate-chunks-to-nar output. - docs/spec: use a non-contradictory topology example (local mode does support multi-replica) and add a language to the README layout block. The remaining review threads are devskim/CodeQL false positives — intentional fixed-dev-port (127.0.0.1) access and the well-known dev Garage fixture credentials (identical to dev-scripts/run.py), which are redacted before logging. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

github-actions · 2026-06-09T08:00:38Z

Pages deployed to https://d196ca0b.ncps-docs.pages.dev

dosubot Bot added size:XXL This PR changes 1000+ lines, ignoring generated files. enhancement New feature or request labels Jun 9, 2026

github-advanced-security AI found potential problems Jun 9, 2026

View reviewed changes

Comment thread nix/e2e-tests/src/harness_config.py Dismissed

gemini-code-assist Bot reviewed Jun 9, 2026

View reviewed changes

coderabbitai Bot requested changes Jun 9, 2026

View reviewed changes

Comment thread nix/e2e-tests/src/catalog.py Outdated

Comment thread nix/e2e-tests/src/deployment.py Outdated

Comment thread nix/e2e-tests/src/phases/cdc_lifecycle.py Outdated

Comment thread ...spec/changes/archive/2026-06-09-consolidate-test-harnesses/specs/unified-e2e-harness/spec.md Outdated

coderabbitai Bot approved these changes Jun 9, 2026

View reviewed changes

kalbasit merged commit 845f43f into main Jun 9, 2026
45 of 57 checks passed

kalbasit deleted the user/wnasreddine/consolidate-test-harnesses branch June 9, 2026 15:21

coderabbitai Bot mentioned this pull request Jun 9, 2026

feat(e2e): run cdc-lifecycle on kubernetes via a Deployment adapter #1384

Merged

6 tasks

Uh oh!

Conversation

kalbasit commented Jun 9, 2026

Summary

Test plan

Uh oh!

kalbasit commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Possibly related PRs

Suggested labels

Poem

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented Jun 9, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented Jun 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kalbasit commented Jun 9, 2026 •

edited

Loading

coderabbitai Bot commented Jun 9, 2026 •

edited

Loading