Add Cloud Run runtime guard alerts #59
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 344e3ebc08
    issues.append(f"{len(failures)} Cloud Run failure log(s) for {service}")
    details.extend(_summarize(entry) for entry in failures[:5])

    if services and require_success and success_count == 0:
Check heartbeat success per service
When RUNTIME_GUARD_REQUIRE_SUCCESS=true and RUNTIME_GUARD_CLOUD_RUN_SERVICES lists more than one service, this global success_count lets any one service's 2xx/3xx request suppress the missed-run alert for all the others. In that configuration, a scheduler failure or missed invocation for service B is not reported if service A had a recent successful request, so the heartbeat guard can miss exactly the per-service outage it is meant to catch.
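One possible shape for the fix is to count successes per service and flag each service that has none. This is only a sketch: the helper name `find_services_missing_success` and the entry fields `service` and `status` are assumptions for illustration and may not match the log-entry structure used in this PR.

```python
from collections import Counter
from typing import Any, Iterable, List, Mapping


def find_services_missing_success(
    services: Iterable[str],
    entries: Iterable[Mapping[str, Any]],
) -> List[str]:
    """Return the services that have no 2xx/3xx request among the entries.

    Hypothetical helper: assumes each entry carries a "service" name and an
    integer HTTP "status"; adapt the field access to the real log entries.
    """
    success_counts: Counter = Counter()
    for entry in entries:
        service = entry.get("service")
        status = entry.get("status")
        # Count only 2xx/3xx responses as heartbeat successes.
        if service is not None and isinstance(status, int) and 200 <= status < 400:
            success_counts[service] += 1
    # A service with zero successes has missed its heartbeat, regardless of
    # how many successes the other services recorded.
    return [service for service in services if success_counts[service] == 0]
```

With this shape, the guard could append one issue per returned service instead of a single alert gated on the global `success_count`, so a healthy service A would no longer mask a missed run for service B.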