fix(sentinel): close audit drop and login attempt gaps by Agent-Hellboy · Pull Request #195 · Agent-Hellboy/mcp-runtime

Agent-Hellboy · 2026-05-15T19:36:58Z

Summary

Fixes #146.

Count and log mcp-gateway analytics events dropped because the bounded queue is full.
Bound API login-attempt state with idle eviction and a maximum retained entry count.
Bound UI login-attempt state with idle eviction and a maximum retained client count, including the pre-decode allow path.

Validation

go test ./... -count=1 in services/mcp-gateway, services/api, services/ui, and services/processor
go test -race ./... -count=1 in services/mcp-gateway, services/api, services/ui, and services/processor
pre-commit hooks during commit: gitleaks, go fmt, staticcheck, go vet, go unit tests, generated file drift

Security notes

No secrets or credentials are logged; the analytics drop log only includes drop count, source, and event type.
Login attempt trackers remain per-process throttles but no longer retain attacker-controlled keys indefinitely.

gemini-code-assist

Code Review

This pull request introduces memory management for login attempt trackers in the API and UI services by implementing idle entry pruning and a maximum entry cap. It also updates the MCP gateway's analytics emission to track dropped events and refine mutex usage. Feedback focuses on the performance impact of the new maintenance routines; specifically, the O(N) complexity of pruning and eviction logic on every request could lead to CPU exhaustion or DoS vulnerabilities. It is recommended to perform these operations periodically or use more efficient data structures like an LRU cache to ensure scalability.

gemini-code-assist · 2026-05-15T19:38:44Z

+func (t *apiLoginAttemptTracker) pruneLocked(now time.Time) {
+	if apiLoginAttemptIdleTTL <= 0 {
+		return
+	}
+	for key, state := range t.entries {
+		if state.lastSeen.IsZero() || (now.Sub(state.lastSeen) > apiLoginAttemptIdleTTL && !state.lockedUntil.After(now)) {
+			delete(t.entries, key)
+		}
+	}
+}


The pruneLocked method iterates over the entire entries map on every call to allow, recordFailure, and recordSuccess. With a default limit of 4096 entries, this O(N) operation per request introduces significant CPU overhead, especially under high load or during a brute-force attack. This could be exploited as a CPU exhaustion vector.

Consider performing pruning periodically (e.g., using a background goroutine) or only when the map size exceeds a certain threshold to decouple this maintenance from the request path.

gemini-code-assist · 2026-05-15T19:38:44Z

+func (t *loginAttemptTracker) pruneLocked(now time.Time) {
+	if loginAttemptIdleTTL <= 0 {
+		return
+	}
+	for clientID, state := range t.clients {
+		if state.lastSeen.IsZero() || (now.Sub(state.lastSeen) > loginAttemptIdleTTL && !now.Before(state.lockedUntil)) {
+			delete(t.clients, clientID)
+		}
+	}
+}


Similar to the API service, pruneLocked performs an O(N) iteration over the clients map on every request to the login flow. This is inefficient and presents a performance bottleneck. Pruning should ideally be decoupled from the request path or performed less frequently (e.g., probabilistically or via a background task).

gemini-code-assist · 2026-05-15T19:38:44Z

+func (t *apiLoginAttemptTracker) enforceMaxLocked() {
+	for len(t.entries) > apiLoginAttemptMaxEntries {
+		var oldestKey string
+		var oldestSeen time.Time
+		for key, state := range t.entries {
+			if oldestKey == "" || state.lastSeen.Before(oldestSeen) {
+				oldestKey = key
+				oldestSeen = state.lastSeen
+			}
+		}
+		if oldestKey == "" {
+			return
+		}
+		delete(t.entries, oldestKey)
+	}
+}


The enforceMaxLocked method performs a full scan of the map to find the oldest entry, resulting in O(N) complexity. While it is only executed when the map exceeds its limit, an attacker could intentionally trigger this scan repeatedly by rotating keys. For better scalability and resistance to DoS, consider using a data structure that supports O(1) eviction, such as a doubly linked list combined with the map (LRU cache pattern).

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 566032d34d

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-18T07:25:44Z

+		for clientID, state := range t.clients {
+			if oldestClientID == "" || state.lastSeen.Before(oldestSeen) {
+				oldestClientID = clientID
+				oldestSeen = state.lastSeen
+			}


Preserve locked clients when enforcing UI tracker cap

The eviction policy removes whichever client has the oldest lastSeen without checking whether that client is currently locked out. Because enforceMaxLocked() is called from allow() and recordFailure(), once more than 4096 distinct client IDs are seen in a 30-minute window, active lockouts can be evicted and the same attacker can continue login attempts immediately. This weakens the lockout control under high-cardinality or intentionally churned client IDs.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-18T07:25:44Z

+		for key, state := range t.entries {
+			if oldestKey == "" || state.lastSeen.Before(oldestSeen) {
+				oldestKey = key
+				oldestSeen = state.lastSeen
+			}


Avoid evicting active API login lockouts under map pressure

The API attempt cap also evicts solely by oldest lastSeen, regardless of whether an entry is still locked. After the map reaches 4096 keys, additional failed logins on other keys can evict a currently locked ip|email entry, allowing immediate retries for that principal and reducing the effectiveness of password-bruteforce throttling.

Useful? React with 👍 / 👎.

fix(sentinel): close audit drop and login attempt gaps

566032d

gemini-code-assist Bot reviewed May 15, 2026

View reviewed changes

Agent-Hellboy marked this pull request as ready for review May 18, 2026 07:22

chatgpt-codex-connector Bot reviewed May 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(sentinel): close audit drop and login attempt gaps#195

fix(sentinel): close audit drop and login attempt gaps#195
Agent-Hellboy wants to merge 1 commit into
mainfrom
sentinel/issue_146_operational_fixes

Agent-Hellboy commented May 15, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 15, 2026

Uh oh!

gemini-code-assist Bot May 15, 2026

Uh oh!

gemini-code-assist Bot May 15, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 18, 2026

Uh oh!

chatgpt-codex-connector Bot May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Agent-Hellboy commented May 15, 2026

Summary

Validation

Security notes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 18, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant