Implement Vertical Pod Autoscaling for FAQ System#697
Conversation
|
@Ajibose Great news! 🎉 Based on an automated assessment of this PR, the linked Wave issue(s) no longer count against your application limits. You can now already apply to more issues while waiting for a review of this PR. Keep up the great work! 🚀 |
|
Great job so far There’s just one blocker — the workflow is failing. Could you take a look and fix it so all checks pass? Happy to review again once that’s done. |
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Thanks. Please help me check now |
|
Thank you for contributing to the project |
Closes #383
Summary
k8s/faq-system-deployment.yaml) with production-ready resource requests/limits, readiness/liveness probes, and health-check endpointsk8s/faq-system-vpa.yaml) usingautoscaling.k8s.io/v1that targets the FAQ System deployment and automatically adjusts CPU and memory resourcesk8s/faq-system-vpa.test.ts) with 13 vitest assertions covering both the deployment and VPA manifests, matching the existing k8s test conventions in the repositoryVPA Configuration Details
autoscaling.k8s.io/v1Auto(live pod updates)faq-systemRequestsAndLimits100m128Mi2000m(2 cores)2GiResource Limits and Requests
Initial deployment requests (before VPA takes effect):
150m256Mi500m512MiVPA will dynamically adjust these based on observed usage. The
Automode allows VPA to update running pods without requiring manual restarts.Test Results Confirmation
13 vitest assertions across 2 describe blocks validate:
Auto)controlledValues: RequestsAndLimitssettingPerformance Impact Assessment
Documentation Updates
The VPA manifest is self-documenting via YAML comments-compatible structure. The test file serves as living documentation of the expected manifest contract.