intelliaide: add skill metadata, OWNERS, config, and README#28
intelliaide: add skill metadata, OWNERS, config, and README#28sakshipatels98-byte wants to merge 2 commits into
Conversation
|
Skipping CI for Draft Pull Request. |
9b0d0fa to
46eba4f
Compare
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: sakshipatels98-byte The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
@sakshipatels98-byte: The following test failed, say
Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
| """ | ||
| extract_cluster.py — Step 1 of IntelliAide pure-skills pipeline. | ||
|
|
||
| Reads diagnostic data from /data/input (mounted from a PVC specified in the |
There was a problem hiding this comment.
The PVC-based data source approach is being questioned on the companion operator PR — the sandbox pods are ephemeral and the agent can collect must-gather data itself during its run using /tmp. See this comment.
If that feedback lands, this script would need to collect the data instead of assuming it's pre-mounted at /data/input.
|
This PR doesn't include evals. The repo has an eval framework under It would be especially valuable to include negative cases that demonstrate why this skill is needed — e.g., what happens when the agent tries to analyze a large must-gather bundle without the IntelliAide pipeline (buffer overflow from raw file reads, shallow diagnosis that misses root cause, etc.). Those cases help justify the skill's existence and guard against regressions if someone later simplifies the flow. |
| @@ -0,0 +1,6 @@ | |||
| # See the OWNERS docs: https://git.k8s.io/community/contributors/guide/owners.md | |||
|
|
|||
| approvers: | |||
There was a problem hiding this comment.
Individual teams are responsible for the ownership of the skill.
Summary
Add IntelliAide as a new agentic skill for deep root-cause analysis of OpenShift
cluster issues using pre-mounted diagnostic data.
SKILL.mdorchestrates a 4-step workflow(
extract_cluster→select_files→analyze_data→perform_rca) with3-pass priority analysis (High → Medium → Low). Python scripts handle
computation only (token estimation, chunking, file I/O); the orchestrating
agent performs all LLM reasoning.
/data/input,mounted read-only from the Proposal's
spec.dataSourcePVC. No manualoc cpor wait step;extract_cluster.pyvalidates the pre-populated mountat pipeline start.
requirements.txtdeps (requests,PyYAML, google-auth, drain3, odfpy, python-docx) are vendored under
intelliaide/vendor/for Python 3.12 so the restricted sandbox can runscripts without
pip installat runtime.DataSource/),ML log/YAML classifiers, and pipeline configuration (
Config/).intelliaide/README.mddocuments the pipeline and vendorregeneration;
intelliaide/OWNERSfor Prow review assignment; YAMLfrontmatter on
SKILL.mdfor skill discovery; simplifiedconfig.json(removed obsolete
oc cp-related keys).Test plan
Containerfile) and confirmintelliaide/is copied to/app/skills/intelliaidefor RCA/deep-analysis requests via SKILL.md frontmatterspec.dataSource.claimNamepointing to a PVC pre-populated with must-gather data/data/inputis mounted andextract_cluster.pyvalidates data withoutoc cppip install) inside the sandboxconfig.jsonkeys match what skill scripts consume (no references to removedmust_gather_incoming_dir/must_gather_wait_seconds)