
Feature: Partial cancel#9

Open
vsoch wants to merge 4 commits into main from partial-cancel

vsoch commented Jan 30, 2025

This is the start of work on partial cancel. Specifically, we needed to refactor the match and the jobspec to ask for resources at the level of cores for a slot (and not whole nodes, as was previously done). This means we need to get the graph back and parse the cores into units based on the size of the slot (pod).
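To make the regrouping step concrete, here is a minimal sketch of splitting allocated cores into slot-sized units. It assumes the cores have already been pulled out of the returned graph as simple `(node, core)` pairs. The function name `group_cores_into_slots` and the input shape are hypothetical illustrations, not the code in this PR.

```python
from collections import defaultdict


def group_cores_into_slots(allocated_cores, slot_size):
    """Split allocated cores into slots (pods) of slot_size cores each.

    allocated_cores is an iterable of (node_name, core_id) pairs. This is a
    simplified, hypothetical stand-in for cores parsed from the match graph.
    """
    if slot_size < 1:
        raise ValueError("slot_size must be at least 1")

    # Collect the core ids that belong to each node.
    by_node = defaultdict(list)
    for node, core in allocated_cores:
        by_node[node].append(core)

    slots = []
    for node in sorted(by_node):
        cores = sorted(by_node[node])
        # Assumption: a slot (pod) cannot span nodes, so chunk per node.
        # Cores left over after the last full slot are not assigned.
        for start in range(0, len(cores) - slot_size + 1, slot_size):
            slots.append({"node": node, "cores": cores[start:start + slot_size]})
    return slots


if __name__ == "__main__":
    allocated = [("node-a", 0), ("node-a", 1), ("node-a", 2), ("node-a", 3),
                 ("node-b", 0), ("node-b", 1)]
    for slot in group_cores_into_slots(allocated, slot_size=2):
        print(slot)
```

Each resulting unit records exactly which cores one pod holds, which is the information a later partial cancel would need.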

I tested doing a satisfy, and then a match -> cancel, to attempt a schedule with a constraint. The match -> cancel worked through the cancel, but adding the constraint resulted in no allocation and no obvious errors. Next I'm going to work on the right way to do the partial cancel when the job comes back. Likely we will want to prepare a jobspec with the resources to remove for each pod (as a label, annotation, or similar) so it can simply be grabbed and used when the event is received.
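One way the "prepare ahead, grab on event" idea could look is sketched below. Everything here is an assumption for illustration: the annotation key, the pod naming scheme, and the JSON payload are hypothetical, and the payload is a simplified record rather than a real jobspec.

```python
import json

# Hypothetical annotation key; not a key used by this project.
CANCEL_ANNOTATION = "example.org/partial-cancel"


def build_cancel_annotations(job_id, slots):
    """Precompute, per pod, the resources a partial cancel should release.

    slots is a list of {"node": str, "cores": [int, ...]} dicts, one per pod.
    Returns a mapping of (hypothetical) pod name to its annotations.
    """
    annotations = {}
    for index, slot in enumerate(slots):
        pod_name = f"job-{job_id}-pod-{index}"
        payload = {"jobid": job_id, "node": slot["node"], "cores": slot["cores"]}
        # Store the payload as a string, since annotation values are strings.
        annotations[pod_name] = {
            CANCEL_ANNOTATION: json.dumps(payload, sort_keys=True)
        }
    return annotations


def resources_to_cancel(pod_annotations):
    """Read the prepared payload back when the pod's event is received."""
    return json.loads(pod_annotations[CANCEL_ANNOTATION])


if __name__ == "__main__":
    prepared = build_cancel_annotations(7, [{"node": "node-a", "cores": [0, 1]}])
    print(resources_to_cancel(prepared["job-7-pod-0"]))
```

Doing the work at scheduling time keeps the event handler trivial: it only decodes what was already attached to the pod, with no need to query the graph again.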

vsoch added 4 commits January 29, 2025 19:09
I can format the jobspec correctly, but then when
the formatting errors go away it silently returns no
response and no errors, so I cannot debug further.

Signed-off-by: vsoch <vsoch@users.noreply.github.com>
Signed-off-by: vsoch <vsoch@users.noreply.github.com>
This is the start of work for partial cancel.
Specifically, we needed to refactor the match
and jobspec to ask for resources on the level
of a core for a slot (and not node as previously
done). This means we need to get back the
graph and parse the cores into units based
on the size of the slot (pod).

Signed-off-by: vsoch <vsoch@users.noreply.github.com>
Signed-off-by: vsoch <vsoch@users.noreply.github.com>