GPU Sharing Pods on Kubernetes

By default, Kubernetes doesn't allow GPU sharing cases as follows:

A pod with multiple containers that share a single GPU.
Multiple pods that share a single GPU.

In this repository, I introduce some tricks for GPU sharing pods on Kubernetes with only use of NVIDIA device plugin.

Prerequisites

Install Kubectl
Install Minikube
Install HELM
Install Argo Workflows CLI

K8s Cluster Creation

make cluster
kubectl get pods --all-namespaces
# Check `nvidia-device-plugin-daemonset` is running.

Argo Workflow Installation

kubectl create namespace argo
helm install argo-workflows charts/argo-workflows -n argo
# Wait for argo-workflows ready...
make port-forward

Open http://localhost:2746/

Login with the token:

kubectl apply -f secret.yaml
kubectl get secret  # Check `argo-workflows-admin.service-account-token` created.
make token
# Paste all strings including Bearer.

Execute a simple workflow for testing:

argo submit --watch workflows/hello-world.yaml

Examples

Create a workflow template that have parallel jobs sharing GPU(s).

kubectl apply -f workflows/templates/gpu-sharing-workflowtemplate.yaml

Trigger the gpu allocation and gpu-sharing workflow execution.

# time slicing with 1 GPU
argo submit --watch workflows/submit-gpu-sharing-workflow.yaml
# time slicing with 2 GPUs
argo submit --watch workflows/submit-gpu-sharing-workflow.yaml -p gpus=2  # 2 gpus
# MPS with 1 GPU
argo submit --watch workflows/submit-gpu-sharing-workflow.yaml -p mps=enabled

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
charts/argo-workflows		charts/argo-workflows
workflows		workflows
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
secret.yaml		secret.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GPU Sharing Pods on Kubernetes

Prerequisites

K8s Cluster Creation

Argo Workflow Installation

Examples

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GPU Sharing Pods on Kubernetes

Prerequisites

K8s Cluster Creation

Argo Workflow Installation

Examples

References

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages