How Kubernetes sees a GPU

easy

Why this matters

Kubernetes was built to schedule CPU and memory, which it understands natively. A GPU is neither: the kubelet has no idea a GPU exists until something tells it. That something is a device plugin, which registers with the kubelet through the device plugin framework and lets a vendor advertise hardware as a named extended resource such as nvidia.com/gpu. Until you grasp that a GPU is just a counted, opaque resource the scheduler matches against pod requests, every GPU-scheduling decision later in this course will feel arbitrary. This task plants the single mental model the rest of the course extends: a GPU is an extended resource that a node advertises and a pod requests.
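To make "a counted, opaque resource the scheduler matches against pod requests" concrete, here is a minimal Python sketch of the counting involved. It is a deliberate simplification, not the scheduler's actual code: the real scheduler checks every resource plus many other constraints. The helper names free_gpus and fits and the sample node are hypothetical.

```python
GPU = "nvidia.com/gpu"  # extended resource name used throughout this lesson


def free_gpus(allocatable: dict, scheduled_requests: list) -> int:
    """GPUs not yet claimed: the node's allocatable count minus the
    requests of pods already placed on that node."""
    total = int(allocatable.get(GPU, 0))
    used = sum(int(req.get(GPU, 0)) for req in scheduled_requests)
    return total - used


def fits(pod_request: dict, allocatable: dict, scheduled_requests: list) -> bool:
    """True if the node can still satisfy the pod's GPU request.
    The count is all that matters; nothing here knows what a GPU is."""
    wanted = int(pod_request.get(GPU, 0))
    return wanted <= free_gpus(allocatable, scheduled_requests)


# Hypothetical node advertising 4 GPUs, with two pods already holding 1 and 2.
node_allocatable = {"cpu": "32", "memory": "128Gi", GPU: "4"}
running = [{GPU: "1"}, {GPU: "2"}]

print(fits({GPU: "1"}, node_allocatable, running))  # True: one GPU is still free
print(fits({GPU: "2"}, node_allocatable, running))  # False: only one GPU is free
```

Note that the counts are whole numbers: extended resources are requested in integer units, so a pod asks for one GPU or two, never half of one.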

Demo

Inspect a GPU node with kubectl describe node and you'll see nvidia.com/gpu listed under Capacity and Allocatable, right next to cpu and memory. That line is the device plugin doing its job: the kubelet publishes the count in the node's status, which tells the scheduler how many GPUs the node has to hand out.
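The same two fields are available in machine-readable form under status.capacity and status.allocatable of the node object. The sketch below reads them from a trimmed-down, hypothetical node (the name gpu-node-1 and all values are made up for illustration), shaped like the output of kubectl get node with -o json. The helper gpu_counts is also hypothetical.

```python
import json

GPU = "nvidia.com/gpu"

# Hypothetical, trimmed-down node object; real output has many more fields.
SAMPLE_NODE = json.loads("""
{
  "metadata": {"name": "gpu-node-1"},
  "status": {
    "capacity":    {"cpu": "32", "memory": "131072Ki", "nvidia.com/gpu": "4"},
    "allocatable": {"cpu": "31", "memory": "130048Ki", "nvidia.com/gpu": "4"}
  }
}
""")


def gpu_counts(node: dict):
    """Return (capacity, allocatable) for nvidia.com/gpu.
    Each value is an int, or None when the node does not list the resource."""
    status = node.get("status", {})

    def read(section: str):
        value = status.get(section, {}).get(GPU)
        return None if value is None else int(value)

    return read("capacity"), read("allocatable")


print(gpu_counts(SAMPLE_NODE))  # (4, 4)
```

Returning None rather than 0 for a missing key keeps "the node advertises no such resource" distinct from "the node advertises zero of it".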

Try it yourself

Run kubectl describe node and confirm nvidia.com/gpu appears under both Capacity and Allocatable.
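Compare Capacity vs Allocatable for nvidia.com/gpu. If they differ, the device plugin is reporting at least one GPU as unhealthy; GPUs already held by pods appear under Allocated resources and do not reduce Allocatable.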
On a node that has never run a device plugin, confirm nvidia.com/gpu is absent entirely (missing, not zero).
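List all nodes with a custom-columns command, for example kubectl get nodes "-o=custom-columns=NAME:.metadata.name,GPU:.status.allocatable.nvidia\.com/gpu", and verify the per-node GPU counts match your hardware.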
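The last check in the list above can also be scripted. The following Python sketch compares each node's allocatable GPU count with an inventory you supply. The function gpu_report, the node names, and the expected counts are all hypothetical; the input is assumed to have the shape of kubectl get nodes with -o json, that is, a list object with an items array.

```python
GPU = "nvidia.com/gpu"


def gpu_report(node_list: dict, expected: dict) -> list:
    """Return one (name, allocatable_or_None, verdict) row per node.
    `expected` maps node name to the GPU count you believe is installed."""
    rows = []
    for node in node_list.get("items", []):
        name = node["metadata"]["name"]
        raw = node.get("status", {}).get("allocatable", {}).get(GPU)
        count = None if raw is None else int(raw)
        want = expected.get(name, 0)
        if count is None:
            # Missing resource: fine for a CPU-only node, a problem otherwise.
            verdict = "ok" if want == 0 else "resource missing"
        elif count == want:
            verdict = "ok"
        else:
            verdict = "mismatch"
        rows.append((name, count, verdict))
    return rows


# Hypothetical cluster: two GPU nodes and one CPU-only node.
SAMPLE_NODES = {
    "items": [
        {"metadata": {"name": "gpu-a"}, "status": {"allocatable": {GPU: "8"}}},
        {"metadata": {"name": "gpu-b"}, "status": {"allocatable": {GPU: "3"}}},
        {"metadata": {"name": "cpu-a"}, "status": {"allocatable": {"cpu": "16"}}},
    ]
}
EXPECTED = {"gpu-a": 8, "gpu-b": 4}

for row in gpu_report(SAMPLE_NODES, EXPECTED):
    print(row)
```

To run it against a real cluster, one option is to replace SAMPLE_NODES with the parsed output of subprocess.run(["kubectl", "get", "nodes", "-o", "json"], capture_output=True, text=True, check=True).stdout, passed through json.loads.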

Prompt your AI

Use these three in order. Each builds on the one before.

1. Basics & terminology

In one paragraph, explain how Kubernetes 'sees' a GPU when it has no built-in concept of one. What is an extended resource like nvidia.com/gpu?

2. Why it works (the mechanism)

Walk me through, step by step, how a GPU goes from physical hardware to appearing as nvidia.com/gpu in a node's Allocatable list. What component reports it and to whom?

3. Advanced — application & what's next

Given a node where kubectl describe node shows nvidia.com/gpu under Capacity but Allocatable shows 0, how would the scheduler behave for a pod requesting one GPU, and what are the likely causes?