Quickstart¶
Every command works as either kempnerpulse or the shorter kp.
Launch the live dashboard¶
kempnerpulse
This is equivalent to kempnerpulse --backend dcgm --poll 0.1 — it streams
directly from dcgmi dmon at ~100 ms resolution and shows every GPU visible to
your process (respecting CUDA_VISIBLE_DEVICES / SLURM_JOB_GPUS). Press
Ctrl-C to quit.
One-shot snapshot¶
kp --once # render a single frame and exit
Export CSV¶
kp --export # default columns, to stdout
kp --export all > run.csv # every column
kp --export sm_active_pct,tensor_active_pct,real_util_pct # a custom set
See CSV Export Reference for the full column reference.
Pick a weight preset¶
The Real Utilization composite is a weighted blend of SM / Tensor / DRAM / GR counters. Choose a preset for your workload, or supply custom weights:
kp --ai-weights # AI / LLM training & inference (default): 0.35,0.35,0.20,0.10
kp --hpc-weights # general HPC: 0.45,0.15,0.25,0.15
kp --mem-weights # memory-bound / bandwidth-heavy: 0.35,0.10,0.40,0.15
kp --weights 0.40,0.30,0.20,0.10 # custom (normalized to sum to 1)
Choose which GPUs to show¶
kp --focus-gpu 0 # start focused on GPU 0
kp --gpus 0,1 # only GPUs 0 and 1 (also accepts ranges like 0-3)
kp --show-all # ignore SLURM/CUDA env and show every accessible GPU
Switch backends¶
kp --backend prometheus --source http://localhost:9400/metrics --poll 1.0
kp --backend replay --source capture.txt --once # no GPU needed
Interactive commands¶
While the live dashboard is running, type :-prefixed commands:
Command |
Action |
|---|---|
|
focus a single GPU’s detailed view |
|
line-chart view (history sparklines) |
|
running-process table |
|
step back one view (focus/plot/job → fleet) |
|
quit |
In the fleet view, scroll card rows with ↑/↓, PgUp/PgDn, or j/k when
there are more GPUs than fit the window.