Quickstart¶

Every command works as either kempnerpulse or the shorter kp.

Launch the live dashboard¶

kempnerpulse

This is equivalent to kempnerpulse --backend dcgm --poll 0.1 — it streams directly from dcgmi dmon at ~100 ms resolution and shows every GPU visible to your process (respecting CUDA_VISIBLE_DEVICES / SLURM_JOB_GPUS). Press Ctrl-C to quit.

One-shot snapshot¶

kp --once          # render a single frame and exit

Export CSV¶

kp --export                 # default columns, to stdout
kp --export all > run.csv   # every column
kp --export sm_active_pct,tensor_active_pct,real_util_pct   # a custom set

See CSV Export Reference for the full column reference.

Pick a weight preset¶

The Real Utilization composite is a weighted blend of SM / Tensor / DRAM / GR counters. Choose a preset for your workload, or supply custom weights:

kp --ai-weights      # AI / LLM training & inference (default): 0.35,0.35,0.20,0.10
kp --hpc-weights     # general HPC:                              0.45,0.15,0.25,0.15
kp --mem-weights     # memory-bound / bandwidth-heavy:           0.35,0.10,0.40,0.15
kp --weights 0.40,0.30,0.20,0.10   # custom (normalized to sum to 1)

Choose which GPUs to show¶

kp --focus-gpu 0     # start focused on GPU 0
kp --gpus 0,1        # only GPUs 0 and 1 (also accepts ranges like 0-3)
kp --show-all        # ignore SLURM/CUDA env and show every accessible GPU

Switch backends¶

kp --backend prometheus --source http://localhost:9400/metrics --poll 1.0
kp --backend replay --source capture.txt --once     # no GPU needed

Interactive commands¶

While the live dashboard is running, type :-prefixed commands:

Command	Action
`:focus <id>`	focus a single GPU’s detailed view
`:plot`	line-chart view (history sparklines)
`:job`	running-process table
`:q`	step back one view (focus/plot/job → fleet)
`:exit`	quit

In the fleet view, scroll card rows with ↑/↓, PgUp/PgDn, or j/k when there are more GPUs than fit the window.