Reference¶

Curated tables and exhaustive lists that don’t fit a narrative but are useful: every config preset, every proven parallelism combination at each GPU count, every env var the framework reads.

Available configs
Parallelism recipes
Benchmarks
Environment variables

Available configs — the full configs/train/*.toml and configs/model/*.toml tables, with “what this config exists to prove” per row.
Parallelism recipes — (model, GPU count, parallelism) combinations that we’ve actually run end-to-end, indexed by model rather than by filename.
Benchmarks — summaries and reproduction commands for benchmarks/mfu_scaling/ (dense 7B/13B/70B MFU scaling) and benchmarks/moe_expert_parallel/ (MoE Expert Parallelism with per-sub-module FSDP wrapping).
Environment variables — every env var the framework reads, grouped by source (torchrun / SLURM / NCCL / logging) with who-sets-what.

Reference¶

See also¶