User guide¶
pybench ships two example benchmarks you can run yourself.
Start with the synthetic example — a fast, dependency-light loss-curve benchmark that runs in seconds. It is the quickest way to watch pybench save a baseline and then report a PASS or a FAIL, and it doubles as a hands-on demonstration of why pybench’s statistics hold up where simpler tests don’t.
Then see the MNIST example — a real neural-network training run, where pybench catches a regression in a genuinely noisy metric and walks through the full baseline → regression → re-baseline lifecycle.