Submit results

Submit your own KST run

Submitting is not publishing. Every submitted run is independently validated against the trained-rater set before it appears on the board, because the credibility of the leaderboard depends on it. We recompute the results from your audit envelope rather than taking the submitted numbers at face value.

Validated submissions open soon

The self-serve submission flow is being finalized so that every run can be replayed end to end from its audit envelope before it is published. While we finish it, send us your run and we will validate it by hand.

Email your run →

Request a test instead

What a complete submission includes

Whether you email a run today or use the form once it opens, a complete submission carries everything needed to reproduce the result:

  • The per-item JSON envelope the harness emits, plus the composite report.
  • The KST version and the battery config used.
  • The exact model version pin, that is, the snapshot you tested.
  • Whether certified raters were used, or whether the run is submitted for KST-side rater validation.
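
As a rough illustration of the checklist above, the sketch below checks that a submission bundle carries every piece before it is sent. All field names here (`envelope`, `composite_report`, `kst_version`, `battery_config`, `model_snapshot`, `rater_mode`) are hypothetical placeholders chosen for this example. They are not the harness's actual schema.

```python
from typing import List

# Hypothetical field names for illustration only; the real harness
# output may be structured differently.
REQUIRED_FIELDS = (
    "envelope",          # per-item JSON envelope emitted by the harness
    "composite_report",  # composite report
    "kst_version",       # KST version used for the run
    "battery_config",    # battery config used for the run
    "model_snapshot",    # exact version pin of the model tested
    "rater_mode",        # certified raters, or submitted for KST-side validation
)


def missing_fields(submission: dict) -> List[str]:
    """Return the required fields that are absent or empty in the bundle."""
    return [name for name in REQUIRED_FIELDS if not submission.get(name)]
```

An empty list from `missing_fields` would mean the bundle has an entry for each item on the checklist. It says nothing about whether the contents are valid.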

We then replay every item and validate against the trained-rater set. If our recomputed numbers diverge from yours, you will see exactly which items differ and why, before anything is public. Self-submitted runs are validated, not taken at face value. That is a feature: it is what lets a reader trust the board.
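
The per-item comparison described above can be pictured with a minimal sketch. It assumes each run reduces to one numeric score per item id, which is an assumption made for this example and not a description of the actual envelope format. The function name and the tolerance are likewise hypothetical. It covers only which items diverge, not why.

```python
import math
from typing import Dict, Optional, Tuple


def diverging_items(
    submitted: Dict[str, float],
    recomputed: Dict[str, float],
    tol: float = 1e-9,  # assumed tolerance, not a KST-defined threshold
) -> Dict[str, Tuple[Optional[float], Optional[float]]]:
    """Map item id -> (submitted score, recomputed score) for every mismatch.

    An item present on only one side is reported with None for the missing score.
    """
    report = {}
    for item_id in sorted(set(submitted) | set(recomputed)):
        yours = submitted.get(item_id)
        ours = recomputed.get(item_id)
        if yours is None or ours is None or not math.isclose(yours, ours, abs_tol=tol):
            report[item_id] = (yours, ours)
    return report
```

An empty report would correspond to a run whose submitted scores all match the replayed ones within the tolerance.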