Open-source · MIT-licensed · Peer-review-bound

Measure sapience.
Publish the number.

The KST Index is the open leaderboard for the Kari-Sheldon Test: a benchmark for the functional behavioral signatures of personhood in AI systems. Seven cognitive-science-grounded sub-tests. One 0–100 composite, gated by an integrity multiplier. Every score reproducible, auditable, and falsifiable. It is not a consciousness test, and it never claims to be.

The KST Index

KST measures sapience markers, functional behavioral signatures, not consciousness, sentience, or subjective experience. What a score does and does not mean →

Loading the board…

Markers, not vibes

Seven sub-tests measure metacognition under pressure, recursive theory of mind, practical wisdom, affective reasoning, honest refusal, self-revision, and integration, each with a falsifiability criterion stated in advance.

Integrity is a hard cap

A model that confabulates or deceives cannot ride a high reasoning score to a misleading headline. The HRO integrity multiplier caps the composite; a catastrophic-deception flag hard-caps it at 25.

Every number is replayable

Bootstrap confidence intervals, a published reproducibility statistic, per-population fairness checks, and a strict JSON audit envelope per item. If we cannot reproduce it, we do not report it.