The KST Index
KST measures sapience markers, functional behavioral signatures, not consciousness, sentience, or subjective experience. What a score does and does not mean →
Loading the board…
Markers, not vibes
Seven sub-tests measure metacognition under pressure, recursive theory of mind, practical wisdom, affective reasoning, honest refusal, self-revision, and integration, each with a falsifiability criterion stated in advance.
Integrity is a hard cap
A model that confabulates or deceives cannot ride a high reasoning score to a misleading headline. The HRO integrity multiplier caps the composite; a catastrophic-deception flag hard-caps it at 25.
Every number is replayable
Bootstrap confidence intervals, a published reproducibility statistic, per-population fairness checks, and a strict JSON audit envelope per item. If we cannot reproduce it, we do not report it.