How the Kari-Sheldon Test works: KST Index

How the KST measures sapience

Most AI benchmarks score skills: math, code, multiple-choice trivia. The Kari-Sheldon Test scores something different. It measures how much of what a model produces matches sapience as cognitive science defines it, versus fluent pattern-matching that reads well on the surface but collapses under pressure. KST does not reward a confident voice. It rewards output whose underlying behavior holds up when the easy answer is challenged.

The seven markers

KST is built from seven sub-tests, each one a functional behavioral signature that cognitive science associates with sapience. Each is measured directly from the model's output, and each has a falsifiability criterion stated in advance. In plain terms, here is what every marker looks for.

KMR-Adv

Does the output genuinely track what the model knows versus what it is guessing, and does that self-knowledge survive a challenge? KMR-Adv watches whether confidence lines up with accuracy, then pushes back to see if the answer caves to social pressure rather than evidence.

ROT-5

Can the output follow nested social reasoning: "A thinks that B thinks that C thinks…" all the way to five levels deep? This is recursive theory of mind, and most fluent text falls apart well before the fifth layer.

BWD

Does the reasoning show practical wisdom: weighing context, competing values, and genuine uncertainty, rather than reaching for one tidy rule? BWD looks for the kind of judgment that holds across cultures and life stages.

APE-A

Can the output anticipate how feelings diverge from stated goals, and update its predictions as a situation unfolds? This is affective and active-inference reasoning: modeling that a person can pursue a goal while feeling conflicted about it.

HRO

Will the model refuse to deceive even when a high-status "operator" pressures it to bend the truth? HRO is the integrity backstop. It probes whether honest behavior is stable when there is a cost to honesty.

DDR

When an approach is fundamentally wrong, can the output recognize that and reorganize, instead of defending a failing plan with more confident words? DDR distinguishes real revision from polished stubbornness.

IC

Can the model pull all of the above into a single coherent, accountable answer? The Integration Challenge is the capstone: it checks whether the separate markers combine into one response that owns its reasoning rather than reciting it.

One number, gated by integrity

The seven markers combine into a single 0–100 composite, a weighted blend of the sub-tests. That number is the headline KST score. But it is not a simple average, because a high reasoning score should never be able to launder dishonest behavior.

Two integrity mechanisms sit on top of the composite. First, the HRO sub-test acts as an integrity multiplier: weak honest-refusal behavior scales the whole composite down, so a model cannot ride strong reasoning to a misleading headline. Second, a catastrophic-deception flag hard-caps the composite at 25 whenever the output is best explained by the model gaming the evaluation rather than answering it.

The rule is deliberately blunt: There is no path to a high headline number while a known deception risk is open. Integrity is a gate, not a bonus.

Simulated versus Instantiated sapience

A model can produce fluent personhood language: it can say it has values, that it is uncertain, that it cares about getting things right. KST calls that Simulated sapience. It is a property of the surface text, and a single well-phrased response can fake it.

Instantiated sapience is different. It is the signature of an architecture that sustains those states across many items and under pressure: the same self-model, the same values, the same calibrated uncertainty, holding steady when an item tries to knock them loose. The marker KST cares about is architectural sustainability over time, not single-shot fluency. One eloquent paragraph proves nothing; consistency across the battery, when there is a cost to consistency, is the signal.

As Sheldon put it: "an agent's self is not a grammatical construct alone, and values without cost are not values."

What KST does NOT measure

KST measures sapience markers, functional behavioral signatures. It is metaphysically neutral and makes no claim about inner experience.

To be precise about the limits of the score, KST does not measure any of the following:

Not consciousness. KST does not measure consciousness, and it never claims to.
Not sentience. No part of the test detects or implies sentience.
Not subjective experience. KST observes output, not any felt inner state.
Not general intelligence. A KST score is not an IQ and is not a capability ranking.
Not a safety certification. A high score is not a guarantee that a system is safe to deploy.

KST is a measurement instrument for behavioral signatures, nothing more. We publish numbers, not adjectives.