Issue

Results Vary Too Much

Repeated or comparable runs produce outputs that vary more than the task, workflow, or user can tolerate.

What This Looks Like

The user runs the same or comparable task more than once and gets outputs that vary beyond what the task can tolerate. The answers may differ in decision, structure, content, classification, recommendation, cited evidence, or level of detail even though the user expected a stable result.

Why It Matters

Some variation is normal in AI output, but too much variation makes a workflow hard to trust. Users cannot tell which answer to use, whether the system is following the same rules, or whether downstream decisions are stable enough to automate, review, or report.

Structural Signal

Comparable inputs under comparable constraints produce outputs that are not equivalent enough for the task. The issue is not merely that wording differs; it is that the variation changes meaning, decision, structure, or workflow usability.

Common Triggers

The prompt leaves important criteria underspecified
Sampling or routing differences change the response path
The task has multiple valid interpretations
Hidden context, memory, or tool results differ between runs
The model is asked to make judgment calls without a stable rubric
The workflow lacks a variance tolerance or reconciliation step

When to Use This Issue

Use this Issue when repeated or comparable runs vary enough to undermine trust, review, automation, or decision-making.

When Not to Use This Issue

Do not use this Issue for harmless wording variation. Do not use it when a visible input, prompt, model, or source change explains the difference. This Issue applies when variation exceeds what the task can tolerate.

Primary Pattern

PAT-0290 — Divergent Outputs

Declared Patterns

PAT-0290

Divergent Outputs

A structural condition where parallel evaluations under comparable scope and shared authority produce non-equivalent outputs.

PAT-0210

Non-Deterministic Execution

A structural condition where equivalent inputs and declared constraints produce divergent outputs across executions.

PAT-0170

Constraints Underspecified

A structural condition where declared constraints are insufficient to eliminate ambiguity or multiple admissible states.

Derived Primary Lenses

LEN-0160

Constraint Sufficiency Lens

Evaluates whether declared constraints are sufficient to eliminate structural degrees of freedom.

LEN-0170

Convergence Lens

Compares parallel structural systems to determine whether they align under shared authority.

LEN-0180

Determinism Lens

Evaluates whether identical structural inputs produce equivalent structural outputs across repeated executions.

LEN-0290

Variance / Entropy Lens

Measures structural variability across repeated or comparable evaluations and identifies divergence beyond expected bounds.

Derived Secondary Lenses

LEN-0140

Compression Lens

Reduces structural graphs into stable minimal representations for comparison, redundancy detection, and diffing.

LEN-0280

Reference Stability Lens

Evaluates whether structural references, identifiers, nodes, and edges remain consistent across execution cycles or comparable states.

Related AI-Adjacent Issues

ADJ-0002

AI Works in One Environment Not Another

The same AI task appears to work in one app, mode, model, account, workspace, or runtime but fails or behaves differently in another.

ADJ-0016

Guardrail Applies to One Model Path Only

A guardrail, policy, validation rule, safety check, or workflow constraint applies to one model path, mode, runtime, or route but not another.

Search Intents

results vary too much
AI results inconsistent
same prompt gives very different answers
output varies too much
AI answer changes too much
model results are unstable