The problem has a precise name

An AI hallucination is a confident, fluent, and incorrect output from a generative AI system. The term is borrowed loosely from cognitive science, but in AI it means something specific: the model generates text (or code, or citations) that sounds authoritative and plausible but is factually wrong.

This is not a bug that will be patched. It’s a structural property of how current large language models work.

Why language models hallucinate

Large language models (LLMs) are trained to predict the most probable next token given a context. They are not retrieving facts from a database — they are pattern-matching at enormous scale.

When a model is asked about something outside its training distribution, or where the training data was sparse or contradictory, it doesn’t say “I don’t know” by default. It generates the most statistically probable continuation of the prompt — which may sound like a confident answer but isn’t grounded in any specific source.

This means:

  • Citations are invented — the author, journal, year, and title all pattern-match plausible academic text, but the paper doesn’t exist
  • Statistics are approximate or fabricated — the model has seen many statistics and will generate one that fits
  • Legal and medical information is especially dangerous — high-stakes domains where the model has been trained on expert-sounding text but cannot verify accuracy

Hallucination frequency in practice

Independent benchmarks (as of early 2026) show that leading LLMs hallucinate in roughly 3–27% of factual queries depending on domain, prompt structure, and model version. In legal document review, hallucination rates for specific citations can exceed 30%.

The wide range matters: models are better at some tasks than others. Code generation has much lower hallucination rates than biography or historical fact retrieval.

Three patterns to recognize

1. The authoritative citation

“According to a 2023 study published in Nature Human Behaviour by researchers at ETH Zurich (Smith et al., 2023), AI systems demonstrate X.”

Verify before you use it. The paper may not exist. Smith et al. (2023) is one of the most common hallucinated citation patterns.

2. The statistical confidence trick

“Approximately 67% of organizations reported…”

Check where this number comes from. If the model can’t point to a source, it’s a generated statistic.

“Under GDPR Article 13(2)(f), organizations are required to…”

GDPR Article 13 exists. But the specific obligation described may be paraphrased incorrectly, applied to the wrong context, or simply invented.

What to do in practice

Use AI for generation, not verification. AI can draft, summarize, and ideate. Verification of facts, citations, and legal requirements must go through authoritative sources.

Ask for sources explicitly. Even if the model hallucinates sources, asking “what is your source for this?” shifts your attention to the verification step.

Cross-reference with RAG systems. Retrieval-Augmented Generation (RAG) architectures ground model output in real documents. If your organization uses AI for knowledge management, RAG significantly reduces hallucination risk.

Build verification into your workflow. Any AI-generated content that includes facts, statistics, or legal claims should pass through a human reviewer before publication or use in decisions.


The bottom line

Hallucinations are not going away — they are a structural feature of current-generation LLMs. The organizations that use AI well are the ones that deploy it for tasks where fluency matters more than precision, and build verification into every workflow where precision is critical.

Treating AI as a “knowledgeable colleague” is the most dangerous mental model. Treat it as a very fast, very fluent first draft that needs to be checked.