There is a specific kind of AI failure that is worse than a chatbot being wrong, and it is a chatbot being wrong with total confidence inside a system that was never built to double check it. The legal profession is now the clearest example of that failure on the planet. Across the United States, attorneys keep submitting briefs that cite court decisions, judicial quotations, and legal authorities that were never written by any judge anywhere, because a generative AI tool invented them and the lawyer trusted the output. The cases look real. The citations are formatted perfectly. The quotations sound exactly like something a court would say. And none of it exists.

This is the hallucination problem in its purest and most dangerous form. A general-purpose chatbot that makes up a fact in casual conversation is a nuisance. The same behavior, dropped into a federal court filing, becomes a sworn document built on fiction. Judges have started catching these fabrications during routine review, and the result has been a steadily escalating run of sanctions, public hearings, and standing orders aimed at a problem that did not meaningfully exist three years ago.

The Scale Of The Problem

This is not a handful of embarrassing one-off incidents anymore. Trackers that monitor court filings have documented roughly 900 instances of AI hallucinations surfacing in US legal filings since 2023, and observers describe the pace as accelerating rather than slowing. The first wave of these cases tended to involve solo practitioners or small firms cutting corners. The newer wave is broader, because AI features have been folded so deeply into mainstream legal research software that many attorneys may not even realize a draft was machine-generated before they signed it.

~900 Documented AI hallucinations found in US court filings since 2023
25+ Federal district courts that have adopted standing orders on AI use in filings
Six figures The size the largest AI-hallucination sanctions have now reached

The financial stakes have climbed in step with the volume. Early sanctions in this category were measured in the low thousands of dollars, intended as a slap on the wrist and a warning to the bar. That ceiling has shattered. In 2026 the reported record for an AI-hallucination penalty reached into six figures, the kind of number that turns a careless shortcut into a firm-threatening event. The escalation has been roughly linear: warnings, then fines in the one to five thousand dollar range, then ten thousand and up, then tens of thousands, and now penalties large enough that bar associations are openly discussing whether repeat offenders should face suspension or disbarment.

A citation is supposed to be a promise. It tells a judge that somewhere in the record of decided law, this exact case said this exact thing. An AI that invents citations does not just make an error. It breaks the one assumption the entire system runs on.

Why The Tools Do This

The reason these failures keep happening is baked into how the underlying models work, and it is worth being precise about it. A large language model does not look up a case in a database and report what it found. It predicts the most statistically likely next words given everything before them. When you ask it for a case that supports a legal argument, it produces text that resembles a real citation, because real citations are exactly the kind of pattern it has absorbed millions of examples of. It is not retrieving a fact. It is generating something that looks like one. When a matching real case happens to exist, the output is correct. When one does not, the model fabricates a plausible substitute rather than admitting the gap, because admitting gaps is not what prediction does by default.

That is the trap. The tool is not malfunctioning when it hallucinates a case. It is doing exactly what it was built to do, which is produce fluent, confident text. The fluency is the danger, because a fabricated citation that reads like a real one will sail past a tired lawyer at 11 p.m. far more easily than an obvious error would. The better these systems get at sounding authoritative, the harder their mistakes are to catch by eye.

The cruel irony is that the more polished and professional the AI output looks, the more dangerous it becomes. A clumsy fake gets caught. A flawless-looking fake gets filed.

How The Courts Are Fighting Back

The judiciary has not waited for the tool vendors to fix this. As of early 2026, at least 25 federal district courts had adopted standing orders that require attorneys to certify whether AI was used in preparing a filing and to confirm that a human being actually reviewed every citation and quotation in the document. Several state supreme courts have held public hearings on the issue. The message from the bench has been consistent and blunt: the convenience of the tool does not transfer the responsibility. If your name is on the filing, you vouched for every word in it, regardless of which software helped you write it.

Some legal experts argue the disclosure rules are already being outrun by the technology, because AI assistance is now so embedded in everyday legal software that drawing a clean line around when it was used is becoming impossible. That is a familiar shape for anyone who has followed AI's spread into other fields. The capability arrives, gets integrated everywhere before anyone agrees on the guardrails, and then the institutions scramble to write rules for a thing that has already changed how the work gets done.

The Lesson For Everyone Else

The legal world is the most visible victim here only because courts keep meticulous public records and a fake citation is easy to check. But the failure underneath it is not unique to law. Any profession that has started leaning on AI to produce confident, formatted, authoritative-sounding output is exposed to the same risk: a fabrication that is wrong in exactly the way that is hardest to notice. Medicine, finance, journalism, and academic research all run on cited facts the same way the courts do. The lawyers are simply the ones getting caught first and fined hardest.

The Verdict

The tool was sold as a research assistant. In court after court it has acted as a fabrication engine that produces fake cases with perfect formatting and total confidence. The sanctions are now severe enough to end careers, and the only reliable defense is the oldest one in the book: check every citation yourself, because the AI will never tell you when it made one up.

If there is a single takeaway from the past two years of these cases, it is that AI output is a draft, never a source. The professionals who have avoided disaster are the ones who treated every machine-generated citation as a claim to be verified rather than a fact to be trusted. The ones now writing six-figure checks treated it the other way around.