In 2026, "hallucination rate" is a vanity metric if you don’t define the test....

https://lukasnpyy234.cavandoragh.org/claude-vs-gpt-which-is-better-at-admitting-i-don-t-know

In 2026, "hallucination rate" is a vanity metric if you don’t define the test. Comparing benchmarks like Vectara’s HHEM against AA-Omniscience is apples to oranges; one measures factual grounding while the other tests logic

Submitted on 2026-05-18 08:01:16