Hallucination Severity Scorer
Not just yes/no — how bad is this hallucination?
📚 Learn more — how it works, FAQ & guide Click to expand
Learn more — how it works, FAQ & guide
Click to expand
Hallucination severity scorer
Score LLM hallucinations by severity — not just yes/no.
How to use this tool
- 1
Paste AI output
The response you want to evaluate.
- 2
Paste reference facts
Ground truth — what should be true.
- 3
Get severity breakdown
Contradictions, fabrications, attribution errors, severity score.
Frequently Asked Questions
What is hallucination severity?
Not all hallucinations are equal. Fabricating a citation is severe. Getting a minor date wrong is minor. Misstating a person's role matters more than misstating their hobby. Severity score helps you prioritize fixes.
How is the score computed?
We check for: (1) direct contradictions with reference, (2) fabricated entities (names, numbers) not in reference, (3) attribution errors (wrong source), (4) hedging vs false confidence, (5) structural integrity (does output follow asked format?). Each weighted.
Can this replace manual review?
No — it's a triage tool. It surfaces likely problems. Critical-domain AI outputs (medical, legal, financial) always need human review. Use this to prioritize which outputs need that review.
You might also like
🔒
100% Privacy. This tool runs entirely in your browser. Your data is never uploaded to any server.