⚠️

Hallucination Severity Scorer

Not just yes/no — how bad is this hallucination?

📚
Learn more — how it works, FAQ & guide
Click to expand

Hallucination severity scorer

Score LLM hallucinations by severity — not just yes/no.

How to use this tool

  1. 1

    Paste AI output

    The response you want to evaluate.

  2. 2

    Paste reference facts

    Ground truth — what should be true.

  3. 3

    Get severity breakdown

    Contradictions, fabrications, attribution errors, severity score.

Frequently Asked Questions

What is hallucination severity?
Not all hallucinations are equal. Fabricating a citation is severe. Getting a minor date wrong is minor. Misstating a person's role matters more than misstating their hobby. Severity score helps you prioritize fixes.
How is the score computed?
We check for: (1) direct contradictions with reference, (2) fabricated entities (names, numbers) not in reference, (3) attribution errors (wrong source), (4) hedging vs false confidence, (5) structural integrity (does output follow asked format?). Each weighted.
Can this replace manual review?
No — it's a triage tool. It surfaces likely problems. Critical-domain AI outputs (medical, legal, financial) always need human review. Use this to prioritize which outputs need that review.

You might also like

🔒
100% Privacy. This tool runs entirely in your browser. Your data is never uploaded to any server.