Tenant: primeassist-dev

P26 dashboard skeleton — module pages land in M2+.

Evals · Judge rubrics

Judge rubrics.

An LLM judge scores each eval-run case across rubric dimensions like grounded-ness, citation precision, tone, and refusal correctness. Judge scoring is additive — it never flips a case's pass / fail status.

New rubric
No rubrics match these filters.