Tenant: primeassist-dev

P26 dashboard skeleton — module pages land in M2+.


New judge rubric.

A rubric bundles a prompt template, a list of dimensions, a scoring scale, and an LLM provider. The eval runner invokes the rubric once per case, after the case lands, and persists one row per dimension.
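As a rough illustration of the "one row per dimension" behavior, here is a minimal Python sketch. The names `JudgeRubric`, `ScoreRow`, and `rows_for_case`, the field layout, and the integer `(min, max)` scale are all assumptions made for the example; the actual schema and runner are not shown on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class JudgeRubric:
    """Hypothetical shape of a rubric: the four parts named above."""
    prompt_template: str
    dimensions: tuple[str, ...]
    scale: tuple[int, int]  # assumed inclusive (min, max) integer scale
    provider: str           # assumed to be a provider identifier string


@dataclass(frozen=True)
class ScoreRow:
    """Hypothetical persisted row: one per (case, dimension) pair."""
    case_id: str
    dimension: str
    score: int


def rows_for_case(rubric: JudgeRubric, case_id: str,
                  scores: dict[str, int]) -> list[ScoreRow]:
    """Turn the judge's per-dimension scores into one row per dimension."""
    low, high = rubric.scale
    rows = []
    for dimension in rubric.dimensions:
        score = scores[dimension]  # KeyError if the judge skipped a dimension
        if not low <= score <= high:
            raise ValueError(f"{dimension}: score {score} outside {low}..{high}")
        rows.append(ScoreRow(case_id, dimension, score))
    return rows


rubric = JudgeRubric(
    prompt_template="Score the answer on {dimension}.",
    dimensions=("groundedness", "tone"),
    scale=(1, 5),
    provider="example-provider",
)
rows = rows_for_case(rubric, "case-1", {"groundedness": 4, "tone": 5})
```

In this sketch a rubric with two dimensions yields two rows for each case, so the row count for a run would be cases times dimensions.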

Dimensions
groundedness, citation_precision, tone, refusal_correctness

Each dimension is scored by the LLM judge. Provide 1 to 16 entries, each 1 to 64 characters long.
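The limits above could be checked with something like the following Python sketch. The function name `validate_dimensions` and the error wording are hypothetical, and the sketch checks only the two stated limits; whether the form also trims whitespace or rejects duplicate names is not stated here.

```python
MAX_DIMENSIONS = 16   # stated limit: 1 to 16 entries
MAX_NAME_CHARS = 64   # stated limit: 1 to 64 characters per entry


def validate_dimensions(dimensions: list[str]) -> list[str]:
    """Return a list of error messages; an empty list means the input is valid."""
    errors = []
    if not 1 <= len(dimensions) <= MAX_DIMENSIONS:
        errors.append(
            f"expected 1 to {MAX_DIMENSIONS} dimensions, got {len(dimensions)}"
        )
    for index, name in enumerate(dimensions):
        if not 1 <= len(name) <= MAX_NAME_CHARS:
            errors.append(
                f"dimension {index}: expected 1 to {MAX_NAME_CHARS} characters, "
                f"got {len(name)}"
            )
    return errors


default_errors = validate_dimensions(
    ["groundedness", "citation_precision", "tone", "refusal_correctness"]
)
```

Returning every violation at once, instead of raising on the first, lets a form show all problems in a single pass.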
