Skip to main content

Eval span attributes

When memledger's MAI evaluators run (deterministic, structural, RAGAS), each evaluator emits an OpenTelemetry span with a structured attribute set. Any OTEL-supported observability backend can ingest them; the Phoenix integration renders them as inline annotations on the source span.

What evaluators emit

AttributeTypeSet byUse
eval.outcome"pass" | "fail"all 3 tiersPhoenix filter chips
eval.scorefloatall 3 tiersrubric score
eval.passedboolall 3 tierspass/fail boolean
eval.record_idscomma-separatedall 3 tiersUI deep-link from eval span back to per-record inspector
eval.judge_modelstringRAGAS (Tier 2) onlywhich LLM judged
openinference.span.kind"EVALUATOR"all 3 tiersPhoenix categorization

Configuring evaluator runs

The 3 evaluator tiers (deterministic, structural, RAGAS) all consume the same canonical MAI rubric. The runner is your choice; the rubric is constant. See The evaluator tiers.

Sample annotation in Phoenix

Tier 1 deterministic evaluator annotation in Phoenix

Tier 1 — deterministic evaluator annotation rendered in Phoenix.

Tier 2 RAGAS evaluator annotation in Phoenix

Tier 2 — RAGAS LLM-judge evaluator annotation rendered in Phoenix.

RAGAS evaluator span detail showing judge model, score, and rubric

RAGAS evaluator span detail — judge model, score, rubric criteria.

The full attribute inventory + Phoenix dashboard template ships in a future release.