Product2026-04-01
What evaluation insights should actually show
Dashboards should explain memory quality, not just display token counts.
Users do not need another vanity dashboard. They need to know whether the memory layer is improving agent outcomes.
Good evaluation views surface reuse rate, score distributions, failure clusters, and recall quality over time.
That makes the memory system operational instead of mystical.