E2E LLM evals, with less focus on metrics and more focus on binary assertions | Heykuki News