Install
Write your first eval
Add an LLM judge
Load cases from a file
Run in parallel
Block CI on regression
Next steps
Deterministic evaluators
All 11 built-in deterministic checks
LLM judge evaluators
Faithfulness, hallucination, relevance, and more
Agent evaluation
Tool call accuracy and plan quality
CI/CD integration
Run evals as a quality gate

