The open-source LLM evaluation framework

Delivered by Confident AI

Regression Testing for LLMs

LLM evaluation metrics to unit test LLM outputs in Python
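
To make the idea of unit testing LLM outputs concrete, here is a minimal sketch in plain Python. It does not use DeepEval's API. The names `keyword_coverage` and `check_output`, the keyword-based scoring, and the thresholds are all hypothetical stand-ins chosen for illustration, and a canned string takes the place of a real model call.

```python
# Hypothetical stand-in metric: NOT DeepEval's API, just plain Python.
def keyword_coverage(output: str, required_keywords: list[str]) -> float:
    """Return the fraction of required keywords that appear in the output."""
    if not required_keywords:
        return 1.0
    text = output.lower()
    hits = sum(1 for kw in required_keywords if kw.lower() in text)
    return hits / len(required_keywords)


def check_output(output: str, required_keywords: list[str],
                 threshold: float = 0.5) -> bool:
    """Pass when the metric score meets the (assumed) threshold."""
    return keyword_coverage(output, required_keywords) >= threshold


def test_refund_answer():
    # A canned string stands in for a real LLM call.
    output = "You can request a full refund within 30 days of purchase."
    assert check_output(output, ["refund", "30 days"], threshold=1.0)
```

A test runner such as pytest would collect `test_refund_answer` like any other unit test. A real evaluation metric would likely score qualities such as relevance or faithfulness rather than keyword matches.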

Prompt and Model Discovery

Gain insights to iterate quickly toward the optimal prompts and models

LLM Red Teaming

Test LLM applications for security and safety vulnerabilities