Docs
Confident AI
Guides
Tutorials
Github
Blog

$ the open-source LLM evaluation framework

Get StartedTry the DeepEval Platform
Delivered by
Confident AI
Unit-Testing for LLMs

LLM evaluation metrics to regression test LLM outputs in Python

Prompt and Model Discovery

Gain insights to quickly iterate towards optimal prompts and model

LLM Red Teaming

Security and safety test LLM applications for vulnerabilities

Documentation
  • Introduction
  • Confident AI
  • Tutorials
  • Guides
Articles You Must Read
  • LLM evaluation metrics
  • LLM-as-a-judge
  • LLM testing
  • LLM chatbot evaluation
Evaluation Community
  • GitHub
  • Discord
  • Newsletter
Copyright © 2025 Confident AI Inc. Built with ❤️ and confidence.