LLM evaluation metrics to unit test LLM outputs in Python
Gain insights from evaluation results to iterate quickly toward optimal hyperparameters
Evaluate existing LLM applications built with other frameworks
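
To illustrate what unit testing an LLM output with a metric can look like, here is a minimal, framework-agnostic sketch. It does not use this library's API: `token_overlap_score`, `assert_llm_output`, the example strings, and the 0.7 threshold are all hypothetical names and values chosen for illustration, and the token-overlap metric stands in for whatever evaluation metric is actually used.

```python
import re


def token_overlap_score(output: str, expected: str) -> float:
    """Hypothetical metric: fraction of expected tokens that appear in the output."""
    def tokenize(text: str) -> set[str]:
        # Lowercase and keep alphanumeric runs so punctuation does not matter.
        return set(re.findall(r"[a-z0-9]+", text.lower()))

    expected_tokens = tokenize(expected)
    if not expected_tokens:
        # Nothing is required, so any output trivially satisfies the metric.
        return 1.0
    return len(expected_tokens & tokenize(output)) / len(expected_tokens)


def assert_llm_output(output: str, expected: str, threshold: float = 0.7) -> float:
    """Fail like a unit test when the metric score falls below the threshold."""
    score = token_overlap_score(output, expected)
    assert score >= threshold, f"score {score:.2f} is below threshold {threshold:.2f}"
    return score


def test_refund_answer() -> None:
    # In a real test, `output` would come from the LLM application under test.
    output = "You can request a full refund within 30 days of purchase."
    expected = "Full refund within 30 days"
    assert_llm_output(output, expected, threshold=0.7)
```

Because the check is a plain assertion inside a `test_` function, a runner such as pytest can collect it alongside ordinary unit tests. The threshold is itself a setting worth tuning while iterating.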