Native integration with Pytest that fits right into your workflow.
40+ research-backed metrics, including custom G-Eval and deterministic metrics.
Covers any use case and any system architecture, including multimodal systems.
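
To make the idea of a deterministic metric inside a Pytest test concrete, here is a minimal sketch in plain Python. It does not use this library's API: `ExactMatchMetric`, `fake_llm_app`, and the threshold convention are hypothetical names invented for illustration, and the only Pytest behavior assumed is that functions named `test_*` are collected and that a plain `assert` fails the test.

```python
from dataclasses import dataclass


@dataclass
class ExactMatchMetric:
    """Hypothetical deterministic metric: 1.0 on a normalized exact match, else 0.0."""

    threshold: float = 1.0

    def measure(self, actual_output: str, expected_output: str) -> float:
        # Normalize case and surrounding whitespace so trivial differences do not fail the check.
        same = actual_output.strip().lower() == expected_output.strip().lower()
        return 1.0 if same else 0.0

    def is_successful(self, score: float) -> bool:
        # The metric passes when the score reaches the configured threshold.
        return score >= self.threshold


def fake_llm_app(prompt: str) -> str:
    # Stand-in for the system under test (assumption); a real test would call the application here.
    return " Paris "


def test_capital_of_france():
    # Pytest collects this function by its test_ prefix; the assert decides pass or fail.
    metric = ExactMatchMetric()
    score = metric.measure(fake_llm_app("What is the capital of France?"), "paris")
    assert metric.is_successful(score)
```

Saved in a file such as `test_example.py`, this would be picked up by running `pytest test_example.py`. Because the metric involves no model call, the same input always produces the same score, which is the property that distinguishes deterministic metrics from LLM-judged ones.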