🐢 Open-Source Evaluation & Testing framework for LLMs and ML models
Test your prompts, models, and RAGs. Catch regressions and improve promp...
The LLM Evaluation Framework
The all-in-one LLM developer platform: prompt management, evaluation, hu...
Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, le...
Evaluating LLMs with CommonGen-Lite
Framework for LLM evaluation, guardrails and security
A simple GPT-based evaluation tool for multi-aspect, interpretable asses...
A collection of hands-on notebooks for LLM practitioners
The implementation for the EMNLP 2023 paper “Beyond Factuality: A Comprehens...