Benchmark LLMs by making them fight each other in Street Fighter 3! A new way to evaluate the quality of an LLM.
https://github.com/OpenGenerativeAI/llm-colosseum/assets/19614572/6c54a7af-bc07-4de5-a66f-24bf754d182a