Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena

Scalable, interpretable automated evaluation of LLM alignment with human preferences using MT-Bench and Chatbot Arena.
Alignment
Author

Imad Dabbura

Published

May 30, 2024

Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena

#nlp #fine-tuning #eval

Back to top