Evaluating LLM systems: Metrics, challenges, and best practices
A detailed consideration of approaches to evaluation and selection