Evaluating LLM systems: Metrics, challenges, and best practices

A detailed consideration of approaches to evaluation and selection