GitHub - Stability-AI/lm-evaluation-harness: A framework for few-shot evaluation of autoregressive language models.
