Testing Language Models (and Prompts) Like We Test Software | Towards Data Science
TL;DR: You should