Training a better GPT model: lessons from PaLM

Revisiting the model released in 2018