How to Train Your HuggingFace Models Twice As Fast

This article summarizes 14 experiments and 5 reproducibility experiments on 2+1 optimizations that use dynamic padding and uniform length batching to reduce training time.
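To make the two ideas concrete before the details, here is a minimal, library-free sketch in plain Python. It is an illustration under stated assumptions, not the code used in the experiments: the names `PAD_ID`, `pad_batch`, `uniform_length_batches`, and `count_tokens` are hypothetical, and the toy sequences are made up. Dynamic padding pads each batch only to its own longest sequence; uniform length batching groups sequences of similar length so that little padding is needed at all.

```python
from typing import List

PAD_ID = 0  # hypothetical padding token id, chosen for illustration


def pad_batch(batch: List[List[int]], pad_id: int = PAD_ID) -> List[List[int]]:
    """Dynamic padding: pad only up to the longest sequence in this batch."""
    max_len = max(len(seq) for seq in batch)
    return [seq + [pad_id] * (max_len - len(seq)) for seq in batch]


def uniform_length_batches(
    sequences: List[List[int]], batch_size: int
) -> List[List[List[int]]]:
    """Uniform length batching: sort by length, then cut into batches,
    so each batch holds sequences of similar length."""
    ordered = sorted(sequences, key=len)
    return [ordered[i:i + batch_size] for i in range(0, len(ordered), batch_size)]


def count_tokens(batches: List[List[List[int]]]) -> int:
    """Total number of token slots (real tokens plus padding) across batches."""
    return sum(len(row) for batch in batches for row in batch)


# Toy data: six sequences with lengths 3, 12, 4, 11, 2, 10.
sequences = [[1] * n for n in (3, 12, 4, 11, 2, 10)]

# Baseline: pad everything to one fixed length (here the global maximum, 12).
fixed = [[seq + [PAD_ID] * (12 - len(seq)) for seq in sequences]]

# Both techniques combined: group by length, then pad per batch.
dynamic = [pad_batch(b) for b in uniform_length_batches(sequences, batch_size=3)]

print(count_tokens(fixed), count_tokens(dynamic))
```

On this toy data the fixed-length baseline processes 72 token slots while the combined approach processes 48, because the short sequences are no longer padded up to the longest one in the dataset. A real training loop would usually also shuffle the order of the batches so the model does not see examples strictly from shortest to longest.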