How to code The Transformer in Pytorch
Could The Transformer be another nail in the coffin for RNNs?