My implementation of the original transformer model (Vaswani et al.). I'...
Attention Is All You Need | a PyTorch Tutorial to Transformers