List of efficient attention modules
Absolutely amazing SOTA Google Colab (Jupyter) Notebooks for creating/tr...
Code for scaling Transformers