Reformer, the efficient Transformer, in PyTorch
Add dim_head keyword to set the dimension of each attention head explicitly
dim_head
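
The effect of a keyword like dim_head can be sketched as below. This is a minimal illustration, assuming the usual multi-head convention in which the per-head dimension defaults to dim // heads when no value is given. The helper name resolve_head_dims and its parameters dim and heads are hypothetical and are not taken from this repository's code.

```python
from typing import Optional, Tuple


def resolve_head_dims(
    dim: int, heads: int, dim_head: Optional[int] = None
) -> Tuple[int, int]:
    """Return (dim_head, inner_dim) for a multi-head attention layer.

    Hypothetical helper for illustration only; not the library's actual code.
    """
    if dim_head is None:
        # Assumed default: split the model dimension evenly across heads.
        if dim % heads != 0:
            raise ValueError(
                "dim must be divisible by heads when dim_head is not given"
            )
        dim_head = dim // heads
    # Width of the concatenated heads, i.e. the output size a q/k/v
    # projection would need before the heads are split apart.
    inner_dim = dim_head * heads
    return dim_head, inner_dim


# Default behaviour: head size is derived from the model dimension.
default_dims = resolve_head_dims(dim=512, heads=8)
# Explicit dim_head: head size is fixed independently of the model dimension.
fixed_dims = resolve_head_dims(dim=512, heads=8, dim_head=32)
```

With an explicit dim_head, the concatenated head width (inner_dim) no longer has to equal dim, so a layer built this way would need an output projection from inner_dim back to dim.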