Implementation of ConvMixer for "Patches Are All You Need? 🤷"
These weights have slightly different parameter names and aren't compatible with this codebase.
We provide weights for:
nn.GELU()
to nn.ReLU()
in convmixer.py
to use these weights; we will fix this later.