Running large language models on a single GPU for throughput-oriented sc...
Run Mixtral-8x7B models in Colab or on consumer desktops
DPDK infrastructure for software acceleration. Currently working on RX a...
An Epic MegaGrants-backed master's thesis about creating the Next Generat...