Collection of recent methods on (deep) neural network compression and ac...
A list of high-quality (newest) AutoML works and lightweight models incl...
knowledge distillation papers
Lightweight and Scalable framework that combines mainstream algorithms o...
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
SqueezeLLM: Dense-and-Sparse Quantization
Filter Pruning via Geometric Median for Deep Convolutional Neural Networ...
[CVPR2020] GhostNet: More Features from Cheap Operations
Accelerate your Neural Architecture Search (NAS) through fast, reproduci...
Awesome machine learning model compression research papers, tools, and l...
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile...
Papers for deep neural network compression and acceleration
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
Infrastructures™ for Machine Learning Training/Inference in Production.
The Truth Is In There: Improving Reasoning in Language Models with Layer...