Best 80 Model Compression Open Source Projects

Collection of recent methods on (deep) neural network compression and ac...

A list of high-quality (newest) AutoML works and lightweight models incl...

knowledge distillation papers

Lightweight and Scalable framework that combines mainstream algorithms o...

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

SqueezeLLM: Dense-and-Sparse Quantization

Filter Pruning via Geometric Median for Deep Convolutional Neural Networ...

[CVPR2020] GhostNet: More Features from Cheap Operations

Accelerate your Neural Architecture Search (NAS) through fast, reproduci...

Awesome machine learning model compression research papers, tools, and l...

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile...

Papers for deep neural network compression and acceleration

Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks

Infrastructures™ for Machine Learning Training/Inference in Production.

The Truth Is In There: Improving Reasoning in Language Models with Layer...