Filter Pruning via Geometric Median for Deep Convolutional Neural Networ...
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via ...
Awesome machine learning model compression research papers, tools, and l...
YOLO model compression and multi-dataset training
Pruning and other network surgery for trained Keras models.
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
Infrastructures™ for Machine Learning Training/Inference in Production.
Neural network model repository for highly sparse and sparse-quantized m...
A PyTorch-based model pruning toolkit for pre-trained language models
An implementation of network slimming pruning for YOLOv3
Reference ImageNet implementation of SelecSLS CNN architecture proposed ...
A model compression and acceleration toolbox based on PyTorch.
This repository contains a PyTorch implementation of the paper "The Lott...
Tutorial notebooks for hls4ml
Learn about the neuromorphic engineering process of creating large-scale i...