Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
LICENSE-NEURALMAGIC removed and consolidated model license attributions into a single file. (#400, #409)
sparsezoo.analyze functionality has been renamed to sparsezoo.analyze_v1 for clarity. (#460)
RegistryMixin class. (#385)
sparsezoo.model.download now works as intended, where it would previously not download all of the files necessary for Transformers-based models. (#422)
TypeErrors instead of ValueErrors improved. (#427)
sparsezoo.analyze command-line tool were corrected. (#433)
metrics.yaml files, whereas before these files were not available. (#431)
sparsezoo.analyze results no longer results in serialization errors for channelwise quantized models. (#455)
This is a patch release for 1.6.0 that contains the following changes:
Version support added:
The initial feature set for SparseZoo V3 web UI is now live, where the home page has been restructured to include and highlight generative AI models.
SparseZoo V2 model file structure and V2 stubs enabled, which expands the number of supported files and reduces the number of bytes that need to be downloaded for model checkpoints, folders, and files. It also simplifies the stubs used to access models in the SparseZoo. (Documentation: V2 file structure and stubs docs will be added in v1.7) (#286, #271, #355, #359, #354, #361, #363, #368, #370, #373)
SparseZoo Analyze CLI and APIs added to enable simple functions for quickly checking general and sparsification info for params, operations, reads/writes, and overall model layouts. (#288, #344, #345)
RegistryMixin class and patterns added, enabling a centralized and universal registry across Neural Magic's repos and products. (#365)
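As a rough illustration of the registry pattern that the RegistryMixin note describes, the sketch below shows one way a mixin can give each base class its own name-to-class table. It is not SparseZoo's implementation: SimpleRegistryMixin, its method names, and the Dataset and ImageNet classes are all hypothetical names chosen for this example.

```python
from collections import defaultdict
from typing import Dict, List, Optional


class SimpleRegistryMixin:
    """Hypothetical mixin: every class that calls register() owns a name -> class table."""

    # Maps each registry root class to its {name: registered class} table.
    _registry: Dict[type, Dict[str, type]] = defaultdict(dict)

    @classmethod
    def register(cls, name: Optional[str] = None):
        """Class decorator that records the decorated class under `name`."""

        def decorator(value: type) -> type:
            key = name or value.__name__
            table = SimpleRegistryMixin._registry[cls]
            if key in table:
                raise KeyError(f"{key!r} is already registered under {cls.__name__}")
            table[key] = value
            return value

        return decorator

    @classmethod
    def load_from_registry(cls, name: str, **kwargs):
        """Instantiate the class registered under `name` for this root class."""
        table = SimpleRegistryMixin._registry[cls]
        if name not in table:
            raise KeyError(f"No class registered as {name!r} under {cls.__name__}")
        return table[name](**kwargs)

    @classmethod
    def registered_names(cls) -> List[str]:
        """List the names registered for this root class, sorted alphabetically."""
        return sorted(SimpleRegistryMixin._registry[cls])


class Dataset(SimpleRegistryMixin):
    """Example registry root (hypothetical)."""


@Dataset.register("imagenet")
class ImageNet(Dataset):
    def __init__(self, split: str = "train"):
        self.split = split


dataset = Dataset.load_from_registry("imagenet", split="val")
```

Keying the table on the root class keeps unrelated registries (for example datasets and models) from colliding on the same name, which is one plausible reading of "centralized and universal" here.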
To address DeepSparse deployment pipelines failing due to missing files, the following models have been updated to include new tokenizer files in the deployment directory across dense, sparse, and sparse-quantized versions, with the targeted datasets:
This is a patch release for 1.5.0 that contains the following changes:
This is a patch release for 1.5.0 that contains the following changes:
sparsezoo.analyze CLI to enable easy analysis of ONNX models including performance and sparsity metrics (#263) (#281)
sparsezoo.deployment_package CLI to enable easy packaging of models from the SparseZoo for deployments (#261)
export NM_DISABLE_ANALYTICS=True (#287)
ModelAnalysis.from_onnx(...) updated to accept ModelProto objects rather than just ONNX files. (#253)
This is a patch release for 1.3.0 that contains the following changes:
tokenizer_config.json added as a required file for Transformers models.
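A quick pre-deployment check can catch a missing required file before a pipeline fails on it. The sketch below is a minimal, standard-library illustration and not part of SparseZoo: missing_files and REQUIRED_FILES are hypothetical names, and only tokenizer_config.json is taken from the note above; the other file names are assumptions for the example.

```python
from pathlib import Path
from typing import Iterable, List


def missing_files(directory: str, required: Iterable[str]) -> List[str]:
    """Return the required file names that are absent from `directory`."""
    root = Path(directory)
    return [name for name in required if not (root / name).is_file()]


# Only tokenizer_config.json comes from the release note;
# model.onnx and config.json are illustrative assumptions.
REQUIRED_FILES = ("model.onnx", "config.json", "tokenizer_config.json")
```

Calling missing_files("path/to/deployment", REQUIRED_FILES) returns an empty list when every listed file is present, so the result can be used directly in a guard before loading a pipeline.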