Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
Full Changelog: https://github.com/ELS-RD/transformer-deploy/compare/v0.2.0...v0.3.0
QDQRoberta model
Full Changelog: https://github.com/ELS-RD/transformer-deploy/compare/v0.1.1...v0.2.0
All the scripts to reproduce the article at https://medium.com/p/e1be0057a51c