⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
Highlights Improvements Examples Bug Fixing
Highlights
Improvements
from_pretrained
When use_neural_speed
(39ecf38e )Examples
Bug Fixing
Validated Configurations
Highlights Features Productivity Examples Bug Fixing
Highlights
Features
Productivity
Examples
Bug Fixing
Validated Configurations
Thanks to these Contributors
Thanks for the contribution from dillonalaird, igeni, sramakintel, alexsin368 and huiyan2021
Welcome to contribute to our project and report issues to us.
Highlights
Improvements
Examples
Bug Fixing
Validated Configurations
Highlights Improvements Examples Bug Fixing Validated Configurations
Highlights
Improvements
Examples
Bug Fixing
Validated Configurations
Highlights Publication Features Examples Bug Fixing Incompatible change
Highlights
Publications
Features
Examples
Bug Fixing
Incompatible Changes
Validated Configurations
Bug Fixing & Improvements
Validated Configurations
Examples
Bug Fixing & Improvements
Validated Configurations
Highlights Features Productivity Examples Bug Fixing API Modification Documentation
Highlights
Features
Productivity
Examples
Bug Fixing
API Modification
Documentation
Validated Configurations
Highlights In this release, we improved NeuralChat, a customizable chatbot framework under Intel® Extension for Transformers. NeuralChat is now available for you to create your own chatbot within minutes on multiple architectures.
Bug Fixing & Improvements
Tests & Tutorials
Validated Configurations
Acknowledgements Thanks for the contributions from sywangyi, jiafuzha and itayariel. Thanks to all the participants to Intel Extension for Transformers.
Highlights
Features
Productivity
Examples
Bug Fixing
Documentation
Validated Configurations