A python package to build AI-powered real-time audio applications
Major changes in this new version! Including compatibility with pyannote 3.*, SpeechBrain, WeSpeaker and NeMo embedding models, totaling 8 new models to create speaker diarization pipelines and 1 new model for voice activity detection.
This version also adds compatibility with ONNX models and a documentation page at diart.readthedocs.io
Thank you @sorgfresser for your huge contribution in #188 !
Full Changelog: https://github.com/juanmc2005/diart/compare/v0.8...v0.9
diart.stream
by @juanmc2005 in #183PipelineConfig.from_dict()
by @juanmc2005 in #189Thank you @sneakers-the-rat for your extremely valuable feedback and help as part of the JOSS review!
Full Changelog: https://github.com/juanmc2005/diart/compare/v0.7...v0.8
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.6...v0.7
cropping_mode
to DelayedAggregation
by @bhigy in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/105
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.5.1...v0.6
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.5...v0.5.1
study_or_path
as a Path for conversion from string by @AMITKESARI2000 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/74
diart.benchmark
when output is provided by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/86
Thank you @AMITKESARI2000, @ckliao-nccu and @zaouk for all the bug fixes!
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.4...v0.5
resolve_features
with TemporalFeatureFormatter
by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/59
pyannote.audio
optional (still mandatory to run default pipeline) by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/61
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.3...v0.4
OverlapAwareSpeakerEmbedding
class by @juanmc2005 in #51RealTimeInference
and Benchmark
by @juanmc2005 in #55Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.2.1...v0.3
buffer_output
causing a crash by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/24
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.2...v0.2.1
operators.aggregate()
with functional.DelayedAggregation
by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/16
DelayedAggregation
by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/18
OutputBuilder
+ better demo performance by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/20
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.1...v0.2