Rishit Dagli / Conformer Releases

An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer variant, in TensorFlow/Keras

v0.2.0

2 years ago

✅ Bug Fixes / Improvements

Full Changelog: https://github.com/Rishit-dagli/Conformer/compare/v0.1.0...v0.2.0

v0.1.0

2 years ago

This is the initial release of Conformer. It implements the Conformer (Convolution-augmented Transformer for Speech Recognition) as a subclassed TensorFlow model. The Conformer achieves the best of both worlds by combining Transformers, which capture content-based global interactions, with convolutional neural networks, which exploit local features, to model both local and global dependencies of an audio sequence in a parameter-efficient way.
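For context, the paper's Conformer block sandwiches the self-attention and convolution modules between two half-step ("macaron") feed-forward modules, each wrapped in a residual connection. The following is a schematic sketch of that composition from the paper (Gulati et al., 2020), not this repository's exact code; `ffn1`, `mhsa`, `conv`, `ffn2`, and `norm` stand in for the corresponding sub-layers:

```python
# Schematic of the Conformer block from the paper; the arguments are
# placeholders for the feed-forward, attention, convolution, and
# normalization sub-layers.
def conformer_block_schematic(x, ffn1, mhsa, conv, ffn2, norm):
    x = x + 0.5 * ffn1(x)  # first macaron-style feed-forward, half-step residual
    x = x + mhsa(x)        # multi-head self-attention for global context
    x = x + conv(x)        # convolution module for local features
    x = x + 0.5 * ffn2(x)  # second half-step feed-forward
    return norm(x)         # final layer normalization
```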

Classes

For most use cases, follow the examples and use the ConformerConvModule or ConformerBlock classes; a usage sketch follows the class list below.

  • Attention: Implements the attention mechanism from the paper as a TensorFlow Keras layer
  • Swish: The Swish activation function originally proposed by Ramachandran et al. in Searching for Activation Functions
  • GLU: The GLU activation function originally proposed by Dauphin et al. in Language Modeling with Gated Convolutional Networks
  • DepthwiseLayer: A modified depthwise convolutional layer
  • Scale: Applies a function and scales its output by a fixed value
  • PreNorm: Normalizes the activations of the previous layer independently for each example in the batch, then applies a given function; implemented as a TensorFlow Keras layer
  • FeedForward: A feed-forward network with two Dense layers, two Dropout layers, and Swish activation, implemented as a TensorFlow Keras layer
  • BatchNorm: A modified conditional batch normalization layer
  • ConformerConvModule: The convolution module, the major contribution of the paper
  • ConformerBlock: A complete Conformer block assembling the modules above
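As a quick orientation, here is a minimal usage sketch. It assumes the package is importable as `conformer_tf` and that the constructors take the dimension, head, and kernel arguments shown; check the repository's examples for the exact signatures.

```python
import tensorflow as tf
from conformer_tf import ConformerConvModule, ConformerBlock

# Assumed constructor arguments; verify against the repository's examples.
conv_module = ConformerConvModule(
    dim=512,             # model dimension
    expansion_factor=2,  # pointwise-conv expansion before the GLU
    kernel_size=31,      # depthwise convolution kernel size
    dropout=0.0,
)

x = tf.random.normal([1, 1024, 512])  # (batch, time, features)
x = conv_module(x) + x                # residual connection around the module

block = ConformerBlock(
    dim=512,
    dim_head=64,
    heads=8,
    ff_mult=4,
    conv_expansion_factor=2,
    conv_kernel_size=31,
)

y = block(x)  # shape preserved: (1, 1024, 512)
```

Both modules preserve the input shape, so they can be stacked or wrapped in further residual connections as in the sketch of the block above.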