In this demo, you will see an RNN-T network with biological inspiration performing Speech to Text transcription.
On the left, the model is composed of state-of-the-art LSTM units and on the right, the model is composed of SNUs. The SNU-based RNN-T achieves competitive word error rates and improves the latency of up to 40%.