Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Towards efficient end-to-end speech recognition with biologically-inspired neural networks (2110.02743v2)

Published 4 Oct 2021 in eess.AS, cs.LG, cs.NE, and q-bio.QM

Abstract: Automatic speech recognition (ASR) is a capability which enables a program to process human speech into a written form. Recent developments in AI have led to high-accuracy ASR systems based on deep neural networks, such as the recurrent neural network transducer (RNN-T). However, the core components and the performed operations of these approaches depart from the powerful biological counterpart, i.e., the human brain. On the other hand, the current developments in biologically-inspired ASR models, based on spiking neural networks (SNNs), lag behind in terms of accuracy and focus primarily on small scale applications. In this work, we revisit the incorporation of biologically-plausible models into deep learning and we substantially enhance their capabilities, by taking inspiration from the diverse neural and synaptic dynamics found in the brain. In particular, we introduce neural connectivity concepts emulating the axo-somatic and the axo-axonic synapses. Based on this, we propose novel deep learning units with enriched neuro-synaptic dynamics and integrate them into the RNN-T architecture. We demonstrate for the first time, that a biologically realistic implementation of a large-scale ASR model can yield competitive performance levels compared to the existing deep learning models. Specifically, we show that such an implementation bears several advantages, such as a reduced computational cost and a lower latency, which are critical for speech recognition applications.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Thomas Bohnstingl (4 papers)
  2. Ayush Garg (12 papers)
  3. Stanisław Woźniak (16 papers)
  4. George Saon (39 papers)
  5. Evangelos Eleftheriou (23 papers)
  6. Angeliki Pantazi (13 papers)
Citations (5)

Summary

We haven't generated a summary for this paper yet.