I think I can speak for us all when I say I don't understand what all the fuss is about:
We present aTENNuate, a simple deep state-space autoencoder configured for efficient online raw speech enhancement in an end-to-end fashion. The network's performance is primarily evaluated on raw speech denoising, with additional assessments on tasks such as super-resolution and...
arxiv.org
Raw Speech Enhancement with Deep State Space Modeling
Yan Ru Pei,
Ritik Shrivastava,
FNU Sidharth
We introduced a lightweight deep state-space autoencoder, aTENNuate, that can perform raw audio denoising, superresolution, and de-quantization. Compared to previous works, the key features of this network are: 1) consisting of statespace layers that can be efficiently trained and configured for inference, 2) allowing for real-time inference with low latency, 3) architecturally simple and light in parameters and MACs, 4) capable of processing raw audio waveforms directly without requiring pre/post-processing, and 5) highly competitive with other speech enhancement solutions.