This tutorial was held on Sunday, 15 September, 1400–1730, Hall 11.
While multi-channel speech enhancement was traditionally approached by linear or non-linear time-variant filtering techniques, in the last years neural network-based solutions have achieved remarkable performance by data-driven learning techniques. Even more recently, hybrid techniques, which blend traditional signal processing with deep learning, have been shown to combine the best of both worlds: achieving excellent enhancement performance, while at the same time being resource efficient and amenable to human interpretability due to the underlying physical model.
Tutorial T8: Microphone array signal processing and deep learning for speech enhancement - strong together [all slides and list of references]
Individual parts of the tutorial