Multimodal Speech Separation with Feedback Architecture

This thesis tackles the high computational cost, and output inconsistency of processing long audio sequences in multimodal speech separation. We introduce the Self-Feedback RE-Sepformer (SFRS), an architecture integrating an RNN-inspired incremental inference mechanism with RE-Sepformer backbone. SF...

Täydet tiedot

Bibliografiset tiedot
Päätekijä:	Qu, Yanren
Muut tekijät:	Informaatioteknologian tiedekunta, Faculty of Information Technology, Jyväskylän yliopisto, University of Jyväskylä
Aineistotyyppi:	Pro gradu
Kieli:	eng
Julkaistu:	2025
Aiheet:	Master's Degree Programme in Artificial Intelligence
Linkit:	https://jyx.jyu.fi/handle/123456789/102963

Internet

https://jyx.jyu.fi/handle/123456789/102963

Multimodal Speech Separation with Feedback Architecture

Internet

Samankaltaisia teoksia