Université Blida 1

Language-queried audio source separation


dc.contributor.author Benlaoubi, Chaima Nour el Houda
dc.contributor.author Khettal, Mounia
dc.contributor.author Ykhlef, Hadjer (Supervisor)
dc.date.accessioned 2025-12-10T13:25:42Z
dc.date.available 2025-12-10T13:25:42Z
dc.date.issued 2025
dc.identifier.uri https://di.univ-blida.dz/jspui/handle/123456789/41130
dc.description ill., bibliogr., call number: MA-004-1055 fr_FR
dc.description.abstract Language-queried audio source separation (LASS) enables on-demand extraction of sound sources using natural language queries, overcoming limitations of traditional audio source separation systems. In this work, we propose a LASS architecture that integrates two major innovations: (1) a cross-attention-driven ResUNet++ with multi-scale receptive fields (via Atrous Spatial Pyramid Pooling), channel-wise attention (Squeeze-and-Excitation blocks), and residual connections, which integrates FLAN-T5 text embeddings with audio features; and (2) cosine similarity filtering, which suppresses overly similar mixture-target pairs that might hinder training. We trained our model on mixtures derived from the Clotho dataset and evaluated it on its test set using state-of-the-art metrics. Our system achieves good separation quality, with an SDR of 2.41 and an SDRi of 8.37. Compared to current state-of-the-art models, this work presents a lightweight, efficient framework for language-queried audio source separation. Keywords: Language-queried audio source separation, Cross-Modal Attention, ResUNet++, Cosine similarity filtering, Phase-aware reconstruction, Computational efficiency. fr_FR
dc.language.iso en fr_FR
dc.publisher Université Blida 1 fr_FR
dc.subject Language-queried audio source separation fr_FR
dc.subject Cross-Modal Attention fr_FR
dc.subject ResUNet++ fr_FR
dc.subject Computational efficiency fr_FR
dc.subject Cosine similarity filtering fr_FR
dc.subject Phase-aware reconstruction fr_FR
dc.title Language-queried audio source separation fr_FR
dc.type Thesis fr_FR
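
Note on the abstract: the cosine similarity filtering and the SDR/SDRi figures it mentions can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes the similarity is computed directly on waveform samples (the thesis may use spectrograms or embeddings instead), it assumes the basic energy-ratio definition of SDR in dB, and the names `keep_pair` and `threshold=0.9` are hypothetical choices made only for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length sequences of samples."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen here for silent signals
    return dot / (norm_a * norm_b)


def keep_pair(mixture, target, threshold=0.9):
    """Hypothetical filter: keep a training pair only if the mixture is
    not already almost identical to the target (threshold is assumed)."""
    return cosine_similarity(mixture, target) < threshold


def sdr_db(reference, estimate, eps=1e-12):
    """Signal-to-distortion ratio in dB: 10*log10(|s|^2 / |s - s_hat|^2)."""
    signal = sum(r * r for r in reference)
    distortion = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    return 10.0 * math.log10((signal + eps) / (distortion + eps))


def sdr_improvement_db(reference, estimate, mixture):
    """SDRi: SDR of the estimate minus SDR of the unprocessed mixture."""
    return sdr_db(reference, estimate) - sdr_db(reference, mixture)


# Toy signals, made up for this example only.
target = [1.0, -1.0, 1.0, -1.0]
interference = [0.5, 0.5, -0.5, -0.5]
mixture = [t + i for t, i in zip(target, interference)]
estimate = [0.9 * t for t in target]

print("keep pair:", keep_pair(mixture, target))
print("SDR (dB):", round(sdr_db(target, estimate), 2))
print("SDRi (dB):", round(sdr_improvement_db(target, estimate, mixture), 2))
```

Under these assumptions, SDRi can exceed SDR whenever the unprocessed mixture has a negative SDR with respect to the target, which is consistent with the pair of values reported in the abstract.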

