Université Blida 1

Audio search engine based on joint embedding

Afficher la notice abrégée

dc.contributor.author Kadi, Abdelhakim
dc.contributor.author Kameche, A. (Promoteur)
dc.date.accessioned 2024-11-04T14:15:16Z
dc.date.available 2024-11-04T14:15:16Z
dc.date.issued 2024
dc.identifier.uri https://di.univ-blida.dz/jspui/handle/123456789/32424
dc.description ill., Bibliogr. Cote:ma-004-1019 fr_FR
dc.description.abstract Audio retrieval based on language allows users to search for audio content using natural language queries. This technology, which has gained popularity in recent years, has numerous applications in fields such as entertainment, education, and healthcare. To achieve our goal, we conducted several tests and validated our results using a phonetic subtitle dataset, converting the sentences into vectors using sBert. We extracted log mel spectrograms from the corresponding audio files. Our analysis was further deepened by applying a convolutional neural network (CNN) architecture to extract features from the log mel spectrograms. We then calculated the similarity with subtitles using the cosine metric. This research underscores the potential for enhanced audio retrieval systems, paving the way for more intuitive and effective methods for accessing audio information. Keywords: Language-based audio retrieval, natural language queries, log mel spectrogram, sBert fr_FR
dc.language.iso en fr_FR
dc.publisher Université Blida 1 fr_FR
dc.subject Language-based audio retrieval fr_FR
dc.subject natural language queries fr_FR
dc.subject log mel spectrogram fr_FR
dc.subject sBert fr_FR
dc.title Audio search engine based on joint embedding fr_FR
dc.type Thesis fr_FR


Fichier(s) constituant ce document

Ce document figure dans la(les) collection(s) suivante(s)

Afficher la notice abrégée

Chercher dans le dépôt


Recherche avancée

Parcourir

Mon compte