Please use this identifier to cite or link to this item: http://localhost:8080/xmlui/handle/123456789/32424
Title: Audio search engine based on joint embedding
Authors: Kadi, Abdelhakim
Kameche, A. (Promoteur)
Keywords: Language-based audio retrieval
natural language queries
log mel spectrogram
sBert
Issue Date: 2024
Publisher: Université Blida 1
Abstract: Audio retrieval based on language allows users to search for audio content using natural language queries. This technology, which has gained popularity in recent years, has numerous applications in fields such as entertainment, education, and healthcare. To achieve our goal, we conducted several tests and validated our results using a phonetic subtitle dataset, converting the sentences into vectors using sBert. We extracted log mel spectrograms from the corresponding audio files. Our analysis was further deepened by applying a convolutional neural network (CNN) architecture to extract features from the log mel spectrograms. We then calculated the similarity with subtitles using the cosine metric. This research underscores the potential for enhanced audio retrieval systems, paving the way for more intuitive and effective methods for accessing audio information. Keywords: Language-based audio retrieval, natural language queries, log mel spectrogram, sBert
Description: ill., Bibliogr. Cote:ma-004-1019
URI: https://di.univ-blida.dz/jspui/handle/123456789/32424
Appears in Collections:Mémoires de Master

Files in This Item:
File Description SizeFormat 
Kadi Abdelhakim.pdf947,67 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.