Veuillez utiliser cette adresse pour citer ce document : https://di.univ-blida.dz/jspui/handle/123456789/6198
Affichage complet
Élément Dublin CoreValeurLangue
dc.contributor.authorZekrifa, Yousra-
dc.contributor.authorDiffallah, Zhor-
dc.date.accessioned2020-10-06T10:48:35Z-
dc.date.available2020-10-06T10:48:35Z-
dc.date.issued2020-
dc.identifier.urihttp://di.univ-blida.dz:8080/jspui/handle/123456789/6198-
dc.descriptionill., Bibliogr.fr_FR
dc.description.abstractAcoustic scene classification (ASC) refers to the identification of the environment in which audio excerpts have been recorded, which associates a semantic label to each audio recording. This task has drawn lots of attention during the past several years as a result of machines and electronics such as smartphones, autonomous robots, or security systems acquiring the ability to perceive sounds. This work aims to classify 10 common indoor and outdoor locations using environmental sounds. To accomplish the ensuing task, we have performed multiple experiments using a dataset which consists of 14400 sound files. The goal is to explore three different aspects of an ASC system: deep learning architecture, feature extraction technique and data augmentation method. In particular, two deep neural networks have been employed in the construction of our systems namely: Residual Neural Network (ResNet) and Alex Neural Network (AlexNet), along with a combination of feature representations based on signal processing techniques. Specifically, 3 feature sets have been extracted: Log-Mel energies, ∆Log-Mel energies and ∆∆Log-Mel energies. Moreover, this work deeply explores the use of Mixup data augmentation method and the effects of varying its hyperparameters in reducing the chance of overfitting. A series of thorough experimental comparisons and statistical tests have been performed with regards to evaluating our systems. The obtained results indicate that a proper choice of the feature set is crucial in view of the deep learning architecture. Additionally, statistical testing has shown the significant impact of mixup data augmentation technique on the predictive performance of our models, as systems trained on augmented data have a considerably better generalization ability compared to the counterpart systems trained on original data. Moreover, we have found that a well-tuned mixup hyperparameter α significantly improves the classification system performance. Keywords: Acoustic Scene Classification, Feature Extraction, Data Augmentation, Deep Learning, Mixup, Statistical Tests.fr_FR
dc.language.isoenfr_FR
dc.publisherUniversité Blida 1fr_FR
dc.subjectAcoustic Scene Classificationfr_FR
dc.subjectFeature Extractionfr_FR
dc.subjectData Augmentationfr_FR
dc.subjectDeep Learningfr_FR
dc.subjectMixupfr_FR
dc.subjectStatistical Testsfr_FR
dc.titleDeep neural networks-based systems for acoustic scene recoginitionfr_FR
dc.title.alternativeComparative study between data augmentation paradigmsfr_FR
dc.typeThesisfr_FR
Collection(s) :Mémoires de Master

Fichier(s) constituant ce document :
Fichier Description TailleFormat 
Zekrifa Yousra et Diffallah ( DEEP NEURAL NETWORKS-BASED SYSTEMS FOR.pdf2,92 MBAdobe PDFVoir/Ouvrir


Tous les documents dans DSpace sont protégés par copyright, avec tous droits réservés.