Please use this address to cite this document: https://di.univ-blida.dz/jspui/handle/123456789/19955
Title: Contribution to a Transfer Learning Approach for a Multilabel Biomedical Text Classification
Author(s): Ahmed Serir, Aymen
Hadj Ramdane, Said
Mezzi, M. (supervisor)
Keywords: Automatic Text Classification
Multilabel Classification
Automatic Medical Language Processing
Deep Learning
Publication date: 28-Sep-2022
Publisher: Université Blida 1
Abstract: In the age of big data, textual data is more important than ever. Its volume keeps growing and digital documents are produced in abundance, particularly in the biomedical field, as a consequence of the convergence between medical computer science and bioinformatics. These textual data are usually expressed in an unstructured form (i.e., natural language), which makes their automated processing more difficult. Moreover, the rapid growth of the biomedical literature makes manual indexing approaches more complex, time-consuming and error-prone. Automated classification is therefore essential. Despite many efforts, classifying complete biomedical texts according to segments specific to these texts, such as their title and summary, remains a real challenge. In this thesis, we investigate state-of-the-art approaches to classifying biomedical texts so that we can compare them with the pre-trained models we tested. After testing three pre-trained models (BioBERT, RoBERTa and XLNet), we found that the best-performing model for classifying biomedical texts is BioBERT, with an average F1 score of 85.1%. RoBERTa came very close with a score of 85%, even though, unlike BioBERT, it was not pre-trained on biomedical texts. XLNet performed slightly worse, with a score of 83%. Finally, we deployed the three models and developed an online user interface on the Hugging Face platform to test and display the classification results clearly and easily.
Description: ill., bibliogr. Call number: ma-004-864
URI/URL: https://di.univ-blida.dz/jspui/handle/123456789/19955
Collection(s): Master's Theses (Mémoires de Master)

File(s) in this document:
File: Ahmed Serir Aymen et Hadj Ramdane Said.pdf
Size: 4.11 MB
Format: Adobe PDF


All documents in DSpace are protected by copyright, with all rights reserved.