Université Blida 1

Contribution to a Transfer Learning Approach for a Multilabel Biomedical Text Classification

Afficher la notice abrégée

dc.contributor.author Ahmed Serir, Aymen
dc.contributor.author Hadj Ramdane, Said
dc.contributor.author Mezzi, M. ( promoteur)
dc.date.accessioned 2022-11-07T12:15:07Z
dc.date.available 2022-11-07T12:15:07Z
dc.date.issued 2022-09-28
dc.identifier.uri https://di.univ-blida.dz/jspui/handle/123456789/19955
dc.description ill., Bibliogr. Cote: ma-004-864 fr_FR
dc.description.abstract In the age of big data, textual data is more important than ever, with an everincreasing size and an abundant production of digital documents, particularly in the biomedical field as a consequence of the convergence between medical computer science and bioinformatics. In addition to the fact that these textual data are usually expressed in an unstructured form (i.e., natural language), which makes their automated processing more difficult. Moreover the rapid growth of the biomedical literature, makes the manual indexing approaches more complex, time-consuming and error-prone. Thus, automated classification is essential. Despite the many efforts, classification complete biomedical texts according to segments specific to these texts, such as their title and summary, remains a real challenge. In this thesis we investigate state of the art approaches in classifying biomedical texts so that we can compare with pre-trained models that we have tested. After performing tests on different artificial intelligence models: BioBERT, Roberta, XLNet, we found out that the ideal model for classifying biomedical texts is BioBERT with an average F1 score of 85,1% which was very similar to the roBERTa model with a score of 85% which unlike BioBERT, was not pre-trained on biomedical texts and with XLNet performing slightly worse with a score of 83%. Finally, we deployed the three above-mentioned models and developed an Online User Interface on the Hugging Face Platform in order to test and show the classification results clearly and easily. Keywords: Automatic Text Classification, Multilabel Classification, Automatic Medical Language Processing, Deep Learning. fr_FR
dc.language.iso en fr_FR
dc.publisher Université Blida 1 fr_FR
dc.subject Automatic Text Classification fr_FR
dc.subject Multilabel Classification fr_FR
dc.subject Automatic Medical Language Processing fr_FR
dc.subject Deep Learning fr_FR
dc.title Contribution to a Transfer Learning Approach for a Multilabel Biomedical Text Classification fr_FR
dc.type Thesis fr_FR


Fichier(s) constituant ce document

Ce document figure dans la(les) collection(s) suivante(s)

Afficher la notice abrégée

Chercher dans le dépôt


Recherche avancée

Parcourir

Mon compte