Veuillez utiliser cette adresse pour citer ce document :
https://di.univ-blida.dz/jspui/handle/123456789/25250
Affichage complet
Élément Dublin Core | Valeur | Langue |
---|---|---|
dc.contributor.author | Boucif, Miyyada | - |
dc.contributor.author | Rahim, Ikram | - |
dc.contributor.author | Ouahrani, L. ( Promotrice) | - |
dc.date.accessioned | 2023-10-04T13:31:51Z | - |
dc.date.available | 2023-10-04T13:31:51Z | - |
dc.date.issued | 2023-07 | - |
dc.identifier.uri | https://di.univ-blida.dz/jspui/handle/123456789/25250 | - |
dc.description | ill., Bibliogr. Cote:ma-004-950 | fr_FR |
dc.description.abstract | In recent years, researchers have focused on developing and training visual question generation models that based on deep neural networks. these models have a wide range of applications in various domains, However, there have been no specialized works conducted on visual question generation in the Arabic language. Our work aims to automate the process of generating Arabic educational questions from visual content. We propose a visual Arabic question generation multi-modal, which integrates two distinct models. The first model is a fine-tuned Arabic image captioning model, obtained by fine-tuning the Google Vision transformer and AraBert transformer using a new collected dataset. The second model is an Arabic natural question generation fine-tuned model. Our proposed multi-model has been evaluated using the Transparent Human benchmark protocol, and the results demonstrate its ability to generate relevant captions. 51% of the captions received a rating between 2 to 4 out of 5 on the scale, indicating their relevance. Additionally, the model produced relevant questions based on these captions, achieving an average rating of 3.33 out of 5 in term of relevance. Keywords: Visual question generation, Arabic image captioning, Transformers, Vision transformer, deep learning. | fr_FR |
dc.language.iso | en | fr_FR |
dc.publisher | Université Blida 1 | fr_FR |
dc.subject | Visual question generation | fr_FR |
dc.subject | Arabic image captioning | fr_FR |
dc.subject | Transformers | fr_FR |
dc.subject | Vision transformer | fr_FR |
dc.subject | deep learning | fr_FR |
dc.title | Towards Visual Question Generation System | fr_FR |
dc.type | Thesis | fr_FR |
Collection(s) : | Mémoires de Master |
Fichier(s) constituant ce document :
Fichier | Description | Taille | Format | |
---|---|---|---|---|
Boucif Miyyada et Rahim Ikram.pdf | 13,16 MB | Adobe PDF | Voir/Ouvrir |
Tous les documents dans DSpace sont protégés par copyright, avec tous droits réservés.