Université Blida 1

Combining link and content analysis for text clustering


dc.contributor.advisor Ferdjouni, Zineddine
dc.contributor.author Chikhi, Nacim Fateh (Supervisor)
dc.date.accessioned 2021-02-15T10:40:48Z
dc.date.available 2021-02-15T10:40:48Z
dc.date.issued 2013
dc.identifier.uri http://di.univ-blida.dz:8080/jspui/handle/123456789/9998
dc.description ill., bibliogr. Call number: ma-004-132 fr_FR
dc.description.abstract In many applications, huge amounts of textual data are generated continuously. The web is a typical example, in which hundreds of thousands (if not millions) of articles are published every day. To facilitate access to such huge document collections, researchers have developed various tools to organise them. Document clustering is one such technique and has recently become a very active area of research. Many document clustering algorithms have been developed, such as PLSA (Probabilistic Latent Semantic Analysis) and NMF (Non-negative Matrix Factorization). These approaches, however, use only the textual content of documents and do not exploit other information such as the links between documents. In this work we propose a new hybrid algorithm for document clustering, the Multi-view Non-negative Matrix Factorization (MNMF). MNMF takes into account not only the textual content of documents but also the link information. We show the validity of the proposed approach through experiments on real document collections. Keywords: Clustering (unsupervised classification), Text mining, Bibliometrics, Data mining, Cluster analysis, Multi-view NMF (MNMF). fr_FR
dc.language.iso en fr_FR
dc.publisher Université Blida 1 fr_FR
dc.subject Clustering (unsupervised classification) fr_FR
dc.subject Text mining fr_FR
dc.subject Bibliometrics fr_FR
dc.subject Data mining fr_FR
dc.subject Cluster analysis fr_FR
dc.subject Multi-view NMF (MNMF) fr_FR
dc.title Combining link and content analysis for text clustering fr_FR
dc.type Thesis fr_FR

