Résumé:
In recent years, cyber criminals have successfully invaded many important information
systems by using phishing mail, causing huge losses. The detection of phishing mail from big email
data has been paid public attention. However, the camouflage technology of phishing mail is becoming
more
and
more
complex,
and
the
existing
detection
methods
are
unable
to
confront
with
the
increasingly
complex
deception
methods
and
the
growing
number
of
emails.
In
this
paper
we
transformed
the
probleme from classification of emails into similarity detection between two emails in
order to classify them into 4 mains classes “ Normal, Harrassment , suspicious and fraudulent” to
solve this probleme we used Siamese Neural networks which gave us an accuracy of 95.13%.
Key words : NLP, Siamese network, deep learning, phishing attacks, phishing detection, similarity learning.