In this project, I had two datasets:
- Financial Dataset (Employees income)
- The Famous Enron Email Dataset
I have explored different machine learning algorithms to see how can we identify those who were involved in the Enron fraud in the late 1990's from their salary and stocks share.
The other part of the project was a natural language processing, trying to identify the persons of interest from their emails.