Skip to content

DaniGate/for-fun

Repository files navigation

Content

A collection of different data science projects and tools and also some Python functions and classes that I programmed for fun:

  • POI_identifier.ipynb: A classification algorithm that predict which Enron's executive are persons of interest (POI) and should be further investigated for possible fraud activities. I use some public information about their contract conditions like salary or bonus, the list of prosecuted Enron executives as positive examples and also a few features extracted from the public Enron email dataset.
  • Magic Star: an algorithm to solve a 6-points magic star and print it on screen. Solutions to the 7- and 8-pointed magic stars are still under development.
  • Top words: implementation of a simple TF-IDF analysis to extract the most common words from a text.
  • Top words in press: Finding top words in presidential candidate Pablo Iglesias' articles on elpais.com
  • Minuto decisivo: Finding most used words by each politician during their final speech at the end of the 12-7-2015 presidential debate. First, the speeches were transcribed by Google Voice-to-Text software. Then, the most common words for each of them were found using top_words function (also in this repository). Finally, an infographic was created and published via Twitter.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published