When I transitioned over to working primarily in Python from R, one of the things that I missed was ggplot2. For me, the plots in ggplot2 look so much nicer and the syntax is more intuitive compared to matplotlib. Happily, last year I discovered that …
I am finally back to blogging after a 7 month hiatus, and I promise I have a good excuse this time! In September I accepted an offer to work as a data scientist in text mining with StepStone, the largest job board in Europe. While this involved some …
One of the most basic (and most important) tasks when doing text mining is cleaning up your text. While this might seem a bit dull compared to sexy stuff like sentiment analysis and topic modelling, I hope to show you in this post that not only is …
A few months ago, I posted a blog post about a small project I did where I analysed how people felt about the New Year's resolutions they post on Twitter. In this post, we'll go through the under-the-hood details of how I carried out this analysis, …
A few months ago at work, I was fortunate enough to see some excellent presentations by a group of data scientists at Experian regarding the analytics work they do. One of the presenters gave a demonstration of some work they were doing with …