One of the most basic (and most important) tasks when doing text mining is cleaning up your text. While this might seem a bit dull compared to sexy stuff like sentiment analysis and topic modelling, I hope to show you in this post that not only is …
A few months ago, I posted a blog post about a small project I did where I analysed how people felt about the New Year's resolutions they post on Twitter. In this post, we'll go through the under-the-hood details of how I carried out this analysis, …
A few months ago at work, I was fortunate enough to see some excellent presentations by a group of data scientists at Experian regarding the analytics work they do. One of the presenters gave a demonstration of some work they were doing with …
Hierarchical clustering functionality in R is great, right? Between dist and vegdist it is possible to base your clustering on almost any method you want, from cosine to Canberra. However, what if you do want to use a different or custom method, and …
Happy New Year everyone! And given that it's a new year, it's the season for New Year's resolutions (I myself have been sadly off cake for the last week). Why do we love making New Year's resolutions so much? Well, change is hard - so hard, in fact, …