About Blog Projects Talks Podcasts Tags Other work

Blog

Text cleaning in multiple languages

One of the most basic (and most important) tasks when doing text mining is cleaning up your text. While this might seem a bit dull compared to sexy stuff like sentiment analysis and topic modelling, I hope to show you in this post that not only is …

Posted on June 17, 2017 • 15 minutes read

Applying sentiment analysis with VADER and the Twitter API

A few months ago, I posted a blog post about a small project I did where I analysed how people felt about the New Year's resolutions they post on Twitter. In this post, we'll go through the under-the-hood details of how I carried out this analysis, …

Posted on April 15, 2017 • 12 minutes read

Using VADER to handle sentiment analysis with social media text

A few months ago at work, I was fortunate enough to see some excellent presentations by a group of data scientists at Experian regarding the analytics work they do. One of the presenters gave a demonstration of some work they were doing with …

Posted on April 8, 2017 • 8 minutes read

Doing hierarchical clustering with a precalculated dissimilarity index

Hierarchical clustering functionality in R is great, right? Between dist and vegdist it is possible to base your clustering on almost any method you want, from cosine to Canberra. However, what if you do want to use a different or custom method, and …

Posted on March 10, 2017 • 4 minutes read

How do we feel about New Year's resolutions (according to sentiment analysis)?

Happy New Year everyone! And given that it's a new year, it's the season for New Year's resolutions (I myself have been sadly off cake for the last week). Why do we love making New Year's resolutions so much? Well, change is hard - so hard, in fact, …

Posted on January 10, 2017 • 6 minutes read
Copyright © 2015 - 2026 Jodie Burchell   |   BY-NC 4.0