11 posts with this tag
Note: This post was updated on 11/12/2016 to reflect changes in how you change your PYTHONPATH in Rodeo. I recently discovered the lovely Rodeo by yhat, an IDE for Python that is focused on data science. It was originally released as an in-browser …
If you followed along with my analysis of the top Christmas movies according to the MovieLens 10M dataset, you would remember I obtained the list of Christmas movies by scraping this page. I am fairly new to web scraping and this is one of my first …
I recently needed to install the XLConnect package in R, which has the dependency rJava. Unfortunately rJava is one of those R packages which is a real pain to install and required a bit of fiddling. I'll talk you through how I installed it in OS X …
When I first seriously decided to learn programming about three years ago, I picked up Python using Zed Shaw's Learn Python the Hard Way. While an excellent introduction to programming for someone who had previously only cobbled together SPSS syntax, …
Choosing the right linear regression model for your data can be an overwhelming venture, especially when you have a large number of available predictors. Luckily R has a wide array of in-built and user-written tools to make this process easier. In …