27 posts with this tag
During my years of working as a data scientist, I've tried quite a number of IDEs. When I was primarily working with R, RStudio was a very nice environment to work with, but when I moved to working in Python I hadn't been able to find anything close. …
We're all keen to say goodbye to this awful year, so I imagine most of you want to make the most of your New Year's celebration. While many of us won't be going anywhere, we can at least enjoy a good bottle of sparkling wine at home. In this blog …
For the past year and a half, I have been switching gradually from using matplotlib to create graphs in Python to Hassan Kibirige's wonderful port of R's ggplot2, plotnine. When I was first starting to use this package, I found it was quite tricky to …
In the last post I talked about how to find the coefficients that give us the line of best fit for a OLS regression problem using the normal solution. The core of this approach is the equation: $$ X^TXb = X^Ty $$The way we solved this in the previous …
When I first learned least-squares linear regression in my undergrad degree, I remember that we approached it in the "calculus" way: taking the sum of the squared differences for each observation and solving a massive (and tedious) equation until we …