28 posts with this tag
If you had to invent a machine learning algorithm from scratch, what would be some of the ways you'd find patterns in your data? One idea that you might have come up with is to assume that data points that are "close" to each other are similar, and …
During my years of working as a data scientist, I've tried quite a number of IDEs. When I was primarily working with R, RStudio was a very nice environment to work with, but when I moved to working in Python I hadn't been able to find anything close. …
We're all keen to say goodbye to this awful year, so I imagine most of you want to make the most of your New Year's celebration. While many of us won't be going anywhere, we can at least enjoy a good bottle of sparkling wine at home. In this blog …
For the past year and a half, I have been switching gradually from using matplotlib to create graphs in Python to Hassan Kibirige's wonderful port of R's ggplot2, plotnine. When I was first starting to use this package, I found it was quite tricky to …
In the last post I talked about how to find the coefficients that give us the line of best fit for an OLS regression problem using the normal solution. The core of this approach is the equation: $$ X^TXb = X^Ty $$ The way we solved this in the previous …
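As a rough illustration of the equation in that last excerpt, here is a minimal NumPy sketch that solves $X^TXb = X^Ty$ for $b$ directly. The toy data and variable names are assumptions made up for this example, and calling `np.linalg.solve` is just one way to solve the system, not necessarily the approach the post takes.

```python
import numpy as np

# Hypothetical toy data generated from y = 1 + 2x with no noise.
x = np.array([0.0, 1.0, 2.0, 3.0])
y = 1.0 + 2.0 * x

# Design matrix: a column of ones for the intercept, then the predictor.
X = np.column_stack([np.ones_like(x), x])

# Solve the linear system (X^T X) b = X^T y for the coefficient vector b.
b = np.linalg.solve(X.T @ X, X.T @ y)

print(b)  # intercept first, then slope
```

Because the toy data are noise-free, the recovered coefficients should match the intercept and slope used to generate them, up to floating-point error.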