17 posts with this tag
In the last blog post, we discussed how to calculate the Manhattan and Euclidean distances from first principles. However, in that post, we did a very manual implementation for a single pair of vectors, which would not generalise well to more than …
If you had to invent a machine learning algorithm from scratch, what would be some of the ways you'd find patterns in your data? One idea that you might have come up with is to assume that data points that are "close" to each other are similar, and …
During my years of working as a data scientist, I've tried quite a number of IDEs. When I was primarily working with R, RStudio was a very nice environment to work with, but when I moved to working in Python I hadn't been able to find anything close. …
With a full coronavirus shutdown in Germany, Christmas this year in Berlin will be very quiet. I have been keeping myself entertained for the last couple of weeks by watching old seasons of The Great British Bakeoff, so between that and the mountain …
In this post, we continue our discussion about how to use AWS Sagemaker's BlazingText to train a word2vec model. In the last post we learned how to set up, train and evaluate a single model. However, we essentially selected our hyperparameters at …