Running Rodeo within a virtualenv
Note: This post was updated on 11/12/2016 to reflect changes in how you change your PYTHONPATH in Rodeo.
I recently discovered the lovely Rodeo by yhat, an IDE for Python that is focused on data science. It was originally released as an in-browser web application, but the development team released it as a desktop application in October last year. I am a huge fan of RStudio, and from what I have seen so far, Rodeo may fill the same role for me on the Python side of things.
There was a small hiccup when it came to setting up Rodeo, however. As you might know, I am a devotee of virtual environments (or virtualenvs) in Python, and it is not immediately obvious how to run Rodeo within one. I therefore thought I would write this quick tutorial on how to set up Rodeo using a virtualenv.
Creating the virtualenv (Python 2.7 version)
vf new rodeo
(See this blog post for more information about setting up virtualenvs.) Now that we’re in our new virtualenv called “rodeo”, we can install the required packages. Rodeo requires Jupyter and matplotlib as a minimum to run, so let’s install them using pip.
pip install jupyter
pip install matplotlib
pip install numpy
pip install pandas
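Before opening Rodeo, it can be worth confirming that the installs landed in the virtualenv. The snippet below is a minimal sketch rather than anything Rodeo itself provides: the helper name missing_packages is hypothetical, and it assumes you run it with the virtualenv's own python (for example, by typing python at the command line while the virtualenv is active).

```python
import importlib

# The four packages installed above.
REQUIRED = ["jupyter", "matplotlib", "numpy", "pandas"]


def missing_packages(names):
    """Return the names that cannot be imported by the current interpreter."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


missing = missing_packages(REQUIRED)
if missing:
    print("Missing from this environment: " + ", ".join(missing))
else:
    print("All required packages are importable.")
```

If anything is reported as missing, the most likely cause is that pip was run outside the virtualenv.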
Now we’re ready to open Rodeo!
Setting up Rodeo
The first step to setting up Rodeo with a virtualenv is to get the PYTHONPATH of the virtualenv, which here means the path to the virtualenv’s Python interpreter. This is very straightforward: all you need to do is type
which python
at the command line while you are in your virtualenv. You should get something that looks like the below:
(Note that this path will be a bit different if you are not working in Mac OSX.)
Now let’s get Rodeo to use our virtualenv. If you haven’t yet installed Rodeo, it can be downloaded from here. Once that is done, we can get on with changing its PYTHONPATH. To do so, we first need to go to Rodeo > Preferences in the Rodeo menu. Under the ‘Python’ tab, you will find an option called ‘Python Command’. This is where we change the PYTHONPATH. Simply paste the path returned by which python into this field, click ‘OK’, and you’re good to go!
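One way to confirm that Rodeo picked up the change is to ask Python which interpreter it is running under. This is a small sketch using only the standard library; the function name interpreter_info is made up for illustration. Run inside Rodeo, the executable path it prints should match the one you pasted into ‘Python Command’.

```python
import sys


def interpreter_info():
    """Return the interpreter path and environment root for this session."""
    return {
        # Path of the python binary actually running this code.
        "executable": sys.executable,
        # Root directory of the environment that binary belongs to.
        "prefix": sys.prefix,
    }


info = interpreter_info()
print("Interpreter: %s" % info["executable"])
print("Environment: %s" % info["prefix"])
```

If the path points at your system Python instead, the setting in ‘Python Command’ has probably not taken effect yet.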
You can check your setup has worked by running this example I got from Wes McKinney’s excellent Python for Data Analysis.
import numpy as np
from pandas import Series, DataFrame
import pandas as pd
import matplotlib.pyplot as plt

df = DataFrame(np.random.randn(10, 4).cumsum(0),
               columns=['A', 'B', 'C', 'D'],
               index=np.arange(0, 100, 10))
df.plot()
You should get something like the screenshot below:
What about Python 3?
You can easily switch Rodeo over to using Python 3 by creating a new virtualenv. In order to tell virtualfish that you want it to use Python 3, you simply use:
vf new -p python3 rodeo
We then install all of the required packages as above, and get our PYTHONPATH using which python. In order to switch Rodeo’s PYTHONPATH over from our old virtualenv, simply replace the path in ‘Python Command’ as detailed above.
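To check that the switch took effect, you can print the major version of the interpreter from inside Rodeo. This is a minimal sketch; python_major_version is a hypothetical helper name, and it should report 3 once Rodeo is pointed at the new virtualenv.

```python
import sys


def python_major_version():
    """Return the major version (2 or 3) of the interpreter running this code."""
    return sys.version_info[0]


print("Running Python %d" % python_major_version())
```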
Now you’re ready to go! Happy analysing :)