My Favorite Data Science Computing Stack

My favorite – if rather expensive – setup for large-scale Data Science and Machine Learning projects involves:

I have written scripts that bid for an AWS EMR cluster at EC2 spot prices and install the aforementioned software on the cluster. Please feel free to use these scripts for your own projects. :)
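
To give a rough idea of what bidding for a spot-priced EMR cluster can look like, here is a minimal sketch in Python. It is not one of the scripts mentioned above; it only builds the request that boto3's EMR `run_job_flow` call accepts. The function name, the cluster name, the release label, the instance type, the bid price and the S3 path of the install script are all placeholder assumptions, and the two IAM role names are the EMR defaults, which may differ in your account.

```python
def build_spot_cluster_request(name, release_label, instance_type,
                               core_count, bid_price_usd, bootstrap_script_s3_path):
    """Build keyword arguments for boto3's EMR client.run_job_flow().

    The master node is requested on demand so the cluster survives spot
    interruptions; the core nodes are requested on the spot market.
    """
    return {
        "Name": name,
        "ReleaseLabel": release_label,
        "Instances": {
            "InstanceGroups": [
                {
                    "Name": "master",
                    "InstanceRole": "MASTER",
                    "Market": "ON_DEMAND",
                    "InstanceType": instance_type,
                    "InstanceCount": 1,
                },
                {
                    "Name": "core-spot",
                    "InstanceRole": "CORE",
                    "Market": "SPOT",
                    # EMR expects the maximum spot price as a string, in USD per hour.
                    "BidPrice": f"{bid_price_usd:.3f}",
                    "InstanceType": instance_type,
                    "InstanceCount": core_count,
                },
            ],
            "KeepJobFlowAliveWhenNoSteps": True,
        },
        # A bootstrap action runs a script on every node at start-up,
        # which is one way to install extra software on the cluster.
        "BootstrapActions": [
            {
                "Name": "install-software",
                "ScriptBootstrapAction": {"Path": bootstrap_script_s3_path, "Args": []},
            }
        ],
        "JobFlowRole": "EMR_EC2_DefaultRole",  # default EMR instance profile (assumption)
        "ServiceRole": "EMR_DefaultRole",      # default EMR service role (assumption)
    }


# Placeholder values for illustration only.
request = build_spot_cluster_request(
    name="data-science-cluster",
    release_label="emr-6.15.0",
    instance_type="m5.xlarge",
    core_count=4,
    bid_price_usd=0.10,
    bootstrap_script_s3_path="s3://example-bucket/bootstrap/install-software.sh",
)

# To actually launch the cluster (requires boto3 and AWS credentials):
# import boto3
# emr = boto3.client("emr", region_name="us-east-1")
# response = emr.run_job_flow(**request)
# print(response["JobFlowId"])
```

Keeping the request construction separate from the API call makes it easy to inspect the bid before anything is launched and billed.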