Downloaded a bunch of data and wanted to start already. Then realised I faced the noob issue of not having the right infrastructure or tools set up. R and Rstudio are on my local machine, but the infrastructure to handle the large dataset is not.
Downloaded the cloudera virtual machine, it kept crashing for some reason when I try to add an additional CD/DVD drive. Finally decided to try Revolution Analytics’ Enterprise R (since I can use academic edition), although there might be some duplication – I already have the latest R & Rstudio installed, Revolution has a slightly older one in the installation for now and has their own gui. There’s a whole lot of installation needed too. I would have preferred a cleaner virtualised solution, but there’s no harm trying really.
Waiting now for the MS Visual Studio 2008 Shell SP1 installation….*twiddle thumb*