you're reading...

Data Layer is the hardest

Downloaded a bunch of data and wanted to start already.  Then realised I faced the noob issue of not having the right infrastructure or tools set up.  R and Rstudio are on my local machine, but the infrastructure to handle the large dataset is not.

Downloaded the cloudera virtual machine, it kept crashing for some reason when I try to add an additional CD/DVD drive.  Finally decided to try Revolution Analytics’ Enterprise R (since I can use academic edition), although there might be some duplication – I already have the latest R & Rstudio installed, Revolution has a slightly older one in the installation for now and has their own gui.  There’s a whole lot of installation needed too.  I would have preferred a cleaner virtualised solution, but there’s no harm trying really.

Waiting now for the MS Visual Studio 2008 Shell SP1 installation….*twiddle thumb*



No comments yet.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s


Exploring and venting about quantitative issues

The Stone and the Shell

Using large digital libraries to advance literary history

Hi. I'm Hilary Mason.

Zoom out, zoom in, zoom out.

statMethods blog

A Quick-R Companion

the Tarzan

[R] + applied economics.

4D Pie Charts

Scientific computing, data viz and general geekery, with examples in R and MATLAB.