Posts

  • Installation and hello world of rHadoop

    Recently I needed to install and configure a local version of rHadoop. I ended up using the Revolution Analytics rHadoop packages, running inside RStudio Server on a CentOS-based Cloudera QuickStart VM. Here are instructions based on a couple of sites: ashokharnal’s tutorial and Imre Kocsis’s tutorial. The exact instructions...
  • Code for final Kaggle model.

    This is (more or less) the code that got me my best answer. After reading through some other people’s code, I realize I should have looked more carefully at the time variables and figured out the times, etc. Given I got 0.92341 compared to the winner’s 0.94254, it’s not super important,...
  • Reflections on the Human or Robot? Kaggle challenge.

    Here’s a link to the final standings. This is the first Kaggle challenge I put a good amount of effort into, and I’m pretty happy with the outcome. I ended up finishing 187/985 with a score of 0.92341, very close to the winning score of 0.94254. Happy to be in...
  • Kaggle data challenge post 1.

    This is the first post of an occasional series on R, data science, Kaggle and my quest to make it through the book “The Elements of Statistical Learning”. For a first post, here is a write-up of my current Kaggle problem. This knitr is an archived version - it...
