Tag Archives: cloudera

Cloudera CDH in a box (setup steps) – Part 2

In the previous blog, we saw a screencast on how to start a small Cloudera CDH cluster and then run a simple MapReduce program on a small data set. In this blog post, we will look into the sequence of steps at a high level to setup the configuration. The steps are far from complete, but is good enough with lot of references to get started with.Cluster-on-Laptop Continue reading Cloudera CDH in a box (setup steps) – Part 2

Cloudera CDH in a box (screencast) – Part 1

With the laptops getting more powerful and accessible, it’s now possible to do much more interesting things which we could not have thought about in the past.

A four node Cloudera CDH cluster has been setup on a Lenevo Z510 as shown below. Cloudera Manager is used for the purpose of making the management of different nodes and services in the cluster easier. With just a few clicks in the Cloudera Manager, it should be possible to add a new service, add a node etc. On the Gateway node, none of the services run, but is usually used by the end user to access the different services in the cluster.Cluster-on-Laptop Continue reading Cloudera CDH in a box (screencast) – Part 1