Hadoop 2.6.0 Multi-Node cluster setup

I have assumed that 10.0.1.1 is my Master Node and 10.0.1.2 and 10.0.1.3 are my slaves.Now ultimately we need to assign following roles.

10.0.1.1 NameNode, DataNode  
10.0.1.2 DataNode  
10.0.1.3 DataNode

First Master node : 10.0.1.1


$ cd $HADOOP_HOME
$ mkdir -p hdfs-data
$ mkdir -p hdfs-site
$ cd etc/hadoop
$ sudo gedit hdfs-site.xml

Continue reading

Hadoop 2.6.0 Single Node Setup (pseudo-distributed mode)

Hadoop is one of the most popular tool or framework for Big Data today. Usually people say that setting up Hadoop is difficult but it isn’t so. So first we need to check whether java is there or not.


$ java -version

It should show something like : java version “1.6.0_65” Java(TM) SE Runtime Environment (build 1.6.0_65-b14-466.1-11M4716) Java HotSpot(TM) 64-Bit Server VM (build 20.65-b04-466.1, mixed mode) If you don’t see something like this then you need to install java first.


$ sudo apt-get install default-jre
$ sudo apt-get install openjdk-7-jdk
$ sudo apt-get install openssh-server
$ sudo apt-get install rsync

Continue reading