Hadoop 2.6.0 Multi-Node cluster setup

I have assumed that is my Master Node and and are my slaves.Now ultimately we need to assign following roles. NameNode, DataNode DataNode DataNode

First Master node :

$ mkdir -p hdfs-data
$ mkdir -p hdfs-site
$ cd etc/hadoop
$ sudo gedit hdfs-site.xml

Continue reading


Hadoop 2.6.0 Single Node Setup (pseudo-distributed mode)

Hadoop is one of the most popular tool or framework for Big Data today. Usually people say that setting up Hadoop is difficult but it isn’t so. So first we need to check whether java is there or not.

$ java -version

It should show something like : java version “1.6.0_65” Java(TM) SE Runtime Environment (build 1.6.0_65-b14-466.1-11M4716) Java HotSpot(TM) 64-Bit Server VM (build 20.65-b04-466.1, mixed mode) If you don’t see something like this then you need to install java first.

$ sudo apt-get install default-jre
$ sudo apt-get install openjdk-7-jdk
$ sudo apt-get install openssh-server
$ sudo apt-get install rsync

Continue reading