Once installed open terminal and run.
> sudo apt-get update
> sudo apt-get upgrade
> sudo apt-get install build-essential
Get some coffee beans and start brewing :D. Back to terminal.
> su
# adduser hadoop
# passwd hadoop
Into the new users account
> su hadoop
> cd ~
> wget ftp://ftp.mirrorservice.org/sites/ftp.apache.org/hadoop/common/hadoop-1.0.4/hadoop-1.0.4.tar.gz
Note: check from http://www.apache.org/dyn/closer.cgi/hadoop/common/ and see which newer version you want. Here the installation is based on stable 1.0.x.
Check if your java is installed.
> java -version
Unzip the file
> tar xvfz hadoop-1.0.4.tar.gz
This will show the location of your java installation directory
> which java
> grep JAVA ~/hadoop-1.0.4/conf/hadoop-env.sh
> nano ~/hadoop-1.0.4/conf/hadoop-env.sh
and change JAVA_HOME to below.
# The only required environment variable is JAVA_HOME. All others are
# set JAVA_HOME in this file, so that it is correctly defined on
# export JAVA_HOME=/usr/lib/j2sdk1.5-sun
export JAVA_HOME=/usr/
# note that this should be /usr/bin/java , but it seems that /usr/ works on linux mint??
> cd ~/hadoop-1.0.4
> mkdir input
> cp conf/*.xml input
To test..
> bin/hadoop jar hadoop-examples-*.jar grep input output ‘dfs[a-z.]+’
> ls -l output
and this will give you some output files.!
Reference: http://www.thegeekstuff.com/2012/02/hadoop-standalone-installation/