Inkfish original, do not reprint commercial nature, reproduced please indicate the source (http://blog.csdn.net/inkfish).
Hadoop is an open source cloud computing platform project under the Apache Foundation. Currently the latest version is Hadoop 0.20.1. The following is a blueprint for Hadoop 0.20.1, which describes how to install Hadoop under Ubuntu Linux 9.10. (Source: Http://blog.csdn.net/inkfish)
Supported Platforms: (Source: Http://blog.csdn.net/inkfish)
Linux can be used as a platform for development and product deployment;
Windows, which can be used as a development platform.
Pre-Required Software: (Source: Http://blog.csdn.net/inkfish)
1.javatm1.6.x, must be installed, recommended to choose the Java version issued by Sun Company;
2.ssh must be installed and guaranteed to run SSHD, Hadoop will communicate with SSH;
3. If it is windows, you need to install Cygwin to support shell commands.
Install the available modes: (Source: Http://blog.csdn.net/inkfish)
1. Local mode;
2. pseudo distribution mode;
3. Full distribution mode.
Local mode installation steps: (Source: Http://blog.csdn.net/inkfish)
1. Unzip the Hadoop installation package to a directory;
2. Modify conf/hadoop-env.sh file, export java_home= that line set the correct java_home location, of course, if the operating system has been set up java_home, you can ignore this step;
The local mode is finished, it is so simple.
Test the Operation Effect: (Source: Http://blog.csdn.net/inkfish)
$ mkdir Input
$ CP Conf/*.xml Input
$ bin/hadoop jar hadoop-*-examples.jar grep input Output ' dfs[a-z.] +'
$ cat output/*
This example program is a Hadoop, used to copy the XML file under Conf to the input directory, and to locate and display all the rows that match the regular expression of the last parameter, output is the Export folder.