Inkfish original, do not reprint commercial nature, reproduced please indicate the source (http://blog.csdn.net/inkfish).
Hadoop is an open source cloud computing platform project under the Apache Foundation. Currently the latest version is Hadoop 0.20.1. The following is a blueprint for Hadoop 0.20.1, which describes how to install Hadoop under Ubuntu Linux 9.10. (Source: Http://blog.csdn.net/inkfish)
Supported Platforms: (Source: Http://blog.csdn.net/inkfish)
Linux can be used as a platform for development and product deployment;
Windows, which can be used as a development platform.
Pre-Required Software: (Source: Http://blog.csdn.net/inkfish)
1.javatm1.6.x, must be installed, recommended to choose the Java version issued by Sun Company;
2.ssh must be installed and guaranteed to run SSHD, Hadoop will communicate with SSH;
3. If it is windows, you need to install Cygwin to support shell commands.
Install the available modes: (Source: Http://blog.csdn.net/inkfish)
1. Local mode;
2. pseudo distribution mode;
3. Full distribution mode.
Pseudo Distribution mode installation steps:
1. Unzip the Hadoop installation package to a directory;
2. Modify conf/hadoop-env.sh file, export java_home= the line set the correct java_home location
3. Revise the conf/core-site.xml to read:
<configuration> <property> <name>fs.default.name</name> <value>hdfs://localhost :9000</value> </property> </configuration>
4. Revise the conf/hdfs-site.xml to read:
<configuration> <property> <name>dfs.replication</name> <value>1</value> </ Property> </configuration>
5. Revise the conf/mapred-site.xml to read:
<configuration> <property> <name>mapred.job.tracker</name> <value>localhost:9001 </value> </property> </configuration>
6. Set the native SSH computer without password
First Test, enter the command ssh localhost, if prompted password, you need to do the following:
$ ssh-keygen-t dsa-p '-F ~/.SSH/ID_DSA
$ cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys
7. Format Namenode:
$ bin/hadoop Namenode-format
8. Start Hadoop:
$ bin/start-all.sh
At this point, the Hadoop pseudo distribution mode is installed.
(Source: Http://blog.csdn.net/inkfish)