1 Installation Environment
System/Software-stated |
Version |
Download Address |
Note |
Hadoop |
0.20.2 |
http://mirror.esocc.com/apache/hadoop/common/
|
|
Mahout |
0.4 |
http://mirrors.cnnic.cn/apache/mahout/0.4/
|
|
Operating system Debain |
6.02 |
|
|
Hadoop node List
System Ming \ Name |
Hadoop node |
IP Address |
Note |
Debain7 |
Namenode |
192.168.1.111 |
|
Datanode1 |
Datanode |
192.168.1.112
|
|
Datanode2
|
Datanode
|
192.168.1.113
|
|
Datanode3
|
Datanode
|
192.168.1.114
|
|
2 Installation Steps
2.1 Decompression mahout-distribution-0.4.tar.gz?
1 |
TAR-AXVF mahout-distribution-0.4. Tar. gz |
2.2 Rename the file to Mahout?
1 |
MV mahout-distribution-0.4 Mahout |
2.3 Modify the ~/.BASHRC file to add the following lines of data
Export JAVA_HOME=/USR/LIB/JVM/JAVA-1.6.0-OPENJDK-AMD64
Export Hadoop_home=/usr/local/hadoop
Export hadoop_conf_dir=/usr/local/hadoop/conf
Export Mahout_home=/usr/local/mahout
Export mahout_conf_dir=/usr/local/mahout/conf
Export path= $PATH: $HADOOP _home_dir: $HADOOP _home/bin: $MAHOUT _home/conf: $MAHOUT _home/bin
2.4 Restart the server, or the information set above does not work
2.5 Start Hadoop to test if the mahout is installed successfully
To exit to the original path at startup, both
?
2.6 Download Mahout test data
wget Http://archive.ics.uci.edu/ml/databases/synthetic_control/synthetic_control.data
2.7 Hadoop Create test Catalog
2.8 Uploading test data
2.9 Start the Kmeans algorithm and start testing?