1. Download mahout: http://archive.cloudera.com/cdh4/cdh/4/mahout-0.7-cdh4.6.0.tar.gz
2, extract: mahout-0.7-cdh4.5.0.tar.gz
3, renamed: Music mahout-0.7-cdh4.5.0 mahout
4. Add the environment variable/tec/profile:
Export MAHOUT_HOME =/usr/local/mahout
Export CLASSPATH =.: $ CLASSPATH: $ MAHOUT_HOME/lib
Export PATH = $ PATH: $ MAHOUT_HOME/bin
5. Verification:
5.1) download test data: wget http://archive.ics.uci.edu/ml/databases/synthetic_control/synthetic_control.data
5.2) create a Hadoop Directory: hadoop fs-mkdir testdata
5.3) Upload File: hadoop fs-put synthetic_control.data testdata
5.4), run the program: hadoop jar/usr/local/mahout/mahout-examples-0.5-job.jar org. apache. mahout. clustering. syntheticcontrol. kmeans. Job
Therefore, hadoop must be installed on the mahout server first.
Build a Hadoop environment on Ubuntu 13.04
Cluster configuration for Ubuntu 12.10 + Hadoop 1.2.1
Build a Hadoop environment on Ubuntu (standalone mode + pseudo Distribution Mode)
Configuration of Hadoop environment in Ubuntu
Detailed tutorial on creating a Hadoop environment for standalone Edition
Build a Hadoop environment (using virtual machines to build two Ubuntu systems in a Winodws environment)