Use Cloudera Manager to install Hadoop

Source: Internet
Author: User

Use Cloudera Manager to install Hadoop

Hadoop is composed of many different services (such as HDFS, Hive, HBase, Spark, and so on). These services also have some dependencies. If you directly download the original Apache package, it is troublesome to download multiple times and configure multiple times. As a result, some companies have customized Hadoop, such as Cloudera, Hortonworks, and MapR. These companies all have their own Hadoop releases, and the Cloudera releases are called CDH. Because the company's system uses CDH, CDH5 has been installed for many times in this period of time, and some practical operations have been done, and some Cloudera documents have been read.

Cloudera uses Cloudera Manager, a Hadoop management system on the Web interface, to install, configure, and monitor Hadoop-related services.

Install Cloudera Manager

The official guide is provided here in several ways. I am only talking about one of them. It is not the simplest, but it can give you a better understanding of Cloudera Manager.

The following figure shows how Cloudera Manager manages Hadoop services.

Configure the YUM source of Cloudera Manager

Add a file CM. repo under/etc/yum. repos. d/. The content is as follows.

 

[CM]name=CMbaseurl=http://archive-primary.cloudera.com/cm5/RedHat/6/x86_64/cm/5.1.3/gpgcheck=0

[CM]

Name = CM

Baseurl = http: // archive-primary.cloudera.com/cm5/redhat/6/x86_64/cm/5.1.3/

Gpgcheck = 0

Run the following command to check whether the Yum repository configuration is successful. If the yum list can list the cloudera-manager-*** packages, it indicates that the configuration is successful. If you have installed it in a lab environment and may not be able to connect to the Internet, you can use the reposync command on a machine that can connect to the Internet to clone the Internet yum source to your local machine, then a local http service is set up to serve as a yum server for the lab machine. You can also configure a proxy in the CM. repo file of the lab to connect to the Internet. Just Google the steps.

 

[root@bogon yum.repos.d]# yum clean allLoaded plugins: fastestmirror, refresh-packagekit, securityCleaning repos: CM base extras updatesCleaning up EverythingCleaning up list of fastest mirrors[root@bogon yum.repos.d]# yum list | grep clouderacloudera-manager-daemons.x86_64        5.1.3-1.cm513.p0.155.el6          @CM    cloudera-manager-server.x86_64         5.1.3-1.cm513.p0.155.el6          @CM    cloudera-manager-agent.x86_64          5.1.3-1.cm513.p0.155.el6          CM     cloudera-manager-server-db-2.x86_64    5.1.3-1.cm513.p0.155.el6          CM

[Root @ bogon yum. repos. d] # yum clean all

Loaded plugins: fastestmirror, refresh-packagekit, security

Cleaning repos: CM base extras updates

Cleaning up Everything

Cleaning up list of fastest mirrors

[Root @ bogon yum. repos. d] # yum list | grep cloudera

Cloudera-manager-daemons. x86_64 5.1.3-1. cm513.20.155. el6 @ CM

Cloudera-manager-server. x86_64 5.1.3-1. cm513.20.155. el6 @ CM

Cloudera-manager-agent. x86_64 5.1.3-1. cm513.20.155. el 6 CM

Cloudera-manager-server-db-2. x86_64 5.1.3-1. cm513.20.155. el 6 CM

Install the Cloudera Manager package. When this package is installed, the cloudera-manager-daemons package on which it depends is automatically installed.

 

yum install cloudera-manager-server

Yum install cloudera-manager-server

Check what these two packages are installed. You can see that the server basically copies some configuration files. the real website management is installed through daemons. The specific files are under the/usr/share/cmf directory, and the logs are under the/var/log/cloudera-scm-server Directory, the runtime configuration file is in/var/run/cloudera-scm-server

[root@bogon ~]# rpm -ql cloudera-manager-server/etc/cloudera-scm-server/etc/cloudera-scm-server/db.properties/etc/cloudera-scm-server/log4j.properties/etc/default/cloudera-scm-server/etc/rc.d/init.d/cloudera-scm-server/etc/security/limits.d/cloudera-scm.conf/opt/cloudera/csd/opt/cloudera/parcel-repo/usr/sbin/cmf-server/var/lib/cloudera-scm-server[root@bogon ~]# rpm -ql cloudera-manager-daemons.../usr/share/cmf/yarn-fixtures/details_1.json/usr/share/cmf/yarn-fixtures/details_2.json/usr/share/cmf/yarn-fixtures/details_3.json/usr/share/cmf/yarn-fixtures/details_4.json/usr/share/cmf/yarn-fixtures/details_5.json/usr/share/cmf/yarn-fixtures/details_6.json/usr/share/cmf/yarn-fixtures/details_7.json/usr/share/cmf/yarn-fixtures/details_8.json/usr/share/cmf/yarn-fixtures/details_9.json/var/log/cloudera-scm-server/var/run/cloudera-scm-server

[Root @ bogon ~ ] # Rpm-ql cloudera-manager-server

/Etc/cloudera-scm-server

/Etc/cloudera-scm-server/db. properties

/Etc/cloudera-scm-server/log4j. properties

/Etc/default/cloudera-scm-server

/Etc/rc. d/init. d/cloudera-scm-server

/Etc/security/limits. d/cloudera-scm. conf

/Opt/cloudera/csd

/Opt/cloudera/parcel-repo

/Usr/sbin/cmf-server

/Var/lib/cloudera-scm-server

[Root @ bogon ~ ] # Rpm-ql cloudera-manager-daemons

...

/Usr/share/cmf/yarn-fixtures/details_1. json

/Usr/share/cmf/yarn-fixtures/details_2. json

/Usr/share/cmf/yarn-fixtures/details_3. json

/Usr/share/cmf/yarn-fixtures/details_4. json

/Usr/share/cmf/yarn-fixtures/details_5. json

/Usr/share/cmf/yarn-fixtures/details_6. json

/Usr/share/cmf/yarn-fixtures/details_7. json

/Usr/share/cmf/yarn-fixtures/details_8. json

/Usr/share/cmf/yarn-fixtures/details_9. json

/Var/log/cloudera-scm-server

/Var/run/cloudera-scm-server

Install the database and the corresponding driver. This is because Cloudera Manager needs to save the configuration to the database. In addition, when Hive is installed later, the database needs to save the metadata.

 

yum install mysql-serveryum install mysql-connector-java.noarch

Yum install mysql-server

Yum install mysql-connector-java. noarch

Then configure some database-related parameters, add the database cmf, the corresponding user cmf, And the password to the database.

 

mysql> create database cmf DEFAULT CHARACTER SET utf8;Query OK, 1 row affected (0.00 sec)mysql> grant all on cmf.* TO 'cmf'@'localhost' IDENTIFIED BY '123456';Query OK, 0 rows affected (0.00 sec)

Mysql> create database cmf default character set utf8;

Query OK, 1 row affected (0.00 sec)

Mysql> grant all on cmf. * TO 'cmf' @ 'localhost' identified by '123 ';

Query OK, 0 rows affected (0.00 sec)

Modify the/etc/cloudera-scm-server/db. properties file and configure the database to which the Cloudera manager accesses, the corresponding user name and password.

 

# The database type# Currently 'mysql', 'postgresql' and 'Oracle' are valid databases.com.cloudera.cmf.db.type=mysql# The database host# If a non standard port is needed, use 'hostname:port'com.cloudera.cmf.db.host=localhost# The database namecom.cloudera.cmf.db.name=cmf# The database user<a href="http://linmingren.me/blog/wp-content/uploads/2014/09/cloudera-welcome.png"></a>com.cloudera.cmf.db.user=cmf# The database user's passwordcom.cloudera.cmf.db.password=123456

# The database type

# Currently 'mysql', 'postgresql ', and 'oracle' are valid databases.

Com. cloudera. cmf. db. type = mysql

# The database host

# If a non standard port is needed, use 'hostname: port'

Com. cloudera. cmf. db. host = localhost

# The database name

Com. cloudera. cmf. db. name = cmf

# The database user <a href = "http://linmingren.me/blog/wp-content/uploads/2014/09/cloudera-welcome.png"> </a>

Com. cloudera. cmf. db. user = cmf

# The database user's password

Com. cloudera. cmf. db. password = 123456

Now you can start the cloudera-scm-server Service. Run the netstat command to check whether port 7180 has been started (from service start to port enabling on my machine for about half a minute ), if yes, you can enter localhost: 7180 in the browser to access the Cloudera Manger web Interface (the default user password is admin/admin ), if not, see what is an exception in/var/log/cloudera-scm-server/cloudera-scm-server.log.

 

service cloudera-scm-server restart

Service cloudera-scm-server restart

This is the first time you log on to the homepage.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.