CDH5 Hadoop Red Hat local repository configuration
Location of the CDH5 repository: http://archive-primary.cloudera.com/cdh5/redhat/6/x86_64/cdh/
Pointing RHEL 6 at this repo is very simple; just use: http://archive-primary.cloudera.com/cdh5/redhat/6/x86_64/cdh/cloudera-cdh5.repo
To download the repository locally, yo
Tags: HBase is a distributed, column-oriented, open-source database that originates from the Google paper "Bigtable: A Distributed Storage System for Structured Data" by Fay Chang et al. Just as Bigtable builds on the distributed storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop. HBase is a sub-project of the Apache Hadoop project. Unlike a typical relational database, HBase is suited to storing unstructured data.
Set the environment variable: export PATH=$PATH:/var/lib/pgsql:/etc/init.d
5.
Build a local CM repository
1. Download links. Select the appropriate version based on your needs. Note: the CM and CDH versions must match.
http://archive-primary.cloudera.com/cm5/repo-as-tarball
http://archive-primary.cloudera.com/cm5/installer
http://archive-primary.cloudera.com/cdh5/repo-as-tarball
I'm using version 5.2.0: http://archiv
To add a new host node to the CDH5 cluster
Step one: On the new host, install the JDK, turn off the firewall, adjust the SELinux setting, synchronize the clock with the main host via NTP, update the hosts file, configure passwordless SSH login with the main host, and make sure Perl and Python are installed.
Step two: Upload the Cloudera Manager files to the /opt directory and modify the agent configuration file: vi /opt/cm-5.0.0/etc/cloudera-scm-agent/config.i
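The agent configuration change in step two could be scripted roughly as in the sketch below. It assumes the agent configuration file contains a `server_host=` line that should point at the Cloudera Manager server; the helper name `set_server_host`, the throwaway demo file, and the hostname `cm-master.example.internal` are hypothetical and not taken from the original article.

```shell
#!/usr/bin/env bash
# Hypothetical helper: point a Cloudera Manager agent config at the CM server.
# Assumption: the file contains a line of the form "server_host=<value>".
set_server_host() {
    local config_file="$1"   # path to the agent configuration file
    local cm_host="$2"       # hostname of the Cloudera Manager server
    # Rewrite the whole server_host line; sed keeps a .bak copy of the original.
    sed -i.bak "s/^server_host=.*/server_host=${cm_host}/" "$config_file"
}

# Demo against a throwaway file, not a real agent installation.
demo_file="$(mktemp)"
printf '[General]\nserver_host=localhost\nserver_port=7182\n' > "$demo_file"
set_server_host "$demo_file" "cm-master.example.internal"
grep '^server_host=' "$demo_file"
```

On a real node you would pass the actual agent configuration file instead of the demo file, and restart the agent afterwards so that it picks up the change.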
I. Related software preparation and planning
1. Required software and download addresses:
Cloudera Manager: http://archive-primary.cloudera.com/cm5/cm/5/
CDH installation packages: http://archive.cloudera.com/cdh5/parcels/latest/
Java official download (login required): http://www.oracle.com/technetwork/java/archive-139210.html
Java version archive download (no login required): https://www.reucon.com/cdn/java/
MySQL JDBC driver JAR: http://dev.
Tip: If you are not familiar with Hadoop, see this article on the Hadoop ecosystem, which gives an overview of the usage scenarios for Hadoop and the tools in its ecosystem.
Here are the detailed steps for building a distributed Hadoop cluster environment with CDH5.
I. Hardware preparation
Basic configuration:
Operating system: 64-bit
CPU: Intel(R) i3 processor
Memory: 8.00 GB (MH
.Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 0 time(s).
INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 0 time(s).
INFO [main] org.apache.hadoop.ipc.Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8030. Already tried 0 time(s).
16. Solve the problem
Copy the Spark core package from the lib directory under the Spark directory to the local machine; it turns out to contain a yarn-default.xml file. Open the file and
find
, if there is a problem, follow the check prompts to resolve it (if you followed my pre-deployment preparation, there will be no problems).
13. Select the services to install. You can select all services or a custom set; for a test setup with little memory, you can choose Core Hadoop.
14. Set up the database and test the connection.
15. Cluster settings: these are mostly directory settings, and the defaults are fine.
16. First start. Installation is complete!
17. After the installation is complete,
Preface
When running a CDH cluster, a node's IP address or hostname will sometimes have to change for reasons beyond your control. CM's monitoring interface cannot make these changes, but CM stores the information for all hosts in the cluster in the hosts table of its PostgreSQL database, so we can make the change by modifying the hosts table.
Step 1: Stop the services
1. Stop the cluster services and the Cloudera Management Service.
2. Stop the CM service (on the CM installation node). Command: service clou
1. Parcel hash validation error: Open the CDH-5.1.0-1.cdh5.1.0.p0.53-el6.parcel.sha1 file downloaded from Cloudera with vi and delete the path that follows the hash. For example, change the original content
67fc4c86b260eeba15c339f1ec6be3b59b4ebe30 ./cdh5/parcels/5.1.0.53/CDH-5.1.0-1.cdh5.1.0.p0.53-el6.parcel
to
67fc4c86b260eeba15c339f1ec6be3b59b4ebe30
2. cloudera-scm-server dead but PID file exists: There may be a problem with the database connection; see if the database
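The hash file edit described in item 1 can also be done without vi. The sketch below keeps only the first whitespace-separated field (the hash) of the file. The helper name `strip_sha1_path` and the temporary demo file are hypothetical; the sample line is the one quoted above.

```shell
#!/usr/bin/env bash
# Hypothetical helper: keep only the hash (first field) of a checksum file.
strip_sha1_path() {
    local sha_file="$1"
    local hash
    # Read the first whitespace-separated field of the first line.
    hash="$(awk 'NR == 1 { print $1 }' "$sha_file")"
    # Overwrite the file with the bare hash.
    printf '%s\n' "$hash" > "$sha_file"
}

# Demo on a temporary copy of the line quoted in the text.
demo_sha="$(mktemp)"
echo '67fc4c86b260eeba15c339f1ec6be3b59b4ebe30 ./cdh5/parcels/5.1.0.53/CDH-5.1.0-1.cdh5.1.0.p0.53-el6.parcel' > "$demo_sha"
strip_sha1_path "$demo_sha"
cat "$demo_sha"
```

Reading the hash into a variable before writing avoids truncating the file while it is still being read.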
/cloudera* /var/cache/yum/x86_64/6/cloudera* /var/log/cloudera* /var/run/cloudera* /etc/cloudera*
3. Uninstall the installed packages:
# rpm -qa | grep cloudera
# for f in `rpm -qa | grep cloudera`; do rpm -e ${f}; done
(if any packages remain, run it again)
4. Remove the installation files:
rm -rf /var/lib/hadoop-* /var/lib/impala /var/lib/solr /var/lib/zookeeper /var/lib/hue /var/lib/oozie /var/lib/pgsql /var/lib/sqoop2 /data/dfs/ /data/impala/ /data/yarn/ /dfs/ /impa
Modify the IP address and hostname of a host node in a CDH5 cluster
Preface
When running a CDH cluster, a node's IP address or hostname will sometimes have to change for reasons beyond your control. The CM monitoring interface cannot make these changes; however, CM stores all host information in the hosts table of its PostgreSQL database,
so we can complete the operation by modifying the hosts table.
Step 1: Stop the services
1. Dis
org.apache.phoenix.query.ConnectionQueryServicesImpl.getAllTableRegions(ConnectionQueryServicesImpl.java:451)
at org.apache.phoenix.query.ConnectionQueryServicesImpl.checkClientServerCompatibility(ConnectionQueryServicesImpl.java:951)
at org.apache.phoenix.query.ConnectionQueryServicesImpl.ensureTableCreated(ConnectionQueryServicesImpl.java:877)
at org.apache.phoenix.query.ConnectionQueryServicesImpl.createTable(ConnectionQueryServicesImpl.java:1223)
Original address: http://blog.csdn.net/a921122/article/details/51939692
File Download
CDH (Cloudera's Distribution Including Apache Hadoop) is one of the many Hadoop distributions. It is built and maintained by Cloudera, is based on stable releases of Apache Hadoop, integrates many patches, and can be used directly in production environments.
Cloudera Manager simplifies the installation and configuration management of the hosts and of services such as Hadoop, Hive, and Spark in a cluster by making it easy to instal
Course address: http://www.xuetuwuyou.com/course/62
The course is from the Xuetuwuyou site: http://www.xuetuwuyou.com
I. Software versions
CentOS 6.5, VMware 10
CDH 5.2.0 (Hadoop 2.5.0)
Hive 0.13
Sqoop 1.4.5
II. After completing the
CDH5 Hadoop Red Hat local repository configuration
Location of the CDH5 repository:
http://archive-primary.cloudera.com/cdh5/redhat/6/x86_64/cdh/
Configuring RHEL 6 to point to this repo is very easy; just use:
http://archive-primary.cloudera.com/cdh5/redhat/6/x86_64/cdh/cloudera-
Applicable scenarios:
1. Application servers in a large cluster can only be reached over the intranet.
2. You want to maintain a stable local repository so that member servers are installed uniformly.
3. You want to avoid slow or unreliable access to overseas or domestic yum sources.
Server configuration:
Create the yum source configuration file for the local repository, and make sure the server can reach the public source over the network. Taking CDH as an example:
# cat /etc/yum.repos.d/cdh.repo
[cloudera-
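As an illustration of what such a configuration file can contain, here is a minimal sketch that writes a yum `.repo` file pointing at a local mirror. The section name `cloudera-cdh5-local`, the helper name `write_local_repo`, and the mirror URL are assumptions made for the example, not values from the original article; `gpgcheck=0` is used only to keep the sketch short, and you may prefer to enable signature checking.

```shell
#!/usr/bin/env bash
# Hypothetical helper: write a yum .repo file that points at a local mirror.
write_local_repo() {
    local repo_file="$1"   # target file, e.g. one under /etc/yum.repos.d/
    local base_url="$2"    # URL of the local mirror (assumed to exist)
    cat > "$repo_file" <<EOF
[cloudera-cdh5-local]
name=Cloudera CDH 5 (local mirror)
baseurl=${base_url}
enabled=1
gpgcheck=0
EOF
}

# Demo: write to a temporary file instead of /etc/yum.repos.d/.
demo_repo="$(mktemp)"
write_local_repo "$demo_repo" "http://repo.example.internal/cdh5/redhat/6/x86_64/cdh/5/"
cat "$demo_repo"
```

Member servers that receive this file resolve packages from the internal mirror and do not need access to the public network.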
inconsistent time between HBase nodes
Hadoop + ZooKeeper + HBase cluster configuration
Hadoop cluster Installation HBase lab environment setup
HBase cluster configuration based on Hadoop cluster
Hadoop installation and deployment notes-HBase full distribution mode installation
Detailed tutorial on creating HBase environment for standalone Edition
Reference documentation (Hortonworks is abbreviated as HDP; Cloudera as CDH):
1. Create a system template. Because I found the CentOS 6.5 template i