;NBSP;JDK's full name of the TAR package to extract 2. Configure environment variables to modify the configuration file using the Vi/etc/profile vi/etc/profile command, Add the following: Export Java_home=/java/jdk1.8.0_73export jre_home= $JAVA _home/jreexport class_home= $JAVA _home/libexport PATH= $PATH: $JAVA _home/bin use Source/etc/profile to update the profile using Java–version to see if the success is successful, such as: hadoop User Trust 1.
. MapReduce is free to select a node that includes a copy of a shard/block of dataThe input shard is a logical division, and the HDFS data block is the physical division of the input data. When they are consistent, they are highly efficient. In practice, however, there is never a complete agreement that records may cross the bounds of a block of data, and a compute node that processes a particular shard gets a fragment of the record from a block of data Hado
Original link http://blog.csdn.net/xumin07061133/article/details/8682424An experimental environment:1. Three physical machines (three hosts that can be virtual virtual machines), one of which is the primary node (Namenode) ip:192.168.30.50, and two as slave nodes (Datanode) ip:192.168.30.51/192.168.30.522. For each host installation JDK1.6 above, and set the environment variables, recommended (such as: java_home=/usr/java/java1.7.0_17), configuration
MastersHost616) Configuration Slaveshost62Host635. Configure host62 and host63 in the same way6. Format the Distributed File system/usr/local/hadoop/bin/hadoop-namenode format7. Running Hadoop1)/usr/local/hadoop/sbin/start-dfs.sh2)/usr/local/hadoop/sbin/start-yarn.sh8. Check:[Email protected] sbin]# JPS4532 ResourceMa
Hadoop pseudo-distributed Installation Steps
Log on as a root user1.1 set static IP
Right-click the icon in the upper-right corner of the centos desktop and choose modify.
Restart the NIC and run the Command Service Network restart.
Verify: Execute the command ifconfig
1.2 Modify host name
Verification: restart the machine1.3 bind the hostname and IP address
Run the command VI/etc/hosts and add a line as
This Hadoop cluster installation uses a total of four nodes, each node IP is as follows:
Master
172.22.120.191
Slave1
172.22.120.192
Slave2
172.22.120.193
Slave3
172.22.120.193
System version CentOS 6.2LJDK Version: 1.7Hadoop version: 1.1.2After completing the four-node system in
The recent time to build up a bit hadoop-2.7.3 + hbase-1.3.1 + zookeeper-3.4.8 + hive-2.3.0 fully distributed platform environment, online query a lot of relevant information, installation success, deliberately recorded down for reference.
first, software preparation
VMware12, hadoop-2.7.3, hbase-1.3.1, zookeeper-3.4.8, hive-2.3.0, jdk-8u65-linux-x64.tar.gz
Se
The Hadoop installation in this article is based on the Hortonworks RPMs installation
Documents See: Http://docs.hortonworks.com/CURRENT/index.htm
Http://www.oracle.com/technetwork/java/javase/downloads/jdk-6u31-download-1501634.html
Download Java jdk-6u31-linux-x64.bin
#Java settings
chmod U+x/home/jdk-6u31-linux-x64.bin
/home/jdk-6u31-linux-x64.bin-noregi
space)
Now, after jkd is installed, configure the environment variables below
4.3 open/etc/profile (vim/etc/profile)
Add the following content at the end:
JAVA_HOME =/usr/java/jdk1.7.0 _ 40 (the version number 1.7.40 must be modified based on the download details)
CLASSPATH =.: $ JAVA_HOME/lib. tools. jar
PATH = $ JAVA_HOME/bin: $ PATH
Export JAVA_HOMECLASSPATH PATH
4.4. source/etc/profile
4.5 verify whether the installation is successful: j
=" border-top:0px; border-right:0px; Background-image:none; border-bottom:0px; padding-top:0px; padding-left:0px; margin:0px; border-left:0px; padding-right:0px "border=" 0 "alt=" clipboard "src=" http://s3.51cto.com/wyfs02/M00/6B/F3/ Wkiol1u7nkgbqs9gaageg2yscne517.jpg "" 425 "height=" 508 "/> Specific installation links can be in reference to the steps, but there are a few points to note. Host and Slave Unified create a dedicated user to run
can be backed up), its main job is to help nn merge edits log, Reduce NN startup time SNN execution merging time fs.checkpoint.period default 3,600 seconds based on profile settings edits log size fs.checkpoint.size rules
Datanode Storage data (block) to start the DN thread will report to nn blocks information by sending the heartbeat to NN to maintain its contact (3 seconds), if nn 10 minutes did not receive the heartbeat of the DN, then think it has been lost, and copy it on the block to othe
"Pseudo" fractional installation of Hadoop 2.6 compared to "full" fractional installation, 99% of the operation is the same, the only difference is not to configure the slaves file, here are a few key configurations:(Install JDK, create user, set SSH password, set environment variable these preparations, you can refer to the
In LinuxYou first need to install the JDK and configure the appropriate environment variablesDownload the hadoop1.2.1 version by wget, if it is a production environment use 1.* version is recommended, because the 2.* version just launched not long, more unstableHttp://mirror.bit.edu.cn/apache/hadoop/common/hadoop-1.2.1/hadoop-1.2.1.tar.gzYou can use the MV to cut
physically stored:You can see that the null value is not stored, so "contents:html" with a query timestamp of T8 will return NULL, the same query timestamp is T9, and the "anchor:my.lock.ca" item also returns NULL. If no timestamp is specified, the most recent data for the specified column should be returned, and the newest values are first found in the table because they are sorted by time. Therefore, if you query "contents" without specifying a timestamp, you will return the T6 data, which ha
equivalent to Red Hat AS4.
1.2 Installation versionThis installation we choose CetOS6.0 version, the following from a few aspects of this version of the introduction.
integrates kernel-based virtualization . CentOS 6 integrates kernel-based virtualization to fully integrate the KVM hypervisor into the kernel. This feature helps CentOS 6.0 users easily migrate virtual machines between hosts and mor
The latest hadoop2.5 installation directory has been modified to make installation easier.
First install the preparation Tool
$ sudo apt-get install ssh $ sudo apt-get install rsync
Configure SSH
$ ssh localhostIf you cannot ssh to localhost without a passphrase, execute the following commands: $ ssh-keygen -t dsa -P ‘‘ -f ~/.ssh/id_dsa $ cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys
Go to ETC/
Hadoop advanced 1. Configure SSH-free (1) Modify the slaves fileSwitch to master machine, this section is all done in master.Enter the/usr/hadoop/etc/hadoop directory, locate the slaves file, and modify:slave1slave2slave3(2) Sending the public keyEnter the. SSH directory under the root directory:
Generate Public Private key
SSH-KEYGEN-T RSA
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.