Today, when we set up the hadoop environment configuration, We need to log on via SSH without a password. It took a lot of effort and finally got it done.
First, different Linux operating systems may have slightly different commands. my operating system is Ubuntu, So I recorded what I did.
1. hadoop02 @ ubuntuserver2:/root $ ssh-keygen-t rsa command. When the result is displayed, I press enter until the end
Document directory
1.1 original Article Source
1.2 unzip and install JDK
1.3 environment variables to be configured
1.4 how to configure Environment Variables
1.5 test JDK
1.6 uninstall JDK
2.1 original Article Source
2.2 Preface
2.3 confirm that the system has installed the OpenSSH server and client
2.4 check the local sshd configuration file (Root)
2.5 If the configuration file is modified, r
set Password=password ("test") where user= ' root ';mysql> flush privileges; Mysql> exit; Under start of the Hadoop service, in which hive:To start the Hive service:Start hive service runs in the background:Hive--service hiveserver2 Then start the Hive client:HiveIf the entry into the hive> shell proves that the start was successful;First create the table:hive> CREATE EXTERNAL TABLE MYTEST (num INT, name STRING) > ROW FORMAT delimited fields T
Continuation of the article
Name
Value
Description
Hadoop.job.history.location
Job history File Save path, no configurable parameters, and do not write in the configuration file, default in the Logs folder.
Hadoop.job.history.user.location
User History File storage location
Io.sort.factor
30
Here we deal with the number of file sorts when the stream is merged, and I und
Core-site.xml (Namenode process configuration file)Hdfs-site.xml (Secondarynamenode process configuration file)Mapred-site.xml (Jobtracker process configuration file)Hadoop 1.2.1 Configuration
Author: those things |ArticleCan be reproduced. Please mark the original source and author information in the form of a hyperlink
Web: http://www.cnblogs.com/panfeng412/archive/2013/03/22/hadoop-capacity-scheduler-configuration.html
Refer to capacity scheduler guide and summarize the configuration parameters of capacity scheduler based on your practical experience. Most of the parts marked as red below
Database xhkdb;4. Connect to the databaseCommand: Use For example: If the XHKDB database exists, try to access it:mysql> use XHKDB;Screen tip: Database changed5. View the database currently in useMysql> Select Database ();6. The current database contains table information:Mysql> Show tables; (Note: There is a last s)This article is from the "If you bloom, the breeze came in" blog, please be sure to keep this source http://iqdutao.blog.51cto.com/2597934/1766879 The five chapters of
Resolution of SSH password-less login configuration error in Hadoop cluster setup some netizens said that firewall should be disabled before ssh is configured. I did it, but it should be okay to close it. Run the sudoufwdisable command to disable the firewall. then enter www.2cto. comssh-keygen on the terminal and parse the SSH password-less logon configuration e
divided (the above explanation does not find the relevant document, nor the original) Example1output output (keys) because-D stream.num.map.output.key.fields=4Specify the first 4 output lines of the map as key, followed by value11.12.1.2 11.14.2.3 11.11.4.1 11.12.1.1 11.14.2.2divided into 3 reducer (the first 2 fields as the keys of partition)11.11.4.1-----------11.12.1.2 11.12.1.1-----------11.14.2.3 11.14.2.2The reducer is sorted within each division (4 fields are used for sorting at the same
size of the. ns file for the new database, in MB# specify. ns file size for new databases.# nssize = # Accout token for Mongo monitoring server.#mms-token = # MONGO the name of the monitoring server# server name for Mongo monitoring server.#mms-name = # MONGO The ping interval of the monitoring server# Ping interval for Mongo monitoring server.#mms-interval = # Replication Options Copy option# in replicated MONGO databases, specify here whether the is a slave or master in replication, specifies
1) Turn off disable firewall:/etc/init.d/iptables status will get a series of messages stating that the firewall is open./etc/rc.d/init.d/iptables Stop shutting down the firewall2) Disable SELinux:To view the SELinux status:1,/usr/sbin/sestatus-v # #如果SELinux The status parameter is enabled is turned onSELinux status:enabled2. Getenforce # #也可以用这个命令检查To turn off SELinux:1, temporarily shut down (do not restart the machine):Setenforce 0 # #设置SELinux become permissive mode# #setenforce 1 set SELin
Conf/mapred-site.xml
Summary: The main configuration Jobtracker address,scheduler,queue and so on.
1. Configure Jobtracker (must be set)
2. There are other configurable items
See Hadoop-0.21.0/mapred/src/java/mapred-default.xml in detail, such as
(1) Set up Job scheduler
(2) Job queue
Mapreduce.jobtracker.system.dir
Mapreduce.cluster.local.dir
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.