Alex's Hadoop Beginner Tutorial: Lesson 7, Sqoop2 Import Tutorial
For details about installation and JDBC driver preparation, refer to Lesson 6. Here I use an example to explain how to use Sqoop2.
Data Preparation
There is a MySQL table named worker that contains three rows. We want to import it to
your cluster. Installing a Hadoop cluster typically involves unpacking the software on all the machines in the cluster; for details, refer to the previous section, "Installation configuration on Apache Hadoop single node."
Typically, one machine in the cluster is designated as the NameNode and another machine as the ResourceManager. These are the masters. Other services, such as the Web Application Proxy Server a
/mapreduce/hadoop-mapreduce-examples-2.7.3.jar grep input output 'dfs[a-z.]+'
(7) View the output files
Copy the output files from the distributed file system to the local file system and examine them:
$ bin/hdfs dfs -get output output
$ cat output/*
Alternatively, view the output files on the distributed file system:
$ bin/hdfs dfs -cat output/*
(8) When you have completed all the steps, stop the daemons:
$ sbin/stop-dfs.sh
To learn more, continue reading the next cha
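The grep example job above counts the strings in the input files that match the regular expression dfs[a-z.]+. As a rough local illustration of what that pattern matches, here is a minimal Java sketch. It does not use Hadoop at all; the class name GrepSketch, the method countMatches, and the sample text are hypothetical and stand in for the real input files.

```java
import java.util.Map;
import java.util.TreeMap;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class GrepSketch {
    // Counts how often each distinct match of the regex occurs in the text.
    // This mimics, on a single string, the kind of per-match tally the
    // example job produces; it is not the Hadoop implementation.
    static Map<String, Integer> countMatches(String regex, String text) {
        Map<String, Integer> counts = new TreeMap<>();
        Matcher m = Pattern.compile(regex).matcher(text);
        while (m.find()) {
            counts.merge(m.group(), 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        // Hypothetical sample text standing in for the files under "input".
        String text = "dfs.replication = 1\n"
                    + "dfs.namenode.name.dir = /tmp\n"
                    + "dfs.replication again";
        countMatches("dfs[a-z.]+", text)
            .forEach((match, n) -> System.out.println(n + "\t" + match));
    }
}
```

Each match starts with "dfs" and continues over lowercase letters and dots, so property names such as dfs.replication are picked up whole and the match stops at the first space.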
configuration file are:
Run the ":wq" command to save and exit.
Through the above configuration, we have completed the simplest pseudo-distributed configuration.
Next, format the Hadoop NameNode:
Enter "Y" to complete the formatting process:
Start hadoop!
Start hadoop as follows:
Use the jps command that comes with Java to list all daemon processes:
Star
Statement
This article is based on CentOS 6.x + CDH 5.x
What is ZooKeeper used for? If you look back at the previous tutorials, you will find that ZooKeeper appears many times: Hadoop's automatic failover uses ZooKeeper, and HBase RegionServers need ZooKeeper as well. In fact, it is not only Hadoop; Storm, which has recently gained some fame, also uses ZooKeeper. So what exactly
Baidu Network Disk: http://pan.baidu.com/s/1hqrER6s
I mentioned the CBT Nuggets Hadoop video tutorial last time. Half a month later, I took the time to upload the videos to Baidu Network Disk. There are 20 lessons in total, covering concept introduction, installation, and surrounding projects; it is basically a rare find:
01
Hadoop Big Data zero-basics high-end practical training series with a text mining project
This big data Hadoop video tutorial starts from basic Java syntax, databases, and Linux, goes deep into all the knowledge required for Hadoop big data technology, and covers all common components in the
DistributedCache can be used to distribute jar packages and native shared libraries used by map or reduce tasks. The child JVM always has its current working directory added to java.library.path and LD_LIBRARY_PATH, so the cached libraries can be loaded through System.loadLibrary or System.load. For more information about using the distributed cache to load shared libraries, see Loading native libraries through DistributedCache.
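To make the difference between the two loading calls concrete, here is a minimal Java sketch using only the standard library. It shows the search path and the platform-specific file name that System.loadLibrary would look for, while System.load takes an absolute file path instead. The class name NativeLoadSketch and the library name "mynative" are hypothetical, and no DistributedCache setup is shown.

```java
public class NativeLoadSketch {
    // Returns the platform-specific file name that System.loadLibrary
    // searches for (for example, a "lib" prefix and ".so" suffix on Linux).
    static String fileNameFor(String libName) {
        return System.mapLibraryName(libName);
    }

    public static void main(String[] args) {
        String lib = "mynative"; // hypothetical library name

        // Directories searched by System.loadLibrary.
        System.out.println("java.library.path = "
                + System.getProperty("java.library.path"));
        System.out.println("loadLibrary looks for: " + fileNameFor(lib));

        try {
            // Resolves the library by name against java.library.path.
            // System.load(String) would instead require an absolute path.
            System.loadLibrary(lib);
            System.out.println("loaded " + lib);
        } catch (UnsatisfiedLinkError e) {
            // Expected here unless a library with this name is really present.
            System.out.println("could not load " + lib + ": " + e.getMessage());
        }
    }
}
```

Because the task's working directory is on the search path, a library placed there by the cache can be found by name alone.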
Related Articles
Hadoop
Copy an object
The content of the copied "input" folder is as follows:
It is the same as the content of the "conf" directory under the Hadoop installation directory.
Now, run the wordcount program in the pseudo-distributed mode we just built:
After the job completes, check the output:
Some of the statistics are as follows:
At this point, if we go to the Hadoop web console, we find that we have submit
There are many tutorials on the web about compiling a 64-bit Hadoop 2.4 package. The process for the latest version, 2.7.2, is almost the same, so I retell it here for everyone.
Here are two fairly authoritative links:
Recommended reference for Ubuntu users: http://www.aboutyun.com/forum.php?mod=viewthread&tid=8130&extra=page%3D1&page=1
Reference for CentOS users: http://www.cnblogs.com/hadoop2015/p/4259899.html
1. Preliminary tool preparation:
Hadoop 2.2.0 pseudo-distributed and fully distributed installation (CentOS 6.4)
The environment is CentOS 6.4 (32-bit) and Hadoop 2.2.0
Pseudo-distributed documentation: http://pan.baidu.com/s/1kTrAcWB
Fully distributed documentation: http://pan.baidu.com/s/1hqIeBGw
It is somewhat different from 0.x and 1.x, especially YARN.
There was one incident along the way. When configuring YARN in full
import --connect jdbc:mysql://localhost:3306/sqoop_test --username root --password root --table employee --hive-import --hive-table hive_employee --create-hive-table
Warning: /usr/lib/sqoop/../hive-hcatalog does not exist! HCatalog jobs will fail.
Please set $HCAT_HOME to the root of your HCatalog installation.
Warning: /usr/lib/sqoop/../accumulo does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME to the root of your Accumulo installation.
......
14/12/02 15:12:13 INFO H
records.
Note
There is a line in this log:
14/12/05 08:49:46 INFO mapreduce.Job: The url to track the job: http://hadoop01:8088/proxy/application_1406097234796_0037/
This means you can open this address in a browser to follow the progress of the job. If your job hangs for a long time without finishing, something has gone wrong, and you can go to this address to see the detailed error log.
View Results
mysql> SELECT * FROM employee;
+--------+----+-------+
| Rowkey | ID | Name |
+--------+----+------
, it is best to first study the North wind courses "Greenplum Distributed Database Development: Introduction to Mastery", "Comprehensive In-Depth Greenplum Hadoop Big Data Analysis Platform", "Hadoop 2.0 and YARN in Layman's Terms", "MapReduce, HBase Advanced Ascension", and "MapReduce, HBase Advanced Promotion".
Course Outline
Mahout Data Mining Tools (10 hours)
Data mining concepts and system composition
Common methods and algorithms for data mining (regression analysis, classific
Statement
This article is based on CentOS 6.x + CDH 5.x
In this example, HBase is installed in cluster mode
This article is based on Maven 3.5+ and Eclipse 4.3
After the tutorial, we must look at the following
We did not set up HBase just to inspect data with the shell. We are writing HBase-based applications, so learning how to call HBase from Java is a required course.
Setting up the project
Open Eclipse and create a Maven pr