Course page: http://www.xuetuwuyou.com/course/62
This course is offered by Xuetuwuyou: http://www.xuetuwuyou.com
I. Software versions
CentOS 6.5, VMware 10
CDH5.2.0 (Hadoop 2.5.0)
Hive 0.13
Sqoop 1.4.5
II. After completing the course, you can:
① Single-handedly build and operate an enterprise Hadoop platform, with efficient interface-based operation and monitoring;
② Master MapReduce programming;
③ Master Hive;
④ Master Sqoop;
⑤ Independently handle offline analysis and statistics work on the Hadoop platform, and become a high-end technical professional!
III. Course outline
1. Distributed mode versus traditional stand-alone mode
2. Hadoop background and HDFS in detail
3. How MapReduce works
4. Cloudera Manager 5.2.0 installation
5. Offline installation of CDH 5.2.0 using parcels
6. Cluster service management in CM
7. Cluster host management in CM and Hadoop job scheduling strategies
8. Hadoop FS commands in detail
9. Second-generation MR: YARN principles
10. Installing and configuring Eclipse and the Hadoop plug-in on Linux
11. "MR Development" Common APIs, the official examples package, and modifying WordCount
12. "MR Development" The whole map-reduce process, using WordCount as an example
13. "MR Development" Hands-on: daily PV calculation by region
14. "MR Development" Hands-on: daily UV calculation by region (deduplication pattern, multi-job dependency)
15. "MR Development" Implementing one MR program with multiple dependent jobs
16. Hadoop bad block handling
17. Hadoop storage balancing and single-node multi-disk storage balancing
18. Hive background and architecture principles
19. Adding the Hive service and metadata management
20. Hive managed tables, external tables, partitioned tables, storage structures
21. HiveQL syntax in detail, part 1
22. HiveQL syntax in detail, part 2: CLI, field types, overwrite
23. Hive enterprise code case sharing and wrapping the hive -e tool, part 1
24. Wrapping the hive -e tool, part 2
25. Hive UDF development and JDBC access
26. Using Sqoop to extract MySQL table data into Hive
27. Advanced Sqoop usage
28. "E-commerce Log Analysis" Loading log files into a Hive partitioned table
29. "E-commerce Log Analysis" Business requirements description
30. "E-commerce Log Analysis" Session information analysis and statistics
31. "E-commerce Log Analysis" UV / PV / logins / visitor count / visit duration / second hops / unique IPs / session statistics
32. Quick introduction to ETL and the data warehouse
33. NameNode HA implementation
CDH5 Video Tutorial