Course outline and content introduction:
About 35 minutes per lecture, no fewer than 40 lectures
Chapter I (11 lectures)
· Distributed mode versus traditional stand-alone mode
· Hadoop background and how it works
· How MapReduce works
· How YARN, the second-generation MapReduce, works
· Cloudera Manager 4.1.2 Installation
· Cloudera Hadoop 4.1.2 Installation
· Cluster management under CM, part one
· Cluster management under CM, part two
· The Hadoop FS commands in detail
· Managing a cluster with Cloudera Manager
· Advanced cluster management under Cloudera Manager
Chapter II (about 10 lectures)
· Hive data tables and data storage
· Java Extension Development for Hive
· Hive UDF and UDAF development
· Hive JDBC Connection
· Common Hive scenarios and hands-on exercises
· Development of a parameter-passing and cross-file reference framework for hive -f
Because the hive -f command cannot pass parameters, cross-file use of Hive scripts is essentially crippled and cannot be rolled out at scale. The framework allows arbitrary parameters to be passed, making enterprise Hive application development more efficient and concise.
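The idea of passing parameters into a script that is run with hive -f can be sketched roughly as follows. This is not the course's framework, only a minimal illustration under stated assumptions: the script uses ${name} placeholders, the function names and the sample query are hypothetical, and the hive command line client is assumed to be on the PATH when the script is actually run.

```python
import subprocess
import tempfile
from string import Template


def render_hql(template_text, params):
    """Replace ${name} placeholders with values from params.

    Raises KeyError if the script references a parameter that was not supplied.
    Assumes the script contains no other use of the $ character.
    """
    return Template(template_text).substitute(params)


def write_rendered_script(template_text, params):
    """Render the script, write it to a temporary .hql file, return the path."""
    rendered = render_hql(template_text, params)
    with tempfile.NamedTemporaryFile("w", suffix=".hql", delete=False) as handle:
        handle.write(rendered)
        return handle.name


def run_hive_script(template_text, params):
    """Render the script, then run it with `hive -f` and return the exit code."""
    path = write_rendered_script(template_text, params)
    return subprocess.call(["hive", "-f", path])


if __name__ == "__main__":
    # Hypothetical table and column names, for illustration only.
    query = "SELECT COUNT(*) FROM access_log WHERE dt = '${dt}' AND site = '${site}';"
    print(render_hql(query, {"dt": "2014-01-01", "site": "shop"}))
```

Rendering the script before handing it to hive -f keeps the HQL files reusable across dates or sites; a missing parameter fails early with a KeyError instead of producing a broken query.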
Chapter III (about 5 lectures)
· How Sqoop works
· Sqoop usage in detail
· Using Sqoop to exchange data between HDFS/Hive and relational databases
· Using Sqoop to exchange data between HBase and relational databases
Chapter IV (about 8 lectures)
· HBase principle
· HBase System Architecture
· HBase storage mechanism
· HBase Basic Usage
· HBase table design ideas and solutions
· Common application scenarios
· Interacting with Hive
· Java access and web development
Chapter V: Hands-on project (about 8 lectures)
E-commerce log traffic analysis project. Analyzing massive logs is an important application of Hadoop at Internet companies, and an important way to analyze site traffic and customer behavior. The project integrates Hive, HBase, Sqoop, and other common components, covering every technical step from back-end processing to front-end rendering.
Including:
· Introduction to business requirements
· Data modeling
· Back-end algorithm design
· Back-end business processing
· Front-end web display, etc.
...
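As a rough illustration of the kind of back-end processing a log traffic analysis involves, the sketch below counts page views per URL in a map-then-reduce style. It is not the project's actual algorithm: the log format (IP, timestamp, URL separated by whitespace) and all function names are assumptions made for the example, and it runs in plain Python rather than on a Hadoop cluster.

```python
from collections import defaultdict


def map_line(line):
    """Map step: emit (url, 1) for a well-formed line, nothing otherwise.

    Assumed (hypothetical) line format: "<ip> <timestamp> <url>".
    """
    fields = line.split()
    if len(fields) < 3:
        return []
    return [(fields[2], 1)]


def reduce_counts(pairs):
    """Reduce step: sum the values emitted for each key."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


def page_views(lines):
    """Run the map step over every line, then reduce the emitted pairs."""
    pairs = []
    for line in lines:
        pairs.extend(map_line(line))
    return reduce_counts(pairs)


if __name__ == "__main__":
    sample = [
        "1.1.1.1 2014-01-01T00:00:00 /home",
        "2.2.2.2 2014-01-01T00:00:01 /cart",
        "3.3.3.3 2014-01-01T00:00:02 /home",
    ]
    print(page_views(sample))
```

On a real cluster the same aggregation would more likely be written as a Hive query or a MapReduce job; the sketch only shows the shape of the computation.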
Detailed list of lectures:
1st Lecture: Cloudera Manager introduction and installation
2nd Lecture: Cloudera Manager in detail
3rd Lecture: CDH4.1 introduction and environment setup, part one
4th Lecture: CDH4.1 environment setup, part two
5th Lecture: How Hadoop works and its scheduling strategies
6th Lecture: Forms of Hadoop job development
7th Lecture: Advanced CDH4.1 cluster management under CM, part one
8th Lecture: Advanced CDH4.1 cluster management under CM, part two
9th Lecture: Summary and how Hadoop works
10th Lecture: How Hive works and basic usage
11th Lecture: Hive metadata management and syntax explanation
12th Lecture: Hive tables and storage structure
13th Lecture: Operations and maintenance case study - single-machine storage balancing and bad-block handling
14th Lecture: Hive QL, part one
15th Lecture: Hive QL, part two
16th Lecture: UDF and UDAF development
17th Lecture: UDAF development and JDBC access
18th Lecture: Summary of Hive optimization rules
19th Lecture: Hive data compression techniques
20th Lecture: Wrapping hive -f to support parameter passing, part one
21st Lecture: Wrapping hive -f to support parameter passing, part two
22nd Lecture: Sqoop usage, part one
23rd Lecture: Sqoop usage, part two
24th Lecture: Sqoop job scheduling
25th Lecture: HBase architecture
26th Lecture: HBase table design case study
27th Lecture: HBase data loading (Sqoop and Java)
28th Lecture: HBase storage mechanism
29th Lecture: Operating HBase from Java, part one
30th Lecture: Operating HBase from Java, part two
31st Lecture: Operating HBase from Java, part three
32nd Lecture: The HBase and Hive interface, and project introduction
33rd Lecture: Online real-time order query - schema design and HBase data loading
34th Lecture: Online real-time order query - DAO layer implementation
35th Lecture: Online real-time order query - DAO layer and front-end implementation
36th Lecture: E-commerce log traffic analysis - project introduction
37th Lecture: E-commerce log traffic analysis - business implementation, part one
38th Lecture: E-commerce log traffic analysis - business implementation, part two
39th Lecture: E-commerce log traffic analysis - business implementation, part three
40th Lecture: E-commerce log traffic analysis - business implementation, part four
41st Lecture: CDH5 setup, and CM5 installation and deployment
42nd Lecture: CDH5 setup, and cluster management through the CM interface
Cloudera Hadoop 4 hands-on course (Hadoop 2.0, cluster management through a web interface, e-commerce online query + offline log analysis)