Cloudera Hadoop 4 Combat Course (Hadoop 2.0, cluster interface management, e-commerce online query + log offline analysis)

Source: Internet
Author: User
Tags sqoop hadoop fs

Course Outline and Content introduction:

About 35 minutes per lesson, no less than 40 lectures

The first chapter (11 speak)

• Distributed and traditional stand-alone mode

· Hadoop background and how it works

· Analysis of the working principle of MapReduce

• Analysis of the second generation Mr--yarn principle

· Cloudera Manager 4.1.2 Installation

· Cloudera Hadoop 4.1.2 Installation

· CM under the cluster management one

· CM under the cluster Management two

· The Hadoop FS command is detailed

Cloudera Manager Management Cluster
Cluster advanced management under Cloudera Manager

Chapter II (about 10 speak)


· Hive data tables and data storage
· Java Extension Development for Hive
· Hive UDF and UDAF development
· Hive JDBC Connection
· Hive common scenes, practical exercises
· Hive-f development of the communication and reference framework
Because Hive has command hive-f cannot pass parameters, the use of hive cross-file is basically paralyzed,
cannot be massively promoted. The framework can be arbitrarily transmitted, making hive enterprise application development more efficient and concise.


Chapter III (about 5 speak)


· Sqoop principle
· Sqoop use of the detailed
• Use Sqoop to achieve hdfs/hive data interaction with relational databases
• Use Sqoop to implement HBase's data interaction with relational databases


Fourth chapter (about 8 Speak)


· HBase principle
· HBase System Architecture
· HBase storage mechanism
· HBase Basic Usage
· HBase table design ideas and solutions
• Common Application Scenarios
• Interacting with Hive
· Java Access, web development


The fifth chapter of the project combat (about 8 Speak)


E-commerce Log Traffic Analysis project, Internet Enterprises on the massive log analysis is an important use of Hadoop applications, but also the site traffic, customer behavior analysis of an important way. The project integrates hive, Hbase, sqoop and other common components, covering every technical link from background processing to foreground rendering.
Including:
• Introduction to Business requirements
• Data Modeling
• Background algorithm design
• Background Business Processing
• Front Office web display, etc.
...

Detailed outline list of courses:

First Lecture: Cloudera Manager Introduction and Installation
Second Lecture: Cloudera manager detailed
The third Lecture: CDH4.1 Introduction and environment to build a
Four: CDH4.1 Environment building Two
Five: Hadoop working principle, scheduling strategy
VI: Hadoop Development Job Form
Seventh: CM under CDH4.1 cluster senior management One
Eighth: CM under CDH4.1 cluster Senior Management II
Nineth Lecture: Summary and how Hadoop works
Tenth: How hive works and basic usage
11th: Hive Meta Data management and syntax explanation
12th: Hive table and storage structure
13th: operation and Maintenance case sharing _ single-machine storage equalization and bad block processing
14th: Hive QL One
15th: Hive QL II
16th Lecture: UDF and UDAF development
17th Lecture: UDAF Development and JDBC Access
18th: Summary of Hive optimization rules
19th: Hive Data compression technology
20th: Hive-f Package supports a
21st: Hive-f Package supports two-parameter
22nd: Sqoop uses a
23rd: Sqoop Use two
24th Lecture: Sqoop Job scheduling
25th Lecture: HBase Architecture
26th Lecture: HBase table Design case
27th Lecture: HBase Data loading (Sqoop and Java)
28th Lecture: hbase storage mechanism
29th: Java Operation HBase One
30th: Java Operation HBase II
31st Lecture: Java Operation HBase Three
32nd: HBase and Hive Interface and project introduction
33rd: Order online Real-time query _schema design and hbase data loading
34th: Order online real-time query _dao layer implementation
35th: Order online real-time query _dao layer and foreground implementation
36th: E-Commerce Log Traffic Analysis _ Project Introduction
37th: E-Commerce Log Traffic Analysis _ Business implementation of a
38th: E-commerce Log Traffic Analysis _ Business implementation Two
39th: E-commerce Log Traffic Analysis _ Business implementation Three
40th: E-Commerce Log Traffic Analysis _ Business implementation Four

41st: CDH5 Build CM5 Installation Deployment

42nd: CDH5 Building and CM interface cluster management

Cloudera Hadoop 4 Combat Course (Hadoop 2.0, cluster interface management, e-commerce online query + log offline analysis)

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.