Chengdu Big Data Hadoop and Spark technology training course





China Information Training Center has launched a practical training course on big data technology architecture and applications. Through a systematic treatment of the Hadoop and Spark technology architecture and real industry cases, the course aims to improve the skills of big data engineers, developers, and designers, and to train Hadoop and Spark architecture specialists who can better support the development and implementation of big data projects across industries.



Upcoming open class schedule (nationwide touring classes):

August 21 to August 23: Dalian

September 23 to September 25: Beijing

October 16 to October 18: Chengdu

November 27 to November 29: Shenzhen

December 24 to December 26: Guangzhou

January 27 to January 29: Hangzhou



Target audience:

Application development engineers working with Hadoop and Spark big data technology

Big data analytics and mining engineers

Big data cluster operation and maintenance engineers

IT managers for big data projects

Consultants for big data project planning

Enthusiasts interested in Hadoop and Spark big data technology

IT and management personnel at enterprises in any industry that plan to launch big data projects or have big data application needs

Some background in Java and Linux is preferred.



Training certificate: "Senior Architect of Big Data Hadoop Development", issued by China Information Training Center.



Fee: 5,800 yuan per person






Open course training outline (in-house training programs can be customized):


Schedule

Training modules

Training Essentials

First day

Morning

I. Basic introduction to big data technology

1. Background and history of big data

2. The relationship between big data and cloud computing

3. Big data application requirements and potential value analysis

4. Latest industry trends in big data technology and applications

5. Technology selection and architecture design for big data projects

6. Big data applications in e-commerce, manufacturing, retail and wholesale, telecom operators, internet finance, online banking, e-government, mobile internet, education, and the information industry in the "Internet +" era

II. Mainstream industry big data products and project solutions

7. Introduction to major big data solutions, domestic and international

8. Comparison of current big data solutions with traditional database solutions

9. Analysis of the Apache big data platform solution

10. Analysis of the CDH big data platform solution

11. Analysis of the HDP big data platform solution

12. Analysis of open source big data ecosystem platforms

III. Hadoop and Spark big data processing platforms

13. Hadoop's development history and practical industry applications

14. Hadoop big data platform architecture, and the working principles and mechanisms of PB-scale data processing

15. Anatomy of Hadoop's core components

16. Spark's development history and practical industry applications

17. Spark real-time big data processing platform architecture, and the principles and mechanisms of large-scale in-memory data processing

18. Anatomy of Spark's core components

First day

Afternoon

IV. Large-scale data collection and distributed message subscription systems

19. Flume NG: data flow model, platform architecture, cluster deployment, and configuration

20. Kafka distributed message subscription system: introduction, platform architecture, cluster deployment, and configuration

21. Scribe distributed log collection system: introduction, working principles, platform architecture, cluster deployment, and hands-on configuration

22. ZooKeeper distributed coordination service: working principles, platform architecture, cluster deployment, and hands-on configuration

V. Big data distributed storage systems

23. Introduction to the HDFS distributed file system

24. Master-slave platform architecture and working principles of HDFS

25. HDFS core technologies explained

26. Hands-on HDFS application development

27. Installation, deployment, configuration, and performance optimization techniques for HDFS clusters

28. Distributed key-value storage systems: introduction, platform architecture, core technologies, and application development

29. Project case analysis of PB-scale big data storage systems

VI. Big data MapReduce and YARN parallel processing platforms

30. MapReduce parallel computing model

31. MapReduce job execution and scheduling

32. How YARN, the second-generation big data computing framework, works, and the DAG parallel execution mechanism

33. Setting up a MapReduce application development environment and developing big data parallel processing applications

34. Advanced MapReduce programming techniques and performance optimization practice

35. MapReduce and YARN project case practice

Second day

Morning

VII. Spark real-time big data processing platform

36. Introduction to the in-memory computing model and real-time processing technology

37. Spark distributed real-time processing framework and working principles

38. Spark cluster platform architecture and analysis of its ecosystem components

39. Spark SQL application practice

40. Spark Streaming application practice

41. MLlib/MLbase real-time machine learning application practice

42. GraphX real-time graph data processing application practice

43. Installation, deployment, and configuration optimization of Spark real-time processing clusters

44. Hands-on Spark programming and application development

45. Spark and Hadoop integration solutions in practice

VIII. Storm stream data processing platform

46. Storm stream processing system: introduction, platform architecture, and working principles

47. Storm cluster installation, deployment, and configuration optimization

48. Hands-on Storm log analysis project

Second day

Afternoon

IX. HBase distributed database management system

49. Introduction to NoSQL and NewSQL database technologies and their application to semi-structured and unstructured big data

50. HBase distributed database: introduction, data model, and working principles

51. Analysis of the platform architecture and key technologies of HBase distributed database clusters

52. HBase application project development techniques and client development

53. HBase table design, data manipulation, and database management API calls

54. Installation, deployment, and configuration optimization of HBase clusters

55. Operation, maintenance, monitoring, and management of HBase clusters

X. Cassandra Data Management System

56. Introduction to applications of the Cassandra data storage management system

57. Cassandra cluster platform architecture and core technologies

58. Cassandra's consistent hashing algorithm and data object distribution strategy

59. Installation, deployment, and configuration optimization of Cassandra clusters

60. Hands-on Cassandra application development

Third Day

Morning

XI. In-memory database management system clusters

61. Introduction to applications of the Impala real-time query system

62. Impala real-time query system: platform architecture and analysis of core technologies

63. Deployment and application development practice for the Impala real-time query system

64. Redis in-memory database: introduction and industry application cases

65. Redis in-memory database cluster architecture and core technology analysis

66. Redis cluster installation, deployment, and hands-on application development

XII. Hive big data warehouse cluster platform

67. Fundamentals of Hadoop-based large-scale distributed data warehouses and industry application practice

68. Fundamentals of Spark-based real-time data warehouse clusters and industry application practice

69. Introduction to the Hive big data warehouse and its applications

70. Analysis of the platform architecture and core technologies of Hive data warehouse clusters

71. HiveServer working principles and application techniques

72. Installation, deployment, and configuration optimization of Hive data warehouse clusters

73. Hive application development techniques

74. HiveQL definition and application

75. Hive data warehouse tables and table partitioning, table operations, data import and export, and client operation techniques

76. Hive data warehouse report design, HWI and CLI client demonstrations, and development practice for user-defined functions (UDFs)

Third Day

Afternoon

XIII. Mahout big data analysis and mining platform

77. Introduction to big data analysis and mining technology, and industry big data mining application cases

78. Architecture, core algorithms, and key technologies of the Mahout big data mining platform

79. Hands-on Mahout-based data mining application development

80. Installation, deployment, and configuration optimization of Mahout clusters

81. Hands-on application of a big data mining platform integrating Mahout and Hadoop

XIV. Big data intelligent ETL operations and Hadoop cluster operation, maintenance, and monitoring tool platforms

82. Frameworks for data conversion between Hadoop and DBMSs

83. How Sqoop data import and export work, and Sqoop cluster installation, deployment, and configuration

84. Kettle cluster platform architecture, working principles of core technologies, and application cases

85. Kettle cluster installation, deployment, configuration, and hands-on application development

86. Using Sqoop to import and export data between MySQL and Hadoop clusters

87. Installation, deployment, and configuration optimization of the Hue platform, a Hadoop big data operation, maintenance, and monitoring system

XV. Big data project application

88. Complete big data project deployment design and application development practice based on real-world application cases

