Big Data Spark Enterprise Project in Practice (real-world stream data processing applications with Spark SQL and Kafka) download

Source: Internet
Author: User
Tags: scala ide

Link: http://pan.baidu.com/s/1dFqbD4l Password: treq

1. Course Development Environment
The project source code is based on Spark 1.5.2, JDK 8, and Scala 2.10.5.
Development tools: Scala IDE for Eclipse
Other tools: shell scripts
2. Introduction to the Content
This tutorial starts with a basic introduction to Spark, covering its deployment modes and how to set each one up hands-on. It then gradually introduces the RDD computation model, how RDDs are created, and their common operations, along with aspects of distributed computing, RDD persistence, fault tolerance, the shuffle mechanism, shared variables, and related topics.
Building on the RDD, the course then explains the Spark SQL sub-framework. It introduces the DataFrame, its usage scenarios, and how to create one; support for file formats such as Parquet and for different types of data sources; compatibility and integration with Hive; JDBC support for traditional databases; and deployment of the Thrift server. Practical experiments follow to deepen understanding and application of the DataFrame.
Next, the course explains the Spark Streaming sub-framework, introducing the concept of the DStream, its usage scenarios, data sources, operations, fault tolerance, performance tuning, and integration with Kafka.
Finally, two projects take learners into the development environment for hands-on development and debugging. These are practical projects based on Spark SQL, Spark Streaming, and Kafka, intended to deepen understanding of Spark application development. They simplify the actual business logic found in enterprises and emphasize analyzing and debugging errors, making it easier for learners to master Spark development skills.
