This two-year spark technology is very fire, oneself also in the fun, repeated experiments, research, there is a lot of pain and ecstatic, take the time to organize into articles to share to everyone. This series is basically about the spark ecosystem, from introduction, compilation, deployment, to programming model, running architecture, and finally introducing its components sparksql, spark streaming, spark Mlib, and Spark Graphx. The content of the article is generally the first introduction of the principle, followed by practical examples, due to the introduction of the reader, in combat more, please understand. For everyone to experiment convenient, here the experiment related test data and installation package on the Baidu disk to provide download.
The following is a list of articles for this series:
1, Spark Introduction download
2.Spark compilation and Deployment (top)-Basic Environment Setup Download
2.Spark compilation and Deployment (Medium)--hadoop compilation installation Download
2.Spark compilation and Deployment (bottom)--spark compile and install download
3.Spark programming Model (above)--concept and Sparkshell actual combat download
3.Spark programming model (bottom)--idea Construction and practical download
4.Spark Run schema download
5.Hive (UP)--hive Introduction and Deployment Download
5.Hive (next)--hive actual download
6.SparkSQL (a)--sparksql introduction download
6.SparkSQL (ii)--in-depth understanding of operational plans and tuning downloads
6.SparkSQL (three)--spark practical application Download
7.Spark Streaming (one)--Principle Download
7.Spark Streaming (b)--Practice Download
8.Spark MLlib Download
9.Spark GraphX Download
Big Gift--spark Introduction Combat series