2016 year-end summary

Source: Internet
Author: User
Tags shuffle spark rdd

1.spark Core:spark RDD Core summary; Spark operator selection strategy; Spark core job scheduling and task scheduling; Spark parameter tuning; Spark Operational Architecture Core Summary; Spark Shuffle principle, shuffle operational problem solving and parameter tuning

2.spark SQL or SQL: This has not been the opportunity to go deep, just stay in the basic understanding stage;

3.spark ml or mllib: Learning is relatively loose, basic understanding, but with less time, only in the building of user portrait of the use of some common classification model

Plan in-depth: Spark implements Item-base cf,xgboost support for Spark; SPARK-KNN

4.python: "Machine learning Combat" code part, part Leetcode code; Contact Numpy,matplotlib,pandas,scikit-learn

5. Machine learning: "Statistical learning Method" "machine learning Combat"; Machine Learning Summary

Plan in-depth: "Statistical learning basic data Mining inference prediction", "data mining concept and technology"

2016 year-end summary

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.