Spark Rdd Flex 7 Point

Source: Internet
Author: User
Tags spark rdd

1. Automatic seamless switching between disk data and memory

2, based on lineage of high-efficient fault tolerance, nth error, will be executed from the beginning of n-1

3. Task failure will be retried a specific number of times

4, the stage failure will automatically make a specific number of retries, and only run the calculation of the failed data shards

5, checkpoint (similar to the archive in a single game) and presist, persistent cache

6, data scheduling flexibility, DAG task is not related to resource management

7, data fragmentation of the high elasticity, repartition,1w a large, into a small 10W, 10W small to become 1W large.

Spark Rdd Flex 7 Point

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.