TEZ MRR optimize to MR?

Source: Internet
Author: User

https://issues.apache.org/jira/browse/HIVE-2340

Select Userid,count (*) from the U_data group by UserID ORDER by UserID would product MRR.

I think when the result of Userid,count (*) is small (one reduce can process the result). This query plan can optimize to MR?


To prevent-reducer merging, the reducer merging only kicks

Optimizer thinks it gets a perf boost.

MR-MRR is not a big win when it comes Tez, due to Container-reuse-

Going wide on the large cardinality in case of missing map-side

Aggregation'll be safer.

If Hive.map.aggr=true and the UserID set fits within memory, then smushing

The reducers would be nicer.

To reset the wide-narrow checks, do

Set hive.optimize.reducededuplication.min.reducer=1;

But being aware that it'll fail (I1ve seen full disks) as a scale upwards

to the + + Tb cases.

Cheers,

Gopal

Hive.optimize.reducededuplication.min.reducer
    • Default Value:4

    • Added in:hive 0.11.0 with HIVE-2340

Reduce deduplication Merges-RSs (reduce sink operators) by moving Key/parts/reducer-num of the child RS to PA Rent RS. That means if reducer-num of the child RS was fixed (order by or forced bucketing) and small, it can make very slow, single MR the optimization would be disabled if number of reducers are less than specified value.


TEZ MRR optimize to MR?

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.