Hive parameters related to merge

Source: Internet
Author: User

This article looks at three easily confused Hive parameters related to merge:

hive.mergejob.maponly

hive.merge.mapfiles

hive.merge.mapredfiles

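Two of these parameters, hive.merge.mapfiles and hive.merge.mapredfiles, control whether a job's output files are merged; the third, hive.mergejob.maponly, controls how the merge itself is carried out. The names are easy to mix up.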

hive.merge.mapfiles (default: true): controls whether the output files of a map-only job are merged. One caveat applies when Hive supports dynamic partitioning (DP) and a table with DP columns is involved: if Hadoop does not support CombineInputFormat (that is, versions before 0.20), this parameter and the next one have no effect, and both behave as if set to false.

hive.merge.mapredfiles (default: false): controls whether the output files of a map-reduce job are merged.

hive.mergejob.maponly (default: true): controls whether the merge runs as a map-only job or as a map-reduce job. The map-only form requires Hadoop to support CombineInputFormat; on versions before 0.20, setting this parameter to true has no effect. There is also an unfixed bug related to this parameter (https://issues.apache.org/jira/browse/HIVE-2869), which is why the default value is true.
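
As a rough illustration, the three parameters described above could be set for a single Hive session as follows. This is only a sketch: it assumes a Hive version that still recognizes all three parameters and a Hadoop version with CombineInputFormat support, and the values shown are examples rather than recommendations.

```sql
-- Merge the output files of map-only jobs (default per the text above: true).
SET hive.merge.mapfiles=true;

-- Also merge the output files of map-reduce jobs (default per the text above: false).
SET hive.merge.mapredfiles=true;

-- Run the merge step as a map-only job rather than a map-reduce job.
-- Per the text above, this only takes effect when Hadoop supports CombineInputFormat.
SET hive.mergejob.maponly=true;
```

Running SET with just a parameter name, for example SET hive.merge.mapfiles; prints the value currently in effect, which is a quick way to check the defaults on a given installation.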

