Tips for using Hbase Scan in MR

Last Update:2018-06-11 Source: Internet

Author: User

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

In Hadoop's MR operation, Hbase can be used as the input data source for calculation. The following describes how to use Hbase as the HTable iterator Scan: publicvoidsetBatch (intbatch) publicvoidsetCaching (intcaching) publicvoidsetCacheBlocks (booleancacheBlocks) publicvoidsetB

In Hadoop's MR operation, Hbase can be used as the input data source for calculation. The following are some tips for using Hbase as the HTable iterator Scan: public void setBatch (int batch) public void setCaching (int caching) public void setCacheBlocks (boolean cacheBlocks) public void setB

In Hadoop's MR operation, Hbase can be used as the input data source for calculation. As an HTable iterator, Scan has several usage skills.

The method involved is as follows:

public void setBatch(int batch)public void setCaching(int caching)public void setCacheBlocks(boolean cacheBlocks)

Public void setBatch (int batch ):

To set the number of columns to retrieve records, the default value is unlimited, that is, all columns are returned.

Public void setCaching (int caching ):

The number of lines read from the server each time. The default value is set in the configuration file.

Public void setCacheBlocks (boolean cacheBlocks ):

This parameter indicates whether a block is cached. The default cache is used. Three methods are available: memory, cache, and disk. Generally, data is read from memory-> cache-> disk. When MR is used, data is non-hotspot, therefore, no cache is required.

Therefore, it is best to set MR as follows:

Scan. setCacheBlocks (false); scan. setCaching (200); // memory usage is high, but rpc does not scan. setBatch (6); // The column you need

Existing 0People comment, slam-> Here<-Participate in the discussion

ITeye recommendation

-Software talents free of language and low guarantee paid study in the United States! -

Original article address: Tips for using Hbase Scan in MR. Thank you for sharing it with me.

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Tips for using Hbase Scan in MR

Contact Us

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support