Understanding compaction in HBase, RegionServer memory usage, and the BlockCache mechanism

Last Update:2016-08-04 Source: Internet

Author: User

There are two types of compaction:

(1) Minor compaction: lightweight. It rewrites several small StoreFiles into a smaller number of larger StoreFiles, which reduces the file count; in practice it is a multi-way merge. It does not remove data marked as deleted or previously expired data, and several StoreFiles may still remain after a single minor compaction. Because the contents of each HFile are already sorted, the merge is fast and is limited only by disk I/O performance.

(2) Major compaction: heavyweight. It rewrites all the StoreFiles of a store (one column family of a region) into a single StoreFile. It scans every <key,value> pair and rewrites the data sequentially, skipping any data marked as deleted, so deletions physically take effect at this point. The rewrite is I/O-intensive and can stall or noticeably slow client requests to the affected region until the merge completes. Once the merge finishes, the old StoreFiles that were merged are deleted.
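The difference between the two compaction types can be sketched with a toy model. This is only an illustration under simplifying assumptions, not HBase's implementation: `Cell`, `minor_compact`, and `major_compact` are hypothetical names, a StoreFile is modeled as a sorted Python list, and a delete marker is assumed to mask every older cell with the same key.

```python
import heapq
from typing import List, NamedTuple


class Cell(NamedTuple):
    key: str                 # row key (simplified stand-in for a full cell key)
    ts: int                  # timestamp; larger means newer
    value: str
    is_delete: bool = False  # True for a delete marker (tombstone)


def sort_key(cell: Cell):
    # Key ascending, newest timestamp first, delete markers before puts on ties.
    return (cell.key, -cell.ts, not cell.is_delete)


def minor_compact(store_files: List[List[Cell]]) -> List[Cell]:
    """Multi-way merge of already-sorted files; no cell is dropped."""
    return list(heapq.merge(*store_files, key=sort_key))


def major_compact(store_files: List[List[Cell]]) -> List[Cell]:
    """Merge every file, dropping delete markers and the cells they mask."""
    result = []
    deleted_at = {}  # key -> timestamp of the newest delete marker seen
    for cell in heapq.merge(*store_files, key=sort_key):
        if cell.is_delete:
            deleted_at.setdefault(cell.key, cell.ts)
            continue  # the marker itself is not rewritten
        if cell.key in deleted_at and cell.ts <= deleted_at[cell.key]:
            continue  # masked by a delete marker
        result.append(cell)
    return result


# Three small sorted "StoreFiles" (made-up data).
file_a = [Cell("row1", 1, "a"), Cell("row3", 1, "c")]
file_b = [Cell("row1", 2, "", True), Cell("row2", 1, "b")]
file_c = [Cell("row1", 3, "a2")]

print(len(minor_compact([file_a, file_b, file_c])))                 # all cells kept
print([c.value for c in major_compact([file_a, file_b, file_c])])  # deleted data gone
```

Because every input is already sorted, `heapq.merge` only ever compares the head of each file, which is why the merge cost is dominated by reading and writing the data.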

RegionServer memory is generally configured as follows:

(1) MemStore, approximately 40% of memory space (mainly used for writes):

Write requests go to the MemStore first. The RegionServer keeps a MemStore for each store (column family) of every region it hosts, and a MemStore is flushed to disk once it fills up. When the combined size of all MemStores exceeds the global limit, flushing is forced, starting with the largest MemStore and continuing until the total falls below the limit.

(2) BlockCache, approximately 40% of memory space (mainly used for reads):

A read request checks the MemStore first. If the data is not there, it checks the BlockCache; if that also misses, it reads the data from disk and puts the result into the BlockCache. The BlockCache uses an LRU algorithm: when it reaches its upper limit, it evicts a batch of the least recently used data. Each RegionServer has only one BlockCache.

(3) Other, about 20% of the memory space.
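The forced-flush rule for the MemStore can be sketched as follows. This is a minimal model, not HBase code: `global_memstore_limit` and `pick_flushes` are hypothetical helpers, sizes are in MB, the 40% share is taken from the list above, and flushing is assumed to stop as soon as the total no longer exceeds the limit.

```python
from typing import Dict, List


def global_memstore_limit(heap_mb: int, memstore_percent: int = 40) -> int:
    """Share of the heap reserved for all MemStores combined, in MB."""
    return heap_mb * memstore_percent // 100


def pick_flushes(memstore_sizes: Dict[str, int], limit_mb: int) -> List[str]:
    """Choose which MemStores to flush, largest first,
    until the combined size no longer exceeds limit_mb."""
    total = sum(memstore_sizes.values())
    to_flush = []
    by_size = sorted(memstore_sizes.items(), key=lambda kv: kv[1], reverse=True)
    for name, size in by_size:
        if total <= limit_mb:
            break
        to_flush.append(name)
        total -= size  # a flushed MemStore is emptied
    return to_flush


limit = global_memstore_limit(heap_mb=1000)        # 400 MB with the 40% share
sizes = {"region-a": 120, "region-b": 300, "region-c": 80}
print(pick_flushes(sizes, limit))                   # only the largest needs flushing
```

Flushing the largest MemStore first frees the most memory per flush, so the fewest regions are disturbed.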

For applications that care most about read response time, you can make the BlockCache larger and the MemStore smaller to increase the cache hit ratio.
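To see why a larger cache raises the hit ratio, here is a toy read path that checks the MemStore, then an LRU cache, then disk. It is a sketch under stated assumptions: `ToyRegionServer` and `hit_ratio` are hypothetical names, the cache holds individual keys rather than HFile blocks, and it evicts one entry at a time instead of in batches.

```python
from collections import OrderedDict


class ToyRegionServer:
    def __init__(self, cache_capacity, disk):
        self.memstore = {}                # recent writes, checked first
        self.block_cache = OrderedDict()  # LRU order: oldest entry first
        self.cache_capacity = cache_capacity
        self.disk = disk                  # dict standing in for StoreFiles
        self.cache_hits = 0
        self.disk_reads = 0

    def get(self, key):
        if key in self.memstore:
            return self.memstore[key]
        if key in self.block_cache:
            self.block_cache.move_to_end(key)  # mark as most recently used
            self.cache_hits += 1
            return self.block_cache[key]
        value = self.disk[key]                 # cache miss: go to disk
        self.disk_reads += 1
        self.block_cache[key] = value
        if len(self.block_cache) > self.cache_capacity:
            self.block_cache.popitem(last=False)  # evict least recently used
        return value


def hit_ratio(cache_capacity, reads):
    """Fraction of reads served from the cache for a given access pattern."""
    disk = {key: "value-" + key for key in set(reads)}
    server = ToyRegionServer(cache_capacity, disk)
    for key in reads:
        server.get(key)
    return server.cache_hits / len(reads)


reads = ["a", "b", "c", "a", "b", "c"]
print(hit_ratio(2, reads))  # working set does not fit: every read misses
print(hit_ratio(3, reads))  # working set fits: the second pass hits
```

With this cyclic access pattern, a cache one entry too small evicts each key just before it is needed again, while a cache that fits the working set serves the whole second pass from memory.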

BlockCache priority tiers:

(1) First, the InMemory priority lets you selectively keep column families flagged as in-memory in RegionServer memory, for example the meta table's metadata;

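(2) Second, separating the single and multi priorities prevents the cache thrashing that scan operations would otherwise cause: a block read once is cached as single and is promoted to multi only when it is accessed again, so the eviction algorithm removes the least-used blocks first.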

By default, the BlockCache memory is divided among single, multi, and InMemory in the proportions 0.25, 0.50, and 0.25.
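As a rough illustration of the tiering, the sketch below splits a cache budget by the default proportions and models how a block's priority changes on access. `tier_budgets` and `priority_on_access` are hypothetical helpers written for this example, with percentages expressed as integers to avoid rounding surprises; they are not part of any HBase API.

```python
def tier_budgets(cache_bytes, single=25, multi=50, in_memory=25):
    """Split the BlockCache size across the three priorities (in percent)."""
    if single + multi + in_memory != 100:
        raise ValueError("tier percentages must add up to 100")
    return {
        "single": cache_bytes * single // 100,
        "multi": cache_bytes * multi // 100,
        "in_memory": cache_bytes * in_memory // 100,
    }


def priority_on_access(previous, in_memory_family=False):
    """Priority of a block after an access.

    previous is None when the block is being loaded for the first time.
    """
    if in_memory_family:
        return "in_memory"   # blocks of in-memory column families stay here
    if previous is None:
        return "single"      # first touch, e.g. during a one-off scan
    return "multi"           # touched again: promoted


print(tier_budgets(4096))
print(priority_on_access(None), priority_on_access("single"))
```

A large scan touches each block once, so its blocks compete only for the single tier and cannot push out blocks that have earned a place in the multi tier.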

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.
