HBase Summary (6): 37 HBase Written-Test Questions

Last Update:2016-04-13 Source: Internet

Author: User


Transferred from: http://blog.csdn.net/lifuxiangcaohui/article/details/39961643

The following questions are excerpted from the Internet. I have added explanations to some of the options so that they are quicker to review later.

1. Which paper does HBase originate from? Answer: C
A. The Google File System
B. MapReduce
C. Bigtable
D. Chubby

2. Which of the following descriptions of HBase are correct? Answer: B, C, D
A. It is not open source
B. It is column-oriented
C. It is distributed
D. It is a NoSQL database

3. HBase relies on () to store its underlying data. Answer: A
A. HDFS
B. Hadoop
C. Memory
D. MapReduce

4. HBase relies on () to provide its message communication mechanism. Answer: A
A. ZooKeeper
B. Chubby
C. RPC
D. Socket

5. HBase relies on () to provide powerful computing capability. Answer: D
A. ZooKeeper
B. Chubby
C. RPC
D. MapReduce

6. Which descriptions of the relationship between MapReduce and HBase are correct? Answer: B, C
A. The two are inseparable; MapReduce is what guarantees that HBase runs normally
B. They are not strongly coupled; HBase runs normally without MapReduce
C. MapReduce can access HBase directly
D. There is no relationship between them

7. Which of the following correctly describe properties of HBase? Answer: A, B, C, D
A. High reliability
B. High performance
C. Column-oriented
D. Scalable

8. Which of the following frameworks is similar to ZooKeeper? Answer: D
A. Protobuf
B. Java
C. Kafka
D. Chubby

Note on C (Kafka): Kafka is a high-throughput distributed messaging system open-sourced by LinkedIn. Like its name, its design is distinctive. First, its developers held that there is no need to cache data in memory, because the operating system's file cache is already good and powerful enough; as long as writes are not random, sequential reads and writes are very efficient. Kafka data is only ever appended sequentially, and the deletion policy is to delete data once it has accumulated to a certain size or is older than a certain age. Another distinctive point is that Kafka stores consumer state on the client rather than on the MQ server, so the server does not have to track message delivery: each client knows where it should read from next, and delivery uses a client-driven pull model, which greatly reduces the load on the server. Kafka also emphasizes reducing serialization and copy overhead: it organizes messages into message sets for batch storage and delivery, and when clients pull data it transfers as much as possible in a zero-copy manner, using advanced I/O functions such as sendfile (corresponding to Java's FileChannel.transferTo/transferFrom). Kafka is thus a carefully designed MQ system aimed at specific applications. I expect such vertical, application-specific products to become more common, and the product strategy behind them is worth considering.

Note on D (Chubby): Many people already know MapReduce, but few seem to be familiar with Chubby. This is not surprising: MapReduce is a programming model aimed at developers, so naturally many people learn it, whereas Chubby is more of an internal tool built to support MapReduce or Bigtable and is largely transparent to developers. First of all, Chubby is a distributed file system: it lets clients create files on the Chubby service and perform some basic operations on them. It is called distributed because a Chubby cell is a distributed system that typically consists of 5 machines, and the whole file system is deployed across those 5 machines.

At a higher semantic level, however, Chubby is a lock service for loosely coupled distributed systems. A lock service provides the "lock" and "unlock" operations that developers use all the time. With Chubby, thousands of clients in a distributed system can lock and unlock a resource.

How does Chubby implement locking? Through files. A lock in Chubby is a file: creating a file is in effect a "lock" operation, and the server that succeeds in creating the file has acquired the lock. Users obtain shared or exclusive locks by opening, closing, and reading files, and update information is sent to users through a notification mechanism.

In summary, Chubby is a lock service that addresses consistency problems in distributed systems, and the lock service itself is implemented as a distributed file system.
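The "creating a file is taking the lock" idea can be sketched on a single machine. This is only a local analogy under stated assumptions: it uses the operating system's atomic exclusive file creation, not Chubby's API, and the names try_acquire, release, and master.lock are hypothetical.

```python
import os
import tempfile

def try_acquire(lock_path):
    """Try to take the lock by creating the file; return True on success."""
    try:
        # O_CREAT | O_EXCL makes creation atomic: it fails if the file exists.
        fd = os.open(lock_path, os.O_CREAT | os.O_EXCL | os.O_WRONLY)
    except FileExistsError:
        return False  # another client already holds the lock
    os.close(fd)
    return True

def release(lock_path):
    """Release the lock by deleting the file."""
    os.remove(lock_path)

lock_dir = tempfile.mkdtemp()
lock_file = os.path.join(lock_dir, "master.lock")  # hypothetical lock name

first = try_acquire(lock_file)    # first client creates the file and wins
second = try_acquire(lock_file)   # second client is refused while it exists
release(lock_file)
third = try_acquire(lock_file)    # the lock is free again after release
release(lock_file)
os.rmdir(lock_dir)
```

A real lock service additionally has to handle client failure (for example with sessions and leases) and replication across the machines of a cell, which this sketch leaves out.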

9. Which of the following is similar to HDFS? Answer: C
A. NTFS
B. FAT32
C. GFS (also a distributed file system: Google's own)
D. EXT3

10. Which of the following are used in the HBase framework? Answer: A, C
A. HDFS
B. GridFS
C. ZooKeeper
D. EXT3

Part II: HBase core knowledge (LSM, the core of HBase's implementation, will be covered in the next article)

11. What does LSM stand for? Answer: A
A. Log-structured merge tree
B. Binary tree
C. Balanced binary tree
D. Long balanced binary tree

12. Which of the following descriptions of the LSM structure are correct? Answer: A, C
A. It stores data sequentially
B. It writes directly to the hard disk
C. It needs to flush data to disk
D. It is a balanced search tree

13. Which kind of performance does LSM guarantee better? Answer: B
A. Read
B. Write
C. Random read
D. Merge

14. Are LSM read operations and write operations independent of each other? Answer: A
A. Yes
B. No
C. LSM does not distinguish between reads and writes
D. In LSM, reads and writes are the same operation

15. Data in an LSM structure is first stored (). Answer: B
A. On the hard disk
B. In memory
C. In a disk array
D. In flash memory
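The LSM answers above (writes land in memory first, data is later flushed to disk as sorted sequential runs, and reads are handled separately from writes) can be illustrated with a toy store. This is a minimal sketch, not HBase's implementation: the class TinyLSM and its memtable_limit parameter are hypothetical, and Python lists stand in for on-disk files.

```python
import bisect

class TinyLSM:
    """Toy LSM store: writes go to memory, full memtables are flushed as sorted runs."""

    def __init__(self, memtable_limit=2):
        self.memtable_limit = memtable_limit  # hypothetical flush threshold
        self.memtable = {}                    # in-memory buffer for recent writes
        self.segments = []                    # flushed sorted runs, oldest first

    def put(self, key, value):
        self.memtable[key] = value            # a write only touches memory
        if len(self.memtable) >= self.memtable_limit:
            self.flush()

    def flush(self):
        if not self.memtable:
            return
        # Sort once and write sequentially; a list stands in for a disk file.
        self.segments.append(sorted(self.memtable.items()))
        self.memtable = {}

    def get(self, key):
        if key in self.memtable:              # newest data first
            return self.memtable[key]
        for segment in reversed(self.segments):   # then newest run to oldest
            keys = [k for k, _ in segment]
            i = bisect.bisect_left(keys, key)
            if i < len(keys) and keys[i] == key:
                return segment[i][1]
        return None

    def compact(self):
        """Merge all runs into one, keeping the newest value for each key."""
        merged = {}
        for segment in self.segments:         # oldest first, so newer values win
            merged.update(segment)
        self.segments = [sorted(merged.items())] if merged else []

store = TinyLSM(memtable_limit=2)
store.put("a", 1)
store.put("b", 2)      # reaches the limit and triggers a flush
store.put("a", 3)      # the newer value for "a" stays in the memtable
```

Because a write never has to locate and modify an existing record on disk, writes stay cheap; the cost moves to reads, which may have to consult several runs until a merge (compaction) reduces their number.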

16. The Data field in the HFile data format is used to (). Answer: A
A. Store the actual KeyValue data
B. Store the starting point of the data
C. Specify the length of the field
D. Store the starting point of the data block

17. The MetaIndex field in the HFile data format is used to store the (). Answer: D
A. Length of the meta block
B. End point of the meta block
C. Data content of the meta block
D. Start point of the meta block

18. The Magic field in the HFile data format is used to (). Answer: A
A. Store a random number to guard against data corruption
B. Store the starting point of the data
C. Store the starting point of the data block
D. Specify the length of the field

19. Which of the following descriptions of the KeyValue format within the HFile data format are correct? Answer: A, D
A. It is a byte[] array
B. It has no fixed structure
C. Its data size is fixed-length
D. It has a fixed structure

20. In the KeyValue format within the HFile data format, the value part is (). Answer: C
A. A string with a complex structure
B. A string
C. Binary data
D. Compressed data
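To make "a byte[] array with a fixed structure whose value part is plain binary data" concrete, here is a sketch that packs and unpacks one cell. The field order (key length, value length, then a key made of row length, row, family length, family, qualifier, timestamp, and type, followed by the raw value) follows the layout commonly documented for HBase's KeyValue; treat the exact layout and the type code 4 for a Put as assumptions to check against your HBase version. The function names are hypothetical.

```python
import struct

PUT_TYPE = 4  # assumed type code for a Put cell

def encode_keyvalue(row, family, qualifier, timestamp, value, kv_type=PUT_TYPE):
    """Serialize one cell as: key length, value length, key bytes, value bytes."""
    key = (
        struct.pack(">H", len(row)) + row            # 2-byte row length + row
        + struct.pack(">B", len(family)) + family    # 1-byte family length + family
        + qualifier                                  # qualifier (length is implied)
        + struct.pack(">q", timestamp)               # 8-byte timestamp
        + struct.pack(">B", kv_type)                 # 1-byte key type
    )
    return struct.pack(">ii", len(key), len(value)) + key + value

def decode_keyvalue(buf):
    """Split a serialized cell back into its fields."""
    key_len, value_len = struct.unpack_from(">ii", buf, 0)
    key = buf[8:8 + key_len]
    value = buf[8 + key_len:8 + key_len + value_len]
    (row_len,) = struct.unpack_from(">H", key, 0)
    row = key[2:2 + row_len]
    fam_len = key[2 + row_len]
    fam_start = 3 + row_len
    family = key[fam_start:fam_start + fam_len]
    # Whatever remains before the trailing timestamp (8) and type (1) is the qualifier.
    qualifier = key[fam_start + fam_len:key_len - 9]
    (timestamp,) = struct.unpack_from(">q", key, key_len - 9)
    kv_type = key[key_len - 1]
    return row, family, qualifier, timestamp, kv_type, value

cell = encode_keyvalue(b"row1", b"cf", b"name", 1460505600000, b"\x00\x01binary")
```

The total size varies with the row, qualifier, and value, which is why the structure is fixed while the data size is not; the value is carried as uninterpreted bytes.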

Part III: Introduction to advanced HBase applications

31. Bulk load in HBase is implemented using the () layer. Answer: A
A. MapReduce
B. Hive
C. Coprocessor
D. Bloom filter

32. Which of the following are aspects of HBase performance optimization? Answer: A, B, C, D
A. Read optimization
B. Write optimization
C. Configuration optimization
D. JVM optimization

33. Which of the following descriptions of rowkey design principles are correct? Answer: A, B, C
A. Keep the rowkey as short as possible
B. Chinese characters may be used
C. Strings may be used
D. Rowkeys are inherently unordered
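Option D is wrong because rowkeys are stored in byte order, so how a key is encoded decides where a row lands. The sketch below shows that effect with plain Python bytes; make_rowkey and its width parameter are hypothetical helpers, and zero-padding is just one common way to make byte order match numeric order.

```python
def make_rowkey(user_id, width=6):
    """Build a short fixed-width string rowkey, zero-padded so it sorts numerically."""
    return f"u{user_id:0{width}d}".encode("utf-8")

# Without padding, byte order puts "u10" before "u2".
naive = sorted(f"u{i}".encode("utf-8") for i in (2, 10, 1))

# With padding, byte order and numeric order agree.
padded = sorted(make_rowkey(i) for i in (2, 10, 1))

# Chinese characters are legal in a rowkey, but each takes 3 bytes in UTF-8,
# which works against the "as short as possible" guideline.
cjk_key = "用户1".encode("utf-8")
```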

34. Which of the following are used to build a secondary index in HBase? Answer: A, B
A. MapReduce
B. Coprocessor (HBase introduced coprocessors in version 0.92, enabling some exciting new features: easily building secondary indexes, complex filters (predicate pushdown), access control, and so on)
C. Bloom filter
D. Filter

35. Which of the following descriptions of HBase secondary indexes are correct? Answer: A, B
A. The core is an inverted table
B. "Secondary" is relative to the rowkey, which is the "primary" index
C. The secondary index uses a balanced binary tree
D. The secondary index uses the LSM structure
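The "inverted table" at the core of a secondary index can be sketched with plain dictionaries: the primary table is keyed by rowkey, and the index maps a column value back to the rowkeys that hold it. The table contents and the names build_index and lookup are hypothetical; in HBase the index would typically be a second table maintained by MapReduce jobs or coprocessors.

```python
from collections import defaultdict

# Hypothetical primary table: rowkey -> {column: value}
table = {
    "u001": {"city": "Hangzhou", "name": "Li"},
    "u002": {"city": "Beijing", "name": "Wang"},
    "u003": {"city": "Hangzhou", "name": "Zhao"},
}

def build_index(rows, column):
    """Inverted table: column value -> sorted list of rowkeys holding that value."""
    index = defaultdict(list)
    for rowkey in sorted(rows):
        value = rows[rowkey].get(column)
        if value is not None:
            index[value].append(rowkey)
    return dict(index)

def lookup(rows, index, value):
    """Two-step read: the index yields rowkeys, then the primary table yields rows."""
    return [rows[rowkey] for rowkey in index.get(value, [])]

city_index = build_index(table, "city")
hangzhou_names = [row["name"] for row in lookup(table, city_index, "Hangzhou")]
```

Without the index, finding everyone in a given city would require scanning every row, because the rowkey is the only thing the primary table is ordered by.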

36. Which of the following descriptions of a Bloom filter are correct? Answer: A, C
A. It is a very long binary vector plus a series of random mapping functions
B. It has no false-positive rate
C. It has a certain false-positive rate
D. Elements can be deleted from a Bloom filter
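A small sketch shows why options A and C hold and B and D do not. The class below is illustrative only: the sizes are arbitrary, and deriving several hash functions by salting SHA-256 is an assumption made for brevity, not how HBase implements its Bloom filters.

```python
import hashlib

class BloomFilter:
    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for clarity

    def _positions(self, item):
        # Derive several hash functions by salting one hash with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos] = 1

    def might_contain(self, item):
        # False means "definitely absent"; True means only "possibly present".
        return all(self.bits[pos] for pos in self._positions(item))

bf = BloomFilter()
for key in ("row1", "row2", "row3"):
    bf.add(key)

# A deliberately tiny filter makes the false positive easy to see:
# with a single bit, every lookup after one insert answers "possibly present".
tiny = BloomFilter(num_bits=1, num_hashes=1)
tiny.add("x")
false_positive = tiny.might_contain("never added")
```

Deletion is not supported because several items may share a bit; clearing it for one item could make the filter wrongly report another item as absent.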

Part IV: HBase installation, deployment, startup

37. On which operating systems can the official version of HBase be installed? Answer: A, B, C
A. CentOS
B. Ubuntu
C. RedHat
D. Windows

38. HBase pseudo-distributed mode requires () node(s). Answer: A
A. 1
B. 2
C. 3
D. At least 3

39. HBase distributed mode ideally requires () nodes. Answer: C
A. 1
B. 2
C. 3
D. Minimum

40. Which of the following are required before installing HBase? Answer: A, B
A. An operating system
B. A JDK
C. A shell script
D. Java code

41. Which Linux command extracts an HBase archive ending in .tar.gz? Answer: A
A. tar -zxvf
B. tar -zx
C. tar -s
D. tar -nf
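For readers who want to script the extraction rather than type the command, the flags of tar -zxvf map onto Python's standard tarfile module roughly as follows. This is a sketch under assumptions: extract_tar_gz is a hypothetical helper, and the tiny hbase-demo archive is built on the spot only so that the example runs without a real HBase download.

```python
import os
import tarfile
import tempfile

def extract_tar_gz(archive_path, dest_dir):
    """Rough equivalent of `tar -zxvf archive_path -C dest_dir`."""
    os.makedirs(dest_dir, exist_ok=True)
    names = []
    with tarfile.open(archive_path, "r:gz") as archive:   # z: gzip, f: file
        for member in archive.getmembers():
            names.append(member.name)                     # v: list each entry
        archive.extractall(dest_dir)                      # x: extract
    return names

# Build a tiny stand-in archive so the sketch is self-contained.
work = tempfile.mkdtemp()
src = os.path.join(work, "hbase-demo")        # hypothetical directory name
os.mkdir(src)
with open(os.path.join(src, "README.txt"), "w") as f:
    f.write("demo")
archive_path = os.path.join(work, "hbase-demo.tar.gz")
with tarfile.open(archive_path, "w:gz") as archive:
    archive.add(src, arcname="hbase-demo")

out = os.path.join(work, "out")
extracted = extract_tar_gz(archive_path, out)
```

Only extract archives from a source you trust, since an archive can contain paths that point outside the destination directory.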
