The internet has made great progress today. Today, a huge amount of data is uploaded to the Internet. As the size of the data grows, a large number of business scenarios begin to consider the level of data storage expansion, allowing storage services to be added and deleted, while current relational databases are more focused on a single machine. The massive data storage becomes the bottleneck, the single machine cannot load the massive data. HBase is the Apache top open source project separated from Hadoop. Because of its good Java implementation of most of Google's bigtable system features, so in the volume of data explosion today is very popular. How to load the massive data into the HBase is the first step of using HBase, in this paper, we carry on the research and performance comparison of several different loading data methods of hbase, and realize the custom parallel loading data method, the experiment shows that it has good efficiency.
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.