Hadoop is a software framework that enables distributed processing of large amounts of data. Hadoop handles data in a reliable, efficient, and scalable way.
In addition to Apache Hadoop itself, several vendors provide their own commercial distributions, including Cloudera, Hortonworks, MapR, Huawei, and DKhadoop.
Commercial distributions mainly add professional technical support, which matters most to large enterprises. Each distribution has its own characteristics, and this article gives a brief comparison of three of them.
Distributions compared: DKhadoop, Cloudera, and Hortonworks.
1. DKhadoop distribution: DKhadoop (DKH) integrates all the components of the Hadoop ecosystem, deeply optimizes them, and recompiles them into a single, higher-performance general-purpose big data platform in which the components work together. As a result, DKH can deliver up to 5 times the computing performance of the open source big data platform. DKhadoop also reduces the complex big data cluster configuration to three node types (master node, management node, and compute node), which greatly simplifies cluster management and operation and improves the cluster's availability, maintainability, and stability.
2. Cloudera distribution: CDH is Cloudera's Hadoop distribution. It is completely open source and offers better compatibility, security, and stability than Apache Hadoop.
3. Hortonworks distribution: Hortonworks's flagship product is the Hortonworks Data Platform (HDP), which is also 100% open source. HDP includes all the key components of the stable version of Apache Hadoop, and it is easy to install: it ships with modern, intuitive installation and configuration tools with a graphical user interface.
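The three node types mentioned for DKhadoop (master, management, and compute) can be pictured with a small model. The following is only an illustrative sketch: the class names, the helper functions, the hostnames, and the cluster size are all assumptions made up for this example, and none of it is part of DKhadoop's actual configuration format or tooling.

```python
from collections import Counter
from dataclasses import dataclass
from enum import Enum


class NodeRole(Enum):
    # The three node types named in the DKhadoop description.
    MASTER = "master"
    MANAGEMENT = "management"
    COMPUTE = "compute"


@dataclass(frozen=True)
class Node:
    hostname: str  # hypothetical hostname, for illustration only
    role: NodeRole


def summarize(nodes):
    """Return how many nodes hold each role, including roles with zero nodes."""
    counts = Counter(node.role for node in nodes)
    return {role.value: counts.get(role, 0) for role in NodeRole}


def missing_roles(nodes):
    """Return the names of roles that no node in the cluster fills."""
    return [name for name, count in summarize(nodes).items() if count == 0]


# Hypothetical five-node cluster; the sizes are assumptions, not vendor guidance.
cluster = [
    Node("master-1", NodeRole.MASTER),
    Node("mgmt-1", NodeRole.MANAGEMENT),
    Node("compute-1", NodeRole.COMPUTE),
    Node("compute-2", NodeRole.COMPUTE),
    Node("compute-3", NodeRole.COMPUTE),
]

print(summarize(cluster))
print(missing_roles(cluster))
```

With only three roles to reason about, a check such as `missing_roles` stays trivial, which is the kind of operational simplification the description points to.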