Hadoop crisis? 8 Excellent alternatives to HDFs

Source: Internet
Author: User
Keywords Dfs dfs excellent DFS excellent high-performance DFS excellent high-performance because DFS excellent high-performance because through
HDFS (Hadoop distributed http://www.aliyun.com/zixun/aggregation/19352.html ">file System") is the core subproject of the Hadoop project, Is the basis of data storage management in distributed computing, frankly speaking HDFs is a good distributed file system, it has many advantages, but there are some disadvantages, including: not suitable for low latency data access, not efficient storage of large number of small files, do not support multi-user writing and arbitrary modification of files.

When the Apache Software Foundation was established, HDFs was looking for ways to improve its performance and usability, and frankly, it might be more appropriate for pilot projects, unconventional projects, and less demanding environments, but for some Hadoop users, they are for performance, usability, With the high requirements of enterprise-class features and a focus on direct attached storage (DAS) architecture, especially when older versions of Hadoop do not have high-performance master nodes, the next 8 products are the perfect alternative to HDFs.

1. Cassandra (DataStax)

Not a full file system, but an open source, NoSQL key value (Key-value) store. This gives a hdfs choice of Web applications that rely on fast data access. In short, it blends Hadoop into the Cassandra, enabling Web applications to quickly access data through Hadoop, and Hadoop can quickly access data flowing into Cassandra.

2. Ceph

Ceph is an open source, multi-pronged operating system, because of its high-performance parallel file system features, some even think it is based on Hadoop Environment HDFs successor, because since 2010 researchers have been looking for this feature.

3. Cleversafe: Decentralized storage network

This week Cleversafe announced the integration of Hadoop's parallel programming technology and its own decentralized storage network. The principle is that by distributing the entire metadata in a cluster (not relying on a single master node, not relying on replication), Cleversafe says it is faster, more stable, and more scalable than HDFs.

4. GPFS (IBM)

IBM has been selling its parallel file systems to high-performance users, including the world's fastest supercomputer, 2010 years after it launched the GPFS based on Hadoop, and announced that GPFS does not share the cluster version much faster than Hadoop because it runs at the kernel level, Rather than running in an operating system such as HDFs.

5. Isilon (EMC)

EMC has been delivering the Hadoop release for a year, but in January 2012 it was transformed into the Onefs file system of the new HDFS enterprise-level solution--isilon. Because Isilon can read NFS, CIFS, and HDFS protocols, a separate Isilon NAS system can ingest, process, and analyze data.

6. Lustre

The HPC storage provider, Xyratex, wrote in a 2011 report that lustre clusters are faster and cheaper than HDFS based clusters.

7. mapr File System

MAPR file system has a certain reputation in the industry, not only MAPR announced its own file system faster than HDFs 2-5 times (actually 20 times times), it also has mirrors, snapshots, high-performance these corporate users like features.

8. NetApp Hadoop Open Solution

NetApp has revamped the physical Hadoop architecture by placing HDFs in the disk array, which enables faster, more stable, and more secure Hadoop work.

(Responsible editor: The good of the Legacy)

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.