HDFS (Hadoop Distributed File System) is the core subproject of the Hadoop project and the basis of data storage management in distributed computing. Frankly speaking, HDFS is a good distributed file system with many advantages, but it also has some disadvantages: it is not suited to low-latency data access, it does not store large numbers of small files efficiently, and it does not support multiple concurrent writers or arbitrary modification of files.
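To make the small-files drawback concrete, here is a rough back-of-the-envelope sketch. It assumes a commonly quoted rule of thumb that is not stated in this article: the master node holds about 150 bytes of memory per file or block object, and the block size is 128 MiB. Both figures and all function names below are illustrative assumptions, not measured values.

```python
# Rough estimate of master-node memory needed for file metadata.
# ASSUMPTIONS (not from the article): about 150 bytes of heap per
# namespace object (one per file, one per block) and 128 MiB blocks.
BYTES_PER_OBJECT = 150
BLOCK_SIZE = 128 * 1024 * 1024

MIB = 1024 ** 2
GIB = 1024 ** 3
TIB = 1024 ** 4


def ceil_div(a, b):
    """Integer division rounded up, without going through floats."""
    return -(-a // b)


def namespace_objects(total_bytes, file_size, block_size=BLOCK_SIZE):
    """Count file objects plus block objects for equally sized files."""
    num_files = ceil_div(total_bytes, file_size)
    blocks_per_file = ceil_div(file_size, block_size)
    return num_files * (1 + blocks_per_file)


def metadata_heap_bytes(total_bytes, file_size):
    """Estimated metadata memory under the assumed per-object cost."""
    return namespace_objects(total_bytes, file_size) * BYTES_PER_OBJECT


if __name__ == "__main__":
    for label, size in (("1 MiB files", MIB), ("1 GiB files", GIB)):
        heap = metadata_heap_bytes(TIB, size)
        print(f"1 TiB as {label}: ~{heap / MIB:.1f} MiB of metadata")
```

Under these assumptions, the same 1 TiB of data costs the master node a few hundred times more metadata memory when it is stored as 1 MiB files than when it is stored as 1 GiB files, which is why many small files are a poor fit.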
Ever since it joined the Apache Software Foundation, HDFS has been looking for ways to improve its performance and usability. Frankly, it may be better suited to pilot projects, unconventional projects, and less demanding environments. Some Hadoop users, however, have high requirements for performance, usability, and enterprise-class features, and are weighing its direct attached storage (DAS) architecture. For them, especially since older versions of Hadoop lack a highly available master node, the following 8 products are alternatives to HDFS.
1. Cassandra (DataStax)
Not a full file system, but an open source NoSQL key-value store. It offers an alternative to HDFS for web applications that rely on fast data access. In short, DataStax blends Hadoop with Cassandra, so that web applications can quickly access data through Hadoop, and Hadoop can quickly access data flowing into Cassandra.
2. Ceph
Ceph is an open source, multi-pronged storage system. Because of its high-performance parallel file system features, some even consider it the successor to HDFS in Hadoop environments, and researchers have been exploring it in that role since 2010.
3. Cleversafe: Decentralized storage network
This week Cleversafe announced the integration of Hadoop's parallel programming technology with its own decentralized storage network. The principle is to distribute metadata across the whole cluster, relying neither on a single master node nor on replication. Cleversafe says the result is faster, more stable, and more scalable than HDFS.
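The general idea of spreading metadata across a cluster instead of keeping it on one master can be sketched as follows. This is not Cleversafe's implementation, which the article does not describe; it is a minimal illustration that assumes metadata for a path is assigned to a node by hashing the path. The node names and function names are hypothetical.

```python
import hashlib

# Hypothetical node names, for illustration only.
NODES = ["node-a", "node-b", "node-c", "node-d"]


def metadata_owner(path, nodes=NODES):
    """Pick the node that holds metadata for `path` by hashing the path.

    Any client can compute the owner locally, so no central master
    has to be consulted for the lookup.
    """
    digest = hashlib.sha256(path.encode("utf-8")).digest()
    index = int.from_bytes(digest[:8], "big") % len(nodes)
    return nodes[index]


def single_master_owner(path, master="master-0"):
    """Contrast case: every lookup goes to the same master node."""
    return master


if __name__ == "__main__":
    for p in ("/logs/2012/01/app.log", "/data/users.csv", "/tmp/x"):
        print(p, "->", metadata_owner(p), "vs", single_master_owner(p))
```

Simple modulo hashing like this remaps most paths whenever the number of nodes changes; real systems typically use consistent hashing or a similar scheme to limit that movement.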
4. GPFS (IBM)
IBM has long sold its parallel file system to high-performance computing users, including operators of the world's fastest supercomputers. In 2010 it launched a version of GPFS for Hadoop, and announced that the GPFS shared-nothing cluster version is much faster than HDFS because it runs at the kernel level rather than on top of the operating system, as HDFS does.
5. Isilon (EMC)
EMC had been shipping its own Hadoop distribution for a year, but in January 2012 it introduced a new enterprise-level HDFS solution built on the OneFS file system: Isilon. Because Isilon supports the NFS, CIFS, and HDFS protocols, a single Isilon NAS system can ingest, process, and analyze data.
6. Lustre
The HPC storage provider Xyratex wrote in a 2011 report that Lustre clusters are faster and cheaper than HDFS-based clusters.
7. MapR File System
The MapR file system is well known in the industry. MapR claims that its file system is 2-5 times faster than HDFS (or even 20 times), and it also offers mirroring, snapshots, and high performance, features that enterprise users like.
8. NetApp Hadoop Open Solution
NetApp has reworked the physical Hadoop architecture by placing HDFS on disk arrays, which it says makes Hadoop workloads faster, more stable, and more secure.