Recently, a question was raised on Quora about the differences between the Hadoop Distributed File System (HDFS) and OpenStack Object Storage.
The original question is as follows:
"HDFS (Hadoop Distributed File System) and OpenStack Object Storage both seem to have similar purposes: to achieve redundant, fast, networked storage. What are the technical features that make the two systems so different? Is it significant that these two storage systems will eventually converge?"
After the question was posted, an OpenStack developer soon responded. This article translates the first two replies for reference.
The first answer comes from Chuck Thier, an OpenStack Swift developer at Rackspace:
Although there are some similarities between HDFS and OpenStack Object Storage (Swift), the overall designs of the two systems are very different.
1. HDFS uses a central system to maintain file metadata (the NameNode), while in Swift the metadata is distributed and replicated across the cluster. For HDFS, the central metadata system amounts to a single point of failure, which makes it harder to scale to very large environments.
2. Swift was designed with a multi-tenant architecture in mind, while HDFS has no concept of multi-tenancy.
3. HDFS is optimized for larger files (which is usually the case when processing data), while Swift is designed to store files of any size.
4. In HDFS, a file is written once and only one writer may write to it at a time, while in Swift a file can be written many times; under concurrent writes, the most recent write wins.
5. HDFS is written in Java, and Swift is written in Python.
In addition, HDFS was designed to store a large number of medium-sized files to support data processing, while Swift was designed as a more general-purpose storage solution that can reliably store a very large number of files of varying sizes.
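The last-write-wins behavior described in point 4 can be illustrated with a toy sketch. This is not Swift's actual implementation; all names here are invented for illustration. The idea is simply that each write carries a timestamp, and a replica keeps only the newest version it has seen:

```python
import time


class ToyObjectStore:
    """Toy model of last-write-wins conflict resolution:
    each write carries a timestamp, and the version with the
    newest timestamp wins (as in Swift's concurrent-write semantics)."""

    def __init__(self):
        self._objects = {}  # object name -> (timestamp, data)

    def put(self, name, data, timestamp=None):
        ts = timestamp if timestamp is not None else time.time()
        current = self._objects.get(name)
        # Accept the write only if it is newer than what we already hold.
        if current is None or ts > current[0]:
            self._objects[name] = (ts, data)

    def get(self, name):
        entry = self._objects.get(name)
        return entry[1] if entry else None


store = ToyObjectStore()
store.put("report.csv", b"v1", timestamp=100.0)
store.put("report.csv", b"v2", timestamp=200.0)    # newer write wins
store.put("report.csv", b"stale", timestamp=150.0)  # older write is ignored
```

After these three writes, `get("report.csv")` returns `b"v2"`: the write with the latest timestamp, not the last one to arrive.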
The second answer comes from Joshua McKenty, chief architect of NASA's Nebula cloud computing project and one of the early developers of OpenStack Nova. He is currently a member of the OpenStack Project Oversight Committee and the founder of piston.cc, a company built on OpenStack.
Chuck gave a detailed description of the technical differences, but did not discuss the possible convergence of the two, a topic that was raised at the OpenStack Design Summit. In short, HDFS is designed so that Hadoop can run MapReduce processing across the objects stored within it. For many OpenStack companies, including my own, supporting this kind of processing on Swift is a goal on the roadmap, but not everyone thinks MapReduce is the answer.
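For readers unfamiliar with the MapReduce model mentioned above, the following is a minimal, framework-free sketch of its map, shuffle, and reduce phases, using the canonical word-count example. All function names are illustrative; a real framework such as Hadoop distributes these phases across machines:

```python
from collections import defaultdict
from itertools import chain


def map_phase(documents, mapper):
    # Apply the user's mapper to each input, yielding (key, value) pairs.
    return chain.from_iterable(mapper(doc) for doc in documents)


def shuffle(pairs):
    # Group all values by key, as the framework does between phases.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups, reducer):
    # Apply the user's reducer once per key.
    return {key: reducer(key, values) for key, values in groups.items()}


# Word count: emit (word, 1) per word, then sum the counts per word.
def mapper(doc):
    return [(word, 1) for word in doc.split()]


def reducer(word, counts):
    return sum(counts)


docs = ["swift stores objects", "hadoop processes objects"]
result = reduce_phase(shuffle(map_phase(docs, mapper)), reducer)
# result["objects"] == 2
```

The point of the debate above is whether this programming model, rather than some other query framework, is the right way to process data held in an object store.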
We have discussed writing a wrapper for HDFS that would support the OpenStack object storage API and allow users to run Hadoop queries against that data. Another option is to use HDFS inside Swift. Neither approach seems ideal.
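The wrapper idea could look roughly like the sketch below: a hypothetical adapter that exposes an object-storage-style put/get interface on top of a filesystem client. Nothing here is a real OpenStack or Hadoop API; `HdfsObjectAdapter` and `InMemoryFs` are invented names, and `InMemoryFs` stands in for an HDFS client purely for demonstration:

```python
import contextlib
import io


class InMemoryFs:
    """In-memory stand-in for a filesystem client (e.g. an HDFS client)
    that offers open(path, mode) as a context manager."""

    def __init__(self):
        self.files = {}  # path -> bytes

    @contextlib.contextmanager
    def open(self, path, mode):
        if "w" in mode:
            buf = io.BytesIO()
            yield buf
            self.files[path] = buf.getvalue()
        else:
            yield io.BytesIO(self.files[path])


class HdfsObjectAdapter:
    """Hypothetical wrapper mapping an object-storage API
    (container + object name) onto filesystem paths."""

    def __init__(self, fs_client, root="/objects"):
        self.fs = fs_client
        self.root = root

    def _path(self, container, name):
        return f"{self.root}/{container}/{name}"

    def put_object(self, container, name, data):
        # HDFS files are write-once, so a "put" is a full rewrite.
        with self.fs.open(self._path(container, name), "wb") as f:
            f.write(data)

    def get_object(self, container, name):
        with self.fs.open(self._path(container, name), "rb") as f:
            return f.read()


fs = InMemoryFs()
adapter = HdfsObjectAdapter(fs)
adapter.put_object("logs", "a.txt", b"hello")
```

The awkward part, and one reason neither approach seems ideal, is that the object API's overwrite semantics must be emulated on top of HDFS's write-once files.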
The OpenStack community is also doing some research and development in this area, carefully studying alternative MapReduce frameworks (Riak, CouchDB, and others).
Finally, there are other storage projects currently "affiliated" with the OpenStack community (Sheepdog and HC2). Taking full advantage of data locality and making object storage "smarter" is an area where progress can be expected.