Misunderstanding: hadoop = cloud computing
Author: chszs, reprinted with note.
Author blog homepage: http://blog.csdn.net/chszs
Time: 2012-12-9
Recently, I saw a new book published by Tsinghua University Press. Its publication date is, as shown in 1. I think this is a science-oriented book that deepens users' understanding of cloud computing. After all, cloud computing covers a wide range of fields.
This is not the case after extensive reading. Basically, the book focuses on tools and technologies in hadoop and hadoop ecosystems. 2:
I think that some people in the industry have had technical exchanges some time ago. When talking about cloud computing, what about hadoop? How about mapreduce? In my opinion, there is a misunderstanding.
I would like to emphasize that hadoop is not cloud computing
Currently, cloud computing is not widely defined, and NIST (National Institute of Standards and Technology) is widely recognized for its definition of cloud computing. NIST defines cloud computing in this way.
Cloud computing is a pay-as-you-go mode. This mode provides available, convenient, and on-demand network access and uses configurable computing resource sharing pools, resources in the Shared Pool include networks, servers, storage, application software, and services. These resources can be quickly provided to customers through the cloud computing platform, with little investment in management, or very little interaction with service providers.
Cloud computing is the product of the integrated development of distributed computing, parallel computing, grid computing, multi-core computing, network storage, virtualization, Server Load balancer, and other traditional computer and Internet technologies. Provides powerful computing capabilities to end users using IAAs, paas, SAAS, and other business models.
Cloud computing is called the fourth IT Industry Revolution following large-scale computers, personal computers, and the Internet, it will become the basis for driving the strong growth of it, Iot, e-commerce and many other industries and promoting the overall upgrade of the information industry.
Hadoop is an open-source software under the Apache Foundation. It implements the basic architecture of cloud computing software platforms, including distributed file systems and mapreduce frameworks, it also integrates a series of platforms, including databases, cloud computing management, and data warehousing. It has become a standard platform for cloud computing applications and research in the industry and academia.
If hadoop is strictly defined, hadoop is an open-source distributed computing platform for big data processing and analysis. It is an open-source Java-based software architecture that runs distributed and data-intensive applications.
Therefore, hadoop is only one of the PAAs layer solutions for cloud computing. It is not equivalent to paas, not to mention cloud computing. Figure 3 clearly divides the delivery mode layers of cloud computing, as shown below:
Therefore, it is wrong to make a conceptual mistake to equate hadoop with cloud computing, or to compare hadoop with cloud computing. If you change the title to "in-depth big data processing" or "in-depth massive book processing", it is more appropriate.