What is the development history of Hadoop?
Hadoop originated from MapReduce, a programming model published by Google. The MapReduce framework decomposes an application into many small parallel computing tasks that process very large datasets across a large number of compute nodes. A typical application of this framework is a search-indexing algorithm that runs over web data.
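To make the model concrete, the following is a minimal word-count sketch using Hadoop's Java MapReduce API (the class name and the input/output paths passed on the command line are illustrative, not part of the original text). The map phase emits a count of one for every word it sees, and the reduce phase sums those counts per word, so the work spreads naturally across many nodes.

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

    // Map phase: split each input line into words and emit (word, 1) pairs.
    public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> {
        private static final IntWritable ONE = new IntWritable(1);
        private final Text word = new Text();

        @Override
        public void map(Object key, Text value, Context context)
                throws IOException, InterruptedException {
            StringTokenizer itr = new StringTokenizer(value.toString());
            while (itr.hasMoreTokens()) {
                word.set(itr.nextToken());
                context.write(word, ONE);
            }
        }
    }

    // Reduce phase: sum the counts emitted for each word.
    public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
        private final IntWritable result = new IntWritable();

        @Override
        public void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable val : values) {
                sum += val.get();
            }
            result.set(sum);
            context.write(key, result);
        }
    }

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "word count");
        job.setJarByClass(WordCount.class);
        job.setMapperClass(TokenizerMapper.class);
        job.setCombinerClass(IntSumReducer.class);   // local pre-aggregation on each node
        job.setReducerClass(IntSumReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));    // e.g. an HDFS input directory
        FileOutputFormat.setOutputPath(job, new Path(args[1]));  // output directory (must not already exist)
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}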
Hadoop was initially associated with web indexing and rapidly developed into a leading platform for big data analytics. Cloudera, an enterprise software company, began providing Hadoop-based software and services in 2008.
GoGrid, a cloud computing infrastructure company, partnered with Cloudera in 2012 to accelerate the adoption of Hadoop-based applications. Dataguise, a data security firm, also launched a data protection and risk assessment tool for Hadoop in 2012.
Apache Hadoop Supporting Projects
The Apache Software Foundation maintains several supporting projects for Hadoop:
· Apache Cassandra is a database management system designed for handling large volumes of data. Its key features are fault tolerance, scalability, Hadoop integration, and replication support.
· HBase is a non-relational, fault-tolerant, distributed database designed to store large amounts of sparse data.
· Hive is a data warehouse system built on Hadoop that supports data summarization and ad hoc querying of large datasets.
· Apache Pig consists of a high-level language for writing data analysis programs, together with the infrastructure for evaluating those programs.
· Apache ZooKeeper is a centralized service for distributed applications. It maintains configuration information and provides naming registration, distributed synchronization, and group services (see the sketch after this list).
· Chukwa is a data collection system for monitoring large distributed systems, and it includes a toolkit for analyzing the results it collects.
· The Apache Mahout project is designed to produce scalable machine learning algorithm implementations on the Hadoop platform.
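As an illustration of ZooKeeper's configuration-management role mentioned above, the following is a minimal sketch using its Java client API; the connection string, znode path, and stored value are assumptions chosen for the example, not details from the original text. One process publishes a setting as a persistent znode, and any other process connected to the same ensemble can read it back.

import java.nio.charset.StandardCharsets;
import java.util.concurrent.CountDownLatch;

import org.apache.zookeeper.CreateMode;
import org.apache.zookeeper.Watcher;
import org.apache.zookeeper.ZooDefs;
import org.apache.zookeeper.ZooKeeper;

public class ConfigExample {
    public static void main(String[] args) throws Exception {
        // The ZooKeeper constructor returns immediately; wait for the session to connect.
        CountDownLatch connected = new CountDownLatch(1);
        ZooKeeper zk = new ZooKeeper("localhost:2181", 10_000, event -> {
            if (event.getState() == Watcher.Event.KeeperState.SyncConnected) {
                connected.countDown();
            }
        });
        connected.await();

        // Publish a configuration value as a persistent znode (path and value are illustrative).
        String path = "/batch-size";
        byte[] value = "128".getBytes(StandardCharsets.UTF_8);
        if (zk.exists(path, false) == null) {
            zk.create(path, value, ZooDefs.Ids.OPEN_ACL_UNSAFE, CreateMode.PERSISTENT);
        }

        // Any client of the same ensemble can read the same value back.
        byte[] stored = zk.getData(path, false, null);
        System.out.println("batch-size = " + new String(stored, StandardCharsets.UTF_8));

        zk.close();
    }
}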