A preliminary understanding of Hadoop

Source: Internet
Author: User
Keywords nbsp Understanding Preliminary name point on
A preliminary understanding of Hadoop Blog Category: Cloud computing Hadoopmapreducegoogle Framework &http://www.aliyun.com/zixun/aggregation/37954.html ">nbsp;

What the hell is 1:hadoop?

He is a solution, a distributed processing solution that can handle large amounts of data, and is a copycat derivative of Google.

It is the use of Google published MapReduce paper writing into models and frameworks. He mainly divides large tasks into smaller tasks, and gives these small tasks to a single point of execution on the cluster.

What is job, in MapReduce, an application that prepares to commit execution is called a job (job, like a project), and the job, too large, is split into N, executed on each node of the computer, which is called a task.

The Distributed File System (HDFS) provided by Hadoop is mainly to handle storage on each node and achieve high throughput data compilation.

Simply put, it's a resource's storage, and a resource's lookup.

Hadoop uses a master/Master/slave architecture for distributed storage and distributed computing. There is a series of backstage (Deamon) programs. Different background programs play different roles, these roles: Namenode Secondarynamenode,jobtracker,tasktracker,datanode, these names, as long as you touch Hadoop will see, On the master node, the main ones are Namenode,secondarynamenode,jobtracker, and the slave node is mainly made up of datanode,tasktracker.

The

         master node depends on the size of the system for different deployments. When Master is large, the Namenode and Secondarynamenode nodes in master, and Jobtracker allocations are deployed on two servers.

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.