Basic building blocks for Hadoop overview

Source: Internet
Author: User
Tags file system mongodb postgresql redis firewall

This article shows the method of building blocks of hadoop.


The Hadoop overall architecture is a distributed master/from architecture consisting of a set of daemons and a set of host programs, and daemons are: Namenode,datanode,secondary namenode,jobtracker,tasktracker

The Namenode,datanode,secondary namenode is divided into stored process classes, while Jobtracker and Tasktracker are divided into computational process classes.


Namenode:

Namenode is the master node of the Hadoop distributed Storage System (HDFS), which itself does not participate in I/O tasks, but instead gives these tasks to the datanode that it manages. Namenode the file system's metadata is stored in memory.

Datanode:

Datanode is a HDFS node from the Hadoop distributed storage System (slave node), which is responsible for the actual task of reading and writing HDFS blocks (a large file is divided into HDFS block) and continuously reporting status to Namenode.


Secondary Namenode:

Secondary Namenode is a worker process used in the cluster to monitor the state of the HDFs cluster. It is also not the same as namenode that it does not accept and record any real-time changes in HDFs. Instead, it deals only with Namenode, and periodically collects snapshots of HDFs states (snapshot), which are used primarily to restore work when the Namenode fails.

Job Tracker:

Job Tracker is our contact for applications and Hadoop, and when we submit code to the Hadoop cluster, it determines the execution plan, including deciding which files to process, assigning different tasks to each node (which is actually assigned to task Tracker, and then forwarding), and monitor all tasks that are running. This process typically runs on the primary node of the cluster.





Alibaba Cloud Hot Products

Elastic Compute Service (ECS) Dedicated Host (DDH) ApsaraDB RDS for MySQL (RDS) ApsaraDB for PolarDB(PolarDB) AnalyticDB for PostgreSQL (ADB for PG)
AnalyticDB for MySQL(ADB for MySQL) Data Transmission Service (DTS) Server Load Balancer (SLB) Global Accelerator (GA) Cloud Enterprise Network (CEN)
Object Storage Service (OSS) Content Delivery Network (CDN) Short Message Service (SMS) Container Service for Kubernetes (ACK) Data Lake Analytics (DLA)

ApsaraDB for Redis (Redis)

ApsaraDB for MongoDB (MongoDB) NAT Gateway VPN Gateway Cloud Firewall
Anti-DDoS Web Application Firewall (WAF) Log Service DataWorks MaxCompute
Elastic MapReduce (EMR) Elasticsearch

Alibaba Cloud Free Trail

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.