Several Hadoop daemon and Hadoop daemon
After Hadoop is installed, several processes will appear when jps is used.
Master has:
Namenode
SecondaryNameNode
JobTracker
Slaves has
Tasktracker
Datanode
1.NameNode
It is the master server in Hadoop, managing the file system namespace and accessing the files stored in the cluster.
2.SecondaryNameNode
It is not a redundant daemon of namenode, but provides periodic checkpoints and cleanup tasks.
For the sake of scalability and fault toleranceSecondaryNameNodeRun on a non-NameNode.
3.DataNode
It is responsible for managing the storage of connections to nodes (a cluster can have multiple nodes ). Each node that stores data runs a datanode daemon.
4 and 5.JobTrackerAndTaskTracker
JobTrackerSchedulingDataNode. EachDataNodeThere isTaskTracker.
JobTrackerAndTaskTrackerIn master-slave mode,JobTrackerCrossDataNodeDistribution work, whileTaskTrackerRun the task.
JobTrackerAlso checks the request work, ifDataNodeFailed for some reason,JobTrackerThe previous task is rescheduled.