Chapter 3 parallel distributed file system parallel Distributed File System

Source: Internet
Author: User
Chapter 3 the storage size of the search engine of the parallel distributed file system is at least TB. How can we effectively manage and organize these resources? And get results in a very short time? Mapreduce: simplified data processing on large clusters provides a good analysis.

The implementation of the Distributed File System must implement two kinds of critical resource interfaces: one is the ing table from the file name to the namespace, and the other is the block table corresponding to the node machine list. The namespace indicates the ing of file names to a group of machines. The specific hash function may need to look at the namespace size, which is actually a map process. The block table corresponds to the machine list, it is actually a reduce process. It is stored in blocks to the controlled machine group (inodes). In other words, it is the slave-master architecture. The higher the communication mode, the higher the efficiency of the underlying protocol execution. For specific implementation, refer to hadoop.
Of course, there are still many details to consider, such as turning up and turning down of inode machines, identifying these new machines in real time, and automatically deleting the disabled machines from the list; select the number of backup data, load balancing between backups, maintenance of inode configuration files, and virtualization of the file system for end users;

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.