Nfs principles and protocols of the nfs Module source code analysis (I) of Glusterfs

Source: Internet
Author: User
Tags ftp protocol nfsd glusterfs
Document directory
  • I. Network File System Overview

My Sina Weibo: http://weibo.com/freshairbrucewoo.

You are welcome to exchange ideas and improve your technology together.

 

I. Network File System Overview

 

Sun Microsystems launched a widely accepted Remote File Access Mechanism throughout the computer industry in 1984, known as Sun's Network File System ), or NFS for short. This mechanism allows a server to run on a computer so that some or all files on it can be remotely accessed and applications on other computers can access these files.

It enables file sharing. When users want to use remote files, they only need to use "mount" to connect the remote file system to their own file system, so that the remote file operations are no different from those on local machines. An application can Open (Open) a remote file for access. It can Read (Read) data from the file, Write (Write) data to the file, and locate (Seek) to a specified position (START, end, or other place) in the file, and Close the file after use. These operations are transparent to programmers, and the operation methods are the same as those for local files.

Ii. NFS Protocol

The NFS protocol uses NFS, and the client can transparently access the file system on the server, which is different from the FTP protocol that provides file transmission. FTP generates a complete copy of the file. NFS only accesses the part of the file referenced by one process, and the purpose is to make the access transparent. This means that any client program that can access a local file can access an NFS file without any modification. NFS is a client/server application constructed using SunRPC. Its client sends an RPC request to an NFS server to access the files. Although this work can be implemented by a general user process, that is, the NFS client can be a user process that explicitly calls the server, and the server can also be a user process. NFS is generally not implemented in this way for two reasons. First, access to an NFS file must be transparent to the client. Therefore, NFS client calls are performed by the client operating system on behalf of the user process. Secondly, for efficiency considerations, the NFS server is implemented in the server operating system. If the NFS server is a user process, each client request and server response (including read and write data) will have to switch between the kernel and the user process, which is too costly. The NFS 3rd Protocol was released in 1993, as shown in a typical structure of an NFS client and an NFS server.

 

Figure 1 Typical Structure of the NFS client and NFS server

(1) access to a local file or an NFS file is transparent to the client. It is determined by the kernel when the file is opened. After the file is opened, the kernel passes all references to the local file to the "local file access" box, pass all references to an NFS file to the box named "NFS client.

(2) the NFS client sends an RPC request to the NFS server through its TCP/IP module. NFS mainly uses UDP, and the latest implementation can also use TCP.

(3) the NFS server receives client requests as UDP packets on port 2049, although NFS can be implemented as a port er, allowing the server to use a temporary port, however, most implementations directly specify UDP port 2049.

(4) When the NFS server receives a client request, it passes the request to the local file access routine and then accesses a local disk file on the server host.

(5) It takes a certain amount of time for the NFS server to process a client request. It usually takes some time to access the local file system. During this interval, the server should not block requests from other clients. To implement this function, most NFS servers are multi-threaded. In fact, multiple NFS servers are running in the NFS lock management program, the specific implementation depends on different operating systems. Since most UNIX kernels are not multi-threaded, a common technology is to start multiple instances of a user process (often referred to as "nfsd. This instance executes a system call and keeps it as a kernel process in the kernel of the operating system.

(6) On the client host, it takes some time for the NFS client to process requests from a user process. The NFS client sends an RPC call to the server host and waits for the server to respond. To provide more concurrency for user processes on NFS client hosts, multiple NFS clients are generally run in the client kernel, and the specific implementation also depends on the operating system.

Iii. How NFS works and the role of service processes

In Linux, NFS and service processes are two different concepts, but they are closely linked. First, introduce how NFS works.

Section 1. How NFS works

When the NFS file server is started,/etc/rc. local will automatically start the exportfs program to specify the files or directories that can be exported, and the files or directories that can be mounted can only be specified.
NFS is based on the XDR/RPC protocol. XDR (eXternal Data Representation, eXternal Data Representation) provides a way to convert Data from one format to another standard Data format Representation, make sure that all data represents the same meaning in different computers, operating systems, and programming languages.
RPC (Remote Procedure Call, called by a Remote program) requests Remote computers to provide services. The client sends RPC to a remote computer over the network to request services.
NFS uses RPC to send data in the following steps:
(1) The customer sends information and requests the service.
(2) The customer placeholder program converts the parameters sent by the customer into the XDR standard format and sends the information to the network using system calls.
(3) information is sent to the remote host system through the network.
(4) the remote host transmits the received information to the server placeholder program.
(5) convert the XDR data to a host-based format, retrieve the service request parameters sent by the customer, and send them to the server.
(6) The reverse transmission process of Services sent by the server to the customer.

Section II. Roles of service processes

A service process is a program that automatically runs after the system starts the computer, including network connection, network protocol loading, graphic desktop display, and file system loading, common processes in Linux include the following.
(1) nfsd
According to the requirements of the client on the file system, start the file system to request the service process and respond to the customer's request. Generally, the number of file system request service processes is 8, which is also in the rc. the reason why nfsd 8 is written in local.
(2) biod
This process is used on the NFS client to start an asynchronous block I/O service process to create a Buffer Cache for reading and writing on the client. (3) mountd
This is an RPC server. After the rpc. mountd service process is started, mountd reads/etc/xtab to check which client is mounting the file system and responds to the path to be mounted by the client.
(4) inetd Internet services Service Process
When the system starts, rc. local starts inetd to read the inetd. conf configuration file, read the addresses of all servers on the network, and start all servers in inetd. conf. When the client requests the service, inetd starts the relevant service process. For example, when the user uses telnet, inetd starts telnetd to meet the user's requirements for telnet, for other applications such as ftp, finger, and rlogin, inetd also starts the corresponding service programs ftpd, fingerd, and rloingd.
(5) portmap service program
The main function is to convert the port number of the TCP/IP communication protocol to the number of the RPC program, because the client can call the RPC program. Generally, the RPC server is started by inet, so portmap must be started before inetd. Otherwise, RPC cannot be called.

Iv. RPC for NFS servers

NFS supports many functions, and different functions are started using different programs. Each time you start a function, some ports are enabled to transmit data. Therefore, the ports corresponding to the NFS function are not fixed, instead, it uses random access to unused ports less than 724 for transmission. In this case, the client needs to connect to the server because the client needs to know the port of the server to connect to the server. In this case, we need to remotely call the (RPC) service. The main function of RPC is to specify the port number corresponding to each NFS function and return it to the client so that the client can connect to the correct port. When the server starts NFS, it selects several ports randomly and actively registers with RPC. Therefore, RPC can understand the NFS function of each port. Then RPC uses port 111 to listen to client requests and return the correct port of the client, which makes NFS startup easier. Note: Before starting NFS, you must start RPC first; otherwise, NFS cannot register with RPC. In addition, when RPC is restarted, the data originally registered will not be seen. Therefore, after RPC is restarted, all programs managed by RPC need to be restarted to register with RPC.
When the client has an NFS file to access the request, how does it request data from the server?
(1) the client sends an NFS file access request to the server RPC (port 111.
(2) After the server finds the corresponding registered NFS daemon port, it returns it to the client.
(3) After the client understands the correct port, it can directly connect to the NFS daemon.
Since all NFS functions must be registered with RPC, RPC can understand the port number, PID, and IP address of NFS functions on the host, the client can find the correct port through RPC inquiry. That is to say, NFS can successfully provide services only when RPC exists. Therefore, NFS is called an RPC Server. In fact, many such servers are registered with RPC. For example, NIS (Network Information Service) is also a type of RPC Server. As shown in, RPC must be started to use NFS on both the client and server.

 

 

Figure 2 correlation between NFS and RPC services and Operating Systems

 

 

The NFS protocol has been available in multiple versions since its birth, such as NFS V2 (rfc794) and NFS V3 (rfc1813) (the latest version is V4 (rfc307 )). At first, SUN designed NFS V2 to only use UDP, mainly because of the influence of the memory, network speed and CPU of the machines at that time, and had to choose a method with a lighter burden on the machines. In NFS V3, SUN chose TCP as the default transmission mode. The main differences between V3 and V2 are as follows:
(1) file size: V2 supports a file size of up to 32 bits (4 GB), while V3 adds a technology that supports 64-bit file size.
(2) File Transfer size: V3 does not limit the transfer size. V2 can only be set to 8 KB at most. You can use-rsize and-wsize to set it.
(3) return complete information: V3 adds and improves returned errors and success information, which can bring great benefits to server settings and management.
(4) added support for TCP transmission protocol: V2 only provides support for UDP, which is limited in some demanding network environments; V3 adds support for TCP. UDP features fast transmission speed and non-connection transmission convenience, but it is not stable over TCP during transmission. When the network is unstable or hackers intrude into the network, NFS performance can be greatly reduced or even paralyzed. Therefore, for different situations, the network must select a specific transmission protocol. The default NFS transmission protocol is UDP, but the RHEL 4.0 Kernel provides support for NFS over TCP. To use NFS over TCP, mounting the file system exported from NFS on the client system includes a "-o tcp" option. The advantages and disadvantages of using TCP are as follows:
1) The Connection persistence is improved, so the NFS stale file handles messages obtained are less.
2) The performance of the network with a large load will be improved, because TCP confirms each group, while UDP only confirms it when it is completed.
3) TCP has the congestion control technology (UDP does not exist at all). In a network with severe congestion, UDP groups are the first type to be revoked. Using UDP means that if NFS is writing data (the unit is 8 KB), all the 8 KB data needs to be re-transmitted. Due to the reliability of TCP, only a portion of 8 KB data needs to be re-transmitted.
4) error detection. When the TCP connection is interrupted (because the server is stopped), the client stops sending data and starts to reconnect. UDP is connectionless, and the client that uses it will continue to send data to the network until the server goes online again.
5) The cost of TCP is not significantly improved in terms of performance.
(5) asynchronous write feature.
(6) improved the server's mount performance.
(7) Better I/O write performance.
(8) enhanced network operation efficiency, making network operation more effective.
(9) Stronger disaster recovery functions.

In Linux, UDP is the default protocol. There is no choice as a server. However, as a client, you can use TCP to interconnect with other unix nfs servers that use TCP. It is better to use UDP in the LAN because the LAN has stable network guarantee. UDP can provide better performance. V2 is used by default in Linux, but nfsvers = n of mount option can also be used. NFS uses the protocols and services provided by TCP/IP to run on the application layer of the OSI layered model, as shown in table 1.

Table 1 NFS on the OSI layered model

 

Number of Layers

Name

Merit

1

Application Layer

NFS

2

Presentation Layer

XDR

3

Session Layer

RPC

4

Transport Layer

UDP, TCP

5

Network Layer

IP

6

Data Link Layer

 

7

Physical Layer

Ethernet

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.