MooseFS source code analysis (2)-mfsmount Module

Last Update:2018-12-08 Source: Internet

Author: User

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

1. fuse Introduction

Fuse must be installed on the client. Mfsmount uses the fuse module to have the same operation portal as file operations.

Fuse provides an interface for file operations. fuse then listens to file operations on this interface, and pass these operations on the file to our own code for processing. That is to say, fuse provides a mount point and then listens to it. Once an operation (new, read, or modify) occurs, it gives control to the code, the code for these operations is implemented in the mfsmount module.

The fuse_lowlevel_ops structure in fuse is used to specify custom file operations. The structure defined in mfsmount is as follows:

1. mfsmount Module main. c code snippet

Static struct fuse_lowlevel_ops mfs_servers = {
. Statfs = mfs_statfs,
. Lookup = mfs_lookup,
. Getattr = mfs_getattr,
. Setattr = mfs_setattr,
. Mknod = mfs_mknod,
. Unlink = mfs_unlink,
. Mkdir = mfs_mkdir,
. Rmdir = mfs_rmdir,
. Symlink = mfs_symlink,
. Readlink = mfs_readlink,
. Rename = mfs_rename,
. Link = mfs_link,
. Opendir = mfs_opendir,
. Readdir = mfs_readdir,
. Releasedir = mfs_releasedir,
. Create = mfs_create,
. Open = mfs_open,
. Release = mfs_release,
. Flush = mfs_flush,
. Fsync = mfs_fsync,
. Read = mfs_read,
. Write = mfs_write,
. Access = mfs_access,
};

Mfs_statfs, mfs_lookup... Mfs_write and mfs_access are functions implemented by mfsmount. There are 23 file operations in total, and mfsmount must be implemented by yourself. When you perform file operations on the mount point registered by fuse (corresponding to/mnt/mfs by default), fuse uses the mfsmount implementation to perform these operations. For example, if we create a text.txt file in/mnt/mfs, fuse will call mfs_create to perform the operation (of course, the file name and other necessary parameters must be passed ), fuse does not care where our code is used to create specific files.

Let's take a look at the definitions of these file operation functions in the header file in mfsmount:

Mfs_fuse.h code snippet

Void mfs_access (fuse_req_t req, fuse_ino_t ino, int mask );
Void mfs_lookup (fuse_req_t req, fuse_ino_t parent, const char * name );
Void mfs_getattr (fuse_req_t req, fuse_ino_t ino, struct fuse_file_info * fi );
Void mfs_setattr (fuse_req_t req, fuse_ino_t ino, struct stat * stbuf, int to_set, struct fuse_file_info * fi );
Void mfs_mknod (fuse_req_t req, fuse_ino_t parent, const char * name, mode_t mode, dev_t rdev );
Void mfs_unlink (fuse_req_t req, fuse_ino_t parent, const char * name );
Void mfs_mkdir (fuse_req_t req, fuse_ino_t parent, const char * name, mode_t mode );
Void mfs_rmdir (fuse_req_t req, fuse_ino_t parent, const char * name );
Void mfs_symlink (fuse_req_t req, const char * path, fuse_ino_t parent, const char * name );
Void mfs_readlink (fuse_req_t req, fuse_ino_t ino );
Void mfs_rename (fuse_req_t req, fuse_ino_t parent, const char * name, fuse_ino_t newparent, const char * newname );
Void mfs_link (fuse_req_t req, fuse_ino_t ino, fuse_ino_t newparent, const char * newname );
Void mfs_opendir (fuse_req_t req, fuse_ino_t ino, struct fuse_file_info * fi );

Note that each function has a fuse_ino_t type parameter ino. Each file corresponds to a unique ino value. This value is required for fuse. In the directory structure of the file (in the masterserver metadata), this ino is used as the identifier of the file and folder. In the process of creating files and folders, the directory structure data is stored on masterserver, and the client does not have the data. The ino of the root directory (/mnt/mfs) is set to 0.

Take deleting a directory as an example to explain the code. The following function is the function corresponding to the "delete directory" Operation registered in the mfsmount module:

2. Delete the corresponding function in the directory

Void mfs_rmdir (fuse_req_t req, fuse_ino_t parent, const char * name ){
Uint32_t nleng;
Int status;
Const struct fuse_ctx * ctx;
/*...........*/
Ctx = fuse_req_ctx (req );
Status = fs_rmdir (parent, nleng, (const uint8_t *) name, ctx-> uid, ctx-> gid );
Status = mfs_errorconv (status );
If (status! = 0 ){
Fuse_reply_err (req, status );
} Else {
Fuse_reply_err (req, 0 );
}
}

The parameters of the mfs_rmdir function are transmitted by fuse. Note that the fs_rmdir of Line 1 will communicate with the masterserver to perform the real deletion operation.

In short, fuse is used to provide an intermediate layer to capture the file operations on the user (or program) on the mount point and forward them to mfsmount's own code for processing, then, the processing structure is fed back to the user. For users, reading and writing files on this mount point is the same as reading and writing files in normal folders, and users do not feel any difference.

2. Working Process of mfsmount

▲After mfsmount is started, the command line parameters and configurations are read first.

▲Use fuse to define the mount point and correspond to the file operation function implemented by mfsmount.

▲Set the connection parameters to masterserver.

▲Start thread fs_nop_thread is used to maintain the connection status with masterserver. Fs_receive_thread connects to the masterserver for communication, registers on the masterserver, and performs initialization.

▲Start to respond to the input from the client (transmitted by fuse ).

The following describes the process of responding to user input. input is the operations we perform to create and modify files. These operations are handed over to specific functions defined in fuse_lowlevel_ops, communicate with masterserver in these specific functions.

Each time you perform a file operation, a threc type data is generated:

Threc Structure

Typedef struct _ threc {
Pthread_t thid;
Pthread_mutex_t mutex;
Pthread_cond_t cond;
Uint8_t * buff;
Uint32_t buffsize;
Uint8_t sent;
Uint8_t status;
Uint8_t release; // cond variable
Uint32_t size;
Uint32_t cmd;
Uint32_t packetid;
Struct _ threc * next;
} Threc;

There are mainly some status fields, the most important of which is buff. The data in the buff will be sent to masterserver:

The first 12 bytes of a buff are three integers. The following space is used to store the specific sending information. The buff content is as follows:

▲ 1st ~ 4 bytes (store an integer) is a specific command that describes what data needs to be taken.

▲ 5th ~ 8 bytes (an integer), the size of the data sent.

▲ 9th ~ 12 bytes (containing an integer) indicate packetid, which is unique and corresponds to each operation.

▲ 13th ~ ** Bytes, with a length of 5th ~ 8 bytes.

Let's take a look at the code snippet when creating a directory (fs_makedir:

Fs_makedir code snippet

...
Ptr = fs_createpacket (rec, CUTOMA_FUSE_MKDIR, 15 + nleng );
PUT32BIT (parent, ptr );
PUT8BIT (nleng, ptr );
Memcpy (ptr, name, nleng );
Ptr + = nleng;
PUT16BIT (mode, ptr );
PUT32BIT (uid, ptr );
PUT32BIT (gid, ptr );
Ptr = fs_sendandreceive (rec, MATOCU_FUSE_MKDIR, & I );
...

In the above 2nd lines of code, fs_createpacket will create a threc structure and set its buffer size to 15 + nleng + 12 bytes.

The following 12 bytes are fixed as the buffer header, which is filled in when the threc structure is created, as described earlier in the threc structure.

The content of the 15 + nleng bytes is the content sent to masterserver from the following lines 3rd, 4, 5, 7, 8, and 9. For different operations, the content sent to masterserver is different. The data returned by masterserver is also stored in the threc structure. The following is the content of the buffer in the threc structure corresponding to the Directory Creation operation when sending data to the masterserver:

After the data is returned from the masterserver, the data is retrieved from the threc structure and the folder parameters are handed over to fuse. The rest is that fuse uses these parameters to do its own thing (for example, display information to the user ).

3. Processing of reading and writing file data

Previously, when introducing fuse, I mentioned that mfsmount will define its own file operations. The read and write operations are special in dealing with actual data. Except read and write, all other operations only need to obtain data from masterserver. The read and write operations must first communicate with the masterserver to obtain the address and port of the chunkserver where the data resides, and then read and write data from the chunkserver. The threc structure is still used for communication between read and write and masterserver.

Obtain the IP address and Port of the chunkserver from the masterserver. This process is similar to other non-read and write operations. The main code is as follows:

Fs_readchunk code snippet

Ptr = fs_createpacket (rec, CUTOMA_FUSE_READ_CHUNK, 8 );
PUT32BIT (inode, ptr );
PUT32BIT (indx, ptr );
Ptr = fs_sendandreceive (rec, MATOCU_FUSE_READ_CHUNK, & I );
//......
GET64BIT (t64, ptr );
* Length = t64;
GET64BIT (t64, ptr );
* Chunkid = t64;
GET32BIT (t32, ptr );
* Version = t32;
If (I = 20 ){
* Csip = 0;
* Csport = 0;
} Else {
GET32BIT (t32, ptr );
* Csip = t32;
GET16BIT (t16, ptr );
* Csport = t16;
}

Masterserver returns the file version, length, ID, and chunkserver information. Mfsmount then communicates with the chunkserver to obtain data. After the data is read, the control is returned to fuse.

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

MooseFS source code analysis (2)-mfsmount Module

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support

MooseFS source code analysis (2)-mfsmount Module

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

Trending Topic

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support