Fork () function

Source: Internet
Author: User

For users who write multi-process programs in Linux, fork is one of the most difficult concepts to understand: it executes but returns two values at a time.

First, let's look at the prototype of the fork function:

# I nclude <sys/types. h>

# I nclude <unistd. h>

Pid_t fork (void );

Return Value:

Negative number: if an error occurs, fork () returns-1, and no new process is created. The initial process is still running.

Zero: in the sub-process, fork () returns 0

Positive: In the parent process, fork () returns the PID of the positive child process.

Next, let's take a look at how to use fork to create sub-processes.

The sample code for creating a sub-process is as follows:

Pid_t child;

If (child = fork () <0)

/* Handle errors */

Else if (child = 0)

/* This is a new process */

Else

/* This is the initial parent process */

The Fock function is called twice but returns the ID of the child process to the parent process, and returns 0 to the child process,

This is because the parent process may have many child processes, so the child process ID returned must be used to track the child process,

The child process has only one parent process, and its ID can be obtained through getppid.

The following two examples are compared:

First:

# Include <unistd. h>

# Include <stdio. h>

Int main ()

{

Pid_t PID;

Int COUNT = 0;

PID = fork ();

Printf ("this is first time, pid = % d \ n", pid );

Printf ("this is second time, pid = % d \ n", pid );

Count ++;

Printf ("Count = % d \ n", count );

If (pid> 0)

{

Printf ("this is the parent process, the child has the PID: % d \ n", pid );

}

Else if (! PID)

{

Printf ("this is the child process. \ n ")

}

Else

{

Printf ("fork failed. \ n ");

}

Printf ("this is third time, pid = % d \ n", pid );

Printf ("this is fouth time, pid = % d \ n", pid );

Return 0;

}

The running result is as follows:


Problem:

This result is strange. Why does the printf statement run twice, but the "count ++;" statement only runs once?

Next, let's see:

# Include <unistd. h>

# Include <stdio. h>

Int main (void)

{

Pid_t PID;

Int COUNT = 0;

PID = fork ();

Printf ("now, the PID returned by calling fork () is % d \ n", pid );

If (pid> 0)

{

Printf ("this is the parent process, the child has the PID: % d \ n", pid );

Printf ("in the parent process, Count = % d \ n", count );

}

Else if (! PID)

{

Printf ("this is the child process. \ n ");

Printf ("Do your own things here. \ n ");

Count ++;

Printf ("in the child process, Count = % d \ n", count );

}

Else

{

Printf ("fork failed. \ n ");

}

Return 0;

}

The running result is as follows:

Now let's explain the question above.

When you look at this program, you must first understand the concept: Before the statement pid = fork (), only one process is executing this code, but after this statement, the two processes are executing. The Code of these two processes is completely the same. The next statement to be executed is if (pid> 0 ).......

In the two processes, the original one is called the "parent process" and the new one is called the "Child process ". The difference between parent and child processes is not only the process ID, but also the variable PID value. The PID stores the fork return value. One of the wonders of fork calling is that it is called only once, but can return twice. It may have three different return values:

1. In the parent process, fork returns the ID of the newly created sub-process;

2. In the sub-process, fork returns 0;

3. If an error occurs, fork returns a negative value;

There are two possible reasons for Fork errors: (1) the current number of processes has reached the limit set by the system, and the errno value is set to eagain. (2) system memory

The memory is officially called internal memory, which is separated from external memory. It is physically installed inside the computer, usually installed on the motherboard, so it is called memory. It is used to temporarily store the data to be processed by the processor or the result after processing. It can be seen that the memory is the workspace of the computer processor. It is a very important part of a computer.

The errno value is set to enomem.

Description of fork in E2:

The new process created by fork is called the child process. This function is called once but returns twice. The only difference in the returns is that the return value in the child is 0, whereas
The return value in the parent is the process ID of the new child. the reason the child's process ID is returned to the parent is that a process can have more than one child, and there is no function that allows a process to o ^ ain the process IDs of its children.
The reason fork returns 0 to the child is that a process can have only a single parent, and the child can always call getppid to o ^ ain the process ID of its parent. (process ID 0 is reserved for use by the kernel, so it's not possible for 0 to be the process
ID of a child .)

A new process created by fork is called a self-process. The fork function is called once, but is returned twice. The only difference in returned values is that 0 is returned in the child process, and the PID of the child process is returned in the parent process. In the parent process, the PID of the child process is returned because the parent process may have more than one child process, and no function can be used by a process to obtain the PID of the child process.

Both the child and the parent continue executing with the instruction that follows the call to fork. the child is a copy of the parent. for example, the child gets a copy of the parent's data
Space, heap, and stack. note that this is a copy for the child; the parent and the child do not share these portions of memory. the parent and the child share the text segment (section 7.6 ).

Both the child process and the parent process execute the code after the fork function call. The child process is a copy of the parent process. For example, the data space and stack space of the parent process are copied to the child process, rather than shared with the child process.Memory

The memory is officially called internal memory, which is separated from external memory. It is physically installed inside the computer, usually installed on the motherboard, so it is called memory. It is used to temporarily store the data to be processed by the processor or the result after processing. It can be seen that the memory is the workspace of the computer processor. It is a very important part of a computer.

.

Current implementations don't perform. a complete copy of the parent's data, stack, and heap, since a fork is often followed by an exec. instead, a technique called copy-on-write (COW) is used.
These regions are shared by the parent and the child and have their protection changed by the kernel to read-only. if either process tries to modify these regions, the kernel then makes a copy of that piece of memory only, typically a "page" in a virtual memory
System. Section 9.2 of Bach and sections 5.6 and 5.7 of mckusick et al. [1996] provide more detail on this feature.

Let's give a detailed comment.

# Include <unistd. h>

# Include <stdio. h>

Int main (void)

{

Pid_t PID;

Int COUNT = 0;

/* Execute the fork call and create a new process, which shares the data and stack space of the parent process. The subsequent Code commands create a copy for the child process. A Fock call is a replication process. Unlike a thread, a function is provided as an entry. After a Fock call, the entry of a new process is located in the Next statement of the Fock. */

PID = fork ();

/* The PID value here indicates whether the fork is currently executing the parent process or child process */

Printf ("now, the PID returned by calling fork () is % d \ n", pid );

If (pid> 0)

{

/* When fork is returned in the child process, the fork call returns the PID of the child process to the parent process. If the code is executed, but note that the Count value is still 0, because the Count value in the parent process is never re-assigned, we can see that the data and stack space of the child process are independent of the parent process, rather than sharing data */

Printf ("this is the parent process, the child has the PID: % d \ n", pid );

Printf ("in the parent process, Count = % d \ n", count );

}

Else if (! PID)

{/* Perform the auto-increment operation on count in the sub-process, but the Count value in the parent process is not affected. The Count value in the parent process is still 0 */

Printf ("this is the child process. \ n ");

Printf ("Do your own things here. \ n ");

Count ++;

Printf ("in the child process, Count = % d \ n", count );

}

Else

{

Printf ("fork failed. \ n ");

}

Return 0;

}

That is to say, the next process in Linux has three parts of data in the memory: "code segment", "Stack segment", and "data segment ". "Code segment", as its name implies, stores the data of program code. If several processes on the machine run the same program, they can use the same code segment. The "Stack segment" stores the return address of the subroutine, the parameters of the subroutine, and the local variables of the program. The data segment stores the global variables, constants, and dynamic data space allocated by the Program (for example, space obtained using functions such as malloc ). If the system runs several identical programs at the same time, the same stack segment and data segment cannot be used between them.

After careful analysis, we can know:

Once a program calls the fork function, the system prepares the preceding three segments for a new process. First, the system allows the new process and the old process to use the same code segment, because their programs are the same, the system copies a copy of the data segment and stack segment to the new process. In this way, all data of the parent process can be left to the child process. However, once a child process starts running, it inherits all the data of the parent process, but in fact the data has been separated and there is no impact between them, that is, they no longer share any data.

Fork () not only creates child processes with the same code as the parent process, but also automatically copies all context scenarios of the parent process at the fork execution point to the child process, including:

-- Global and local variables

-- Open file handle

-- Shared memory, messages, and other synchronization objects

If the two processes want to share any data, they need to use another function (shmget, shmat, shmdt, and so on. Now there are two processes. For the parent process, the fork function returns the process Number of the subroutine, and for the subroutine, the fork function returns zero. In this way, for the program, as long as you determine the return value of the fork function, you will know whether you are in the parent process or child process.


Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.