9. distributed applications

Last Update:2014-09-27 Source: Internet

Author: User

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

Definition

In a distributed system that contains several Erlang nodes, applications may need to be controlled in a distributed manner. If an application runs on a node, the application will be restarted on another node.

Such an application is calledDistributed Application. Note that application control is distributed, and all applications can be distributed, for example, services on other nodes.

Because a distributed application may move between nodes, there must be some addressing mechanism to ensure that it can be found by other applications, regardless of the node on which it is currently running. This issue will not be discussed here, the kernel moduleglobalAnd stdlib modulespgCan be used for this purpose.

Specify Distributed Application

A Distributed Application is controlled by an application controller and a distributed application.dist_acTo control. These processes arekernelPart of the application. Therefore, distributed applications must be configured throughkernelUse the following configuration parameters (seekernel(6)):

distributed = [{Application, [Timeout,] NodeDesc}]: Application Application = atom()Where to execute. NodeDesc = [Node | {Node,...,Node}]Is the list of node names, sorted by priority. The priority of a node in a meta-group is uncertain. Time = integer()Specifies the number of milliseconds to wait before restarting the application on another node.

In order for the distribution of application control to work properly, the nodes running the distributed application must communicate with each other and determine where to start the application. This is to use the followingkernelConfigured parameters:

sync_nodes_mandatory = [Node]: Specify which other nodes must be started ( sync_nodes_timeoutWithin the specified timeout period)
sync_nodes_optional = [Node]: Specify which other nodes can be started ( sync_nodes_timeoutWithin the specified timeout period)
sync_nodes_timeout = integer() | infinity: Specify how many milliseconds wait before other nodes start

After startup, the node will wait for allsync_nodes_mandatoryAndsync_nodes_optionalThe specified node is ready. When all nodes are ready, or when all forced nodes are readysync_nodes_timeoutAfter the specified time, all applications will be started. If not all the forced nodes are ready, the node will be terminated.

For example, an applicationmyappShould run on nodes[email protected]. If the node failsmyappIt should be in[email protected]Or[email protected]. For[email protected]A system configuration file of should be:

[{kernel,  [{distributed, [{myapp, 5000, [[email protected], {[email protected], [email protected]}]}]},   {sync_nodes_mandatory, [[email protected], [email protected]]},   {sync_nodes_timeout, 5000}  ] }].

[email protected]And[email protected]Except for the list of forced nodes[email protected]It should be[[email protected], [email protected]],[email protected]It should be[[email protected], [email protected]].

Start and Stop distributed applications

After all participating (forced) nodes are started, you canOn all these nodesCallapplication:start(Application)To start distributed applications.

Of course, you can also use the startup script (seeRelease) Automatically start the application.

The application willdistributedConfigure the node that has been started and runs as specified by the parameter. The application is started normally. That is to say, the application master program is created and the application callback function is called:

Module:start(normal, StartArgs).

For example, in the previous section, all three nodes are started and the following system configuration files are specified:

> erl -sname cp1 -config cp1> erl -sname cp2 -config cp2> erl -sname cp3 -config cp3

When all the nodes are up and running, you can startmyapp. This is done by callingapplication:start(myapp). Then, it is started oncp1As shown in.

Similarly, the application must callapplication:stop(Application)To stop the application.

Failover

If the node running the application fails, the application will be restarted after the specified timeout.distributedConfigure the first startup and running node specified by the parameter. This process is calledFailover(Failover ).

The application is started on the new node in the normal way, that is, it is called through the Application main program:

Module:start(normal, StartArgs)

Note: If the application definesstart_phasesKey (seeIncluded applications), The application must also use the following calls:

Module:start({failover, Node}, StartArgs)

WhereNodeIs the terminated node.

Example: Ifcp1Then the system checks other nodes --cp2Orcp3-- The minimum number of running applications, but it takes 5 seconds to getcp1Restart. Ifcp1No restart, at the same timecp2Running application ratiocp3Less, somyappThecp2.

Assume thatcp2It also fails and does not restart within 5 seconds. So nowmyappThecp3.

Take over

If a node is starteddistributedIt has a higher priority than the node currently running the distributed application, so the application will first restart on the new node and stop on the old node. This process is calledTake over.

In this case, the following application main program is called to start the application:

Module:start({takeover, Node}, StartArgs)

WhereNodeIt is an old node.

For example, ifmyappCurrently running incp3, Ifcp2If it is restarted, it will not restartmyappBecause the nodecp2Andcp3The order between them is uncertain.

However, ifcp1Also restarted, Functionapplication:takeover/2WillmyappMovecp1Becausecp1Priority Ratio of the applicationcp3High. In this casecp1RunModule:start({takeover, [email protected]}, StartArgs)To start the application.

9. distributed applications

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

9. distributed applications

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support

9. distributed applications

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

Trending Topic

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support