DRBD-configuration considerations

Last Update:2014-05-08 Source: Internet

Author: User

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

1. Online adjustment parameters:
Modify the configuration file of existing resources. The two peer nodes must be consistent, and then run drbdadm adjust <resource> on both nodes;
2. Online verification of data integrity; (this has a great impact on performance)
It verifies that the source calculates the encryption digest for a block storage device of a resource of each underlying device at a time and transmits it to the peer node to verify the local copy block corresponding to the digest, if they do not match, they are identified and re-synchronized. During online verification, resource replication is not blocked and the system is not interrupted;
Operation Method:
Modify the configuration file:

Resource <resource>
Net {
Verify-alg <algorithm>
}
} (Ps: can also be configured to the common block, applicable to all resources)

Run the following command:

Drbdadm verify <resource>
If the out-of-sync block appears during verification, you need to use it after the verification is completed:
Drbdadm disconnect <resource>
Drbdadm connect <resource>
This method is rarely used, but it can be configured as weekly or once a month;
3. Configure the synchronization rate:
In general, it is good to be suitable. It depends on the disk speed and Nic I/O. If the backend bandwidth is full, replication and programs are affected;
The recommended bandwidth is 30% of the available bandwidth.
Fixed synchronization rate:
Resource <resource>
Disk {
Sync-rate 40 M;
} (Ps: can be configured to the common block for all resources)

Temporary adjustment rate:
It may be used to speed up Synchronization After expected maintenance:

Drbdadm disk-options -- resync-rate = 200 M <resource>

To restore to the original synchronization rate: drbdadm adjust <resource> is executed on both nodes.
A range for configuring synchronization rates for multiple resources [Skipped for the moment]
Congestion policies and suspension of replication mostly appear in the Wide Area Network. We will not discuss them for the moment;

4. Disk IO error handling
Resource <resouce> {
Disk {
On-io-error <strategy>;
}

} (Ps: can be set in the common block, effective for all resources)

Several options to handle disk errors:

Detach: the default option. If an I/O error occurs on the underlying disk of the node, the device runs in diskless mode;

Pass_on: drbd reports errors to the upper layer, that is, the file system, but is often ignored;

Local-io-error calls the commands defined in the local disk I/O handler; local-IO-error is required to define the commands for error handling;

5. Disk Flushing
As long as the disk Controller supports DRBD disk flushing (most of them are supported ),
In a RAID environment containing the BBC, you can disable the DRBD disk flushing function to achieve higher performance;
Resource <resource>
Disk {
Disk-flushes no;
...
}
6. Split-brain notification:
Handlers {split-brain "/usr/lib/drbd/notify-split-brain.sh root ";...}
7. Split-brain repair strategies:
In most cases, manually fix the problem:
After-sb-0pri: Split brain has been detected, but now no node in the master role, for this option, drbd has the following keywords:
Disconnect: automatic recovery is not required. It only calls the script of the split brain processing program (IF configured), and the connection is closed and in the disconnection mode.
Discard-younger-primary: discard and roll back the last modification made above as the master.
Discard-least-changes: discard and roll back the changes on the host with few changes.
Discard-zero-changes: If no node changes, you only need to apply for modification on the same node.
After-sb-1pri: Split brain has been detected, there is a node in the master role, for this option, drbd has the following keywords:
Disconnect: like the after-sb-0pri, call the split brain processing script (IF configured), disconnect and in disconnected mode.
Consensus: same repair policy as in after-sb-0pri. If you can choose to use these policies to split your brain, you can automatically resolve them. Otherwise, the specified action is also disconnected.
Call-pri-lost-after-sb: The same repair policy as in the after-sb-0pri. If you can choose to use these policies to split your brain, call the pri-lost-after-sb program on the compromised node. This program must be configured in handlers and consider removing the node from the cluster.
Discard-secondary: No matter which host is in the secondary role, it is the hazard of split brain.
After-sb-2pri: Split brain is found when both nodes are in the master role. The secondary option uses the same keywords as the after-sb-1pri, discarding the secondary node and reaching consensus
A brief Configuration:

Resource data {
Protocol C;
Handlers {
Split-brain "/usr/lib/drbd/notify-split-brain.sh root ";
Local-io-error "/usr/lib/drbd/notify-io-error.sh;/usr/lib/drbd/notify-emergency-shutdown.sh; echo o>/proc/sysrq-trigger; halt-f ";
}
Startup {
Wfc-timeout 0;
Degr-wfc-timeout 120;
}
Disk {
On-io-error detach;
}
Net {
Cram-hmac-alg sha1;
After-sb-0pri discard-zero-changes;
After-sb-1pri discard-secondary;
After-sb-2pri disconnect;
Max-buffers 8000;
Max-epoch-size 8000;
Sndbuf-size 0;
}
Syncer {
Rate 90 M;
Al-extents 257;
}
On BGP-LF-1MS2232 {
Device/dev/drbd0;
Disk/dev/sda4;
Address 192.168.1.104: 7788;
Meta-disk internal;
}
On BGP-LF-1MS2233 {
Device/dev/drbd0;
Disk/dev/sda4;
Address 192.168.1.105: 7788;
Meta-disk internal;
}
}

DRBD for Linux high availability (HA) CLUSTERS

DRBD Chinese application guide PDF

Installation and configuration notes for DRBD in CentOS 6.3

High-availability MySQL based on DRBD + Corosync

Install and configure DRBD in CentOS 6.4

DRBD details: click here
DRBD: click here

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

DRBD-configuration considerations

Contact Us

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support