The--raid of storage reliability Technology

Source: Internet
Author: User

When a cloud project is delivered, it is inevitable to consider what raid the storage disk will take. For example: Our project engineer may recommend that you connect a clone virtual machine system disk Group RAID 10, complete replication virtual machine data disk

Using RAID5 or RAID6 and so on, what exactly is RAID? Why should we use RAID? How do we choose the right RAID format?

1) RAID Concept

RAID is an abbreviated redundant array of independent disk, redundant array of independent disks. It is a data protection technology that combines multiple pieces of separate hard disks (physical hard disks) in different ways.

Form a hard disk group (logical hard disk) that provides higher storage performance and data redundancy than a single hard drive.

2) Advantages of RAID

• Improved transfer rate

RAID significantly increases the data throughput of the storage system by simultaneously storing and reading data on multiple disks. In RAID, you can have many disk drives transfer data simultaneously, and these disk drives are logically a disk

Drive, so using RAID can reach a single disk drive several times, dozens of times times even up to a hundredfold rate.

• Provides fault-tolerant functionality

Ordinary disk drives fail to provide fault tolerance, RAID provides fault tolerance, and raid fault tolerance is built on the hardware fault tolerance of each disk drive, so it provides greater security. In many raid modes, there are

More complete mutual verification/recovery measures, even direct mutual mirror backup, which significantly improve the fault tolerance of the raid system, improve the stability of the system redundancy.

Second, RAID classification

At present the industry recognized standard is RAID0~RAID5. In addition, there are other 6,7,10 and so on.

This article gives a brief description of several commonly used RAID levels:

· RAID0

It divides the data into a certain size and writes it sequentially to the disk in the array. As shown in the following:

  

Theoretically, a RAID0 system consisting of n disks, which reads and writes performance will be n times the read performance of a single disk, and the storage efficiency of disk space is maximum (100%). Due to the influence of various factors such as bus bandwidth,

The actual lift rate will be lower than the theoretical value. However, the performance of a large amount of data parallel transmission and serial transmission is necessarily greatly improved. RAID0 has one obvious drawback: no data redundancy protection, once the data is corrupted,

will not be recoverable. RAID0 requires at least 2 hard drives to be implemented.

· RAID1

Also known as mirroring (mirror), it writes the data exactly to the working disk and to the mirrored disk, respectively.

  

The RAID1 system has a disk space utilization of 50%, which has an impact on data write time, but does not have any impact when reading. RAID1 provides very good data protection, and once the working disk fails, the system automatically

The mirrored disk reads data without affecting the user's work. and RAID1 support "hot replacement", that is, the failure of the power of the case to replace the disk, the replacement as long as the data from the mirror disk recovery. RAID1 requires at least 2 hard drives to be implemented. ---even blocks.

· RAID5

A standalone disk structure with distributed parity. Data verification information is evenly dispersed across the array of disks, the array of disks have both data and data validation information, data blocks and corresponding checksum information stored on different disks.

  

The check bit, or P-bit, is different or calculated by the data of the same band. When a data disk is damaged, the RAID5 system can reconstruct the corrupted data based on other data blocks in the same area and corresponding checksum information. RAID5 requires at least 3 hard drives to be implemented.

· RAID6

A disk structure with two parity checks. Based on RAID5, RAID6 further strengthens the data protection, which is actually an extended RAID5 level.

The data redundancy performance of RAID6 is quite good. However, due to the addition of a check, the efficiency of writing is worse than RAID5. Moreover, the design of the control system is more complicated, and the second block also reduces the effective storage space.

RAID6 requires at least 4 hard drives to be implemented.

· RAID10

is a combination of RAID0 and RAID1. The structure of the RAID10 is very simple, first creating 2 independent RAID1, and then the two separate RAID0 to form a RAID0, when the data is written to this logical raid, the data is written in an orderly two RAID1.

  

The RAID10 performs an array with RAID0 to RAID1 as a data protection array. With the same fault tolerance as RAID1, the overhead for fault-tolerant processing is essentially the same as for a single mirroring operation, with high I/O bandwidth due to the use of RAID0 as the execution level.

RAID10 requires at least 4 hard drives to be implemented.

Usage Scenarios

· RAID0 Usage Scenarios

RAID0 does not provide fault tolerance, but it has a high read performance. Therefore, RAID0 applications are used in situations where the read performance requirements are high, but the data stored is non-critical.

· RAID1 Usage Scenarios

RAID1 provides very good data protection, and once the working disk fails, the system automatically reads the data from the mirrored disk and supports "hot swap", so RAID1 is used for applications where data protection is of paramount importance.

· Usage scenarios for RAID5 and RAID6

RAID5 is a storage solution that combines storage performance, data security, and storage costs. RAID5 can provide data security for the system, but the level of protection is lower than RAID1 and disk space utilization is higher than RAID1.

RAID6 two independent parity systems using different algorithms, the reliability of the data is higher than the RAID5, even if the two disks fail at the same time will not affect the use of data.

Therefore, for the data to save the required level of requirements is not very high, we only need to use the regular RAID5. For now, for those organizations with high levels of data security, such as data centers, information centres,

It is still necessary to use RAID6 to protect data.

· RAID10 Usage Scenarios

RAID10 is ideal for applications such as database storage servers that require high performance, high fault tolerance, but little capacity requirements.

In the actual project, we need to consider the security requirements of the data storage, price requirements and other factors to choose a reasonable raid format.

The--raid of storage reliability Technology

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.