Multi-Backup is a cloud platform (SaaS) application provider that focuses on backup, recovery, migration, storage, and archiving of business data in the cloud. In order to achieve adequate data security, we have adopted the self-developed cloud 5 technology.
The source of the problem
In February 2014, a user hurried to call. Mention of his site every day a large number of orders to produce, he also backed up with multiple backup of his website to the network disk, but he is still more worried, in case of cloud disk problems or deactivation (Baidu Cloud has appeared this situation), is his data lost? He told the multi-backup worker that he had done before, This data he has manually backed up several places, but the management is cumbersome, easy to mistake, and once even directly deleted.
This user is not groundless, based on the complexity of the network environment, there will be a variety of situations, how to set up data fault tolerance, correction, correlation mechanism, for multi-backup is an important technical problem. If there is no good cloud solution, the user's important data back up, also can not give the user a good reliability protection commitment, backup also lost meaning.
The problem immediately got a quick response from the product technical team, in a short span of 3 days, found several sets of solutions, of which from the Thunderbolt joined a senior technical expert, proposed the use of RAID 5 technology to enhance the reliability of cloud storage, can greatly improve the cloud storage reliability, You can also increase the speed at which backup results are saved.
What is raid
Redundant array of independent hard disks (RAID, R edundant A rray of I ndependent D isks), referred to as hard disk array. The disk array is made up of many inexpensive disks, combined into a large disk group, which uses individual disks to provide data with the added effect to improve the performance of the entire disk system. Using this technique, the data is cut into many sections, which are stored on each hard drive. When any hard disk in the array fails, the data can still be read, and when the data is reconstructed, the data is computed and re-placed into the new hard disk.
RAID has several advantages over a single hard drive: Enhanced data integration, enhanced fault tolerance, increased throughput or capacity. In addition, the disk array looks like a separate hard disk or logical storage unit for the computer. It is common to have raid-0,raid-1,raid-5,raid-10. Assuming that a disk has a failure rate of 1%, the simplest RAID5 can reduce the data failure rate by more than 30 times times. That is, the failure rate will be less than 0.033%.
In these technologies, RAID-5 is a tradeoff between storage performance, data security, and storage costs. It uses disk Striping (hard disk partitioning) technology. RAID 5 requires at least three disks, RAID 5 does not back up the stored data, but rather stores the data and the corresponding parity (parity information) on each disk that makes up the RAID5, and the parity and the corresponding data are stored on separate disks. When a RAID5 disk data is damaged, you can use the rest of the data and the corresponding parity to recover the corrupted data. such as
650) this.width=650; "src=" Http://www.d1net.com/uploadfile/2014/0804/20140804092812276.png "width=" 518 "height=" 369 "/>
The birth of multi-backup cloud 5
Based on the high price of traditional backup, the technician has limited energy, while the cloud backup price is low, and the site cloud host natural blood complement. After examining the feasibility of the technology, our storage research and development expert group immediately started to transplant the idea of RAID 5 into the multi-backup intelligent agent mode, and realized the schematic diagram as follows:
650) this.width=650; "src=" Http://www.d1net.com/uploadfile/2014/0804/20140804092813815.png "width=" 528 "height=" 395 "/>
The data is compressed, divided, encrypted, and then written to cloud a in which the rest of cloud B or Cloud C is written to parity. Once any cloud facility data is destroyed by force majeure, we can call parity from another cloud facility to reconstruct the data. That is, only at the same time there are two or more clouds at the same time, it can lead to the data is not available, the probability of how small, I am afraid that the industrial level of 6 9 (99.9999%) is enough to describe.
After the RAID 5 model is applied on multiple cloud platforms, more low-cost, reliable storage models will be introduced.
This article is from the "Big Meatball" blog, please make sure to keep this source http://12478147.blog.51cto.com/9663367/1621939
Multi-Backup Cloud 5 technology: the perfect migration of traditional data backup ideas