Data deduplication for Windows Server 2012

Source: Internet
Author: User

In Windows 2012, you can enable data deduplication for non-system volumes. Deduplication optimizes volume storage by locating redundant data in the volume, and then ensuring that the data is saved in only one copy of the volume. This is accomplished by storing the data in a single location and providing this location reference for other redundant copies of the data. Since data is divided into 32-128kb chunks, deduplication optimizes not only the redundant files in the volume, but also the redundant portions of the file.

Data deduplication can be deployed with DFS replication to provide a more efficient storage and replication infrastructure.

How Data deduplication works:

Once data deduplication is enabled on a volume, Windows 2012 optimizes volume space by maintaining the following components:

1. Data Block storage. The optimized file data is placed in the database store.

2. optimized files. Optimized files are stored in the form of a parse point. A parse point contains a pointer to the location of the data stored in the block store, so the corresponding data block can be retrieved successfully when the access request is received.

3. The file is not optimized. These include any files that do not match the file date criteria in Data deduplication. To optimize with data deduplication, files must remain static for a certain amount of time. Files that are not optimized can contain system state files, encrypted files, files smaller than 32KB, files with extended attributes, or files that are used by other apps.



Data deduplication Lab Environment:


LON-DC1 win2012 R2 Domain control

LON-SVR1 win2012 R2 member server


First use the PowerShell command to create a 300M file in the Deduplication folder of the C drive File1.txt

(New-object system.io.fileStream e:\deduplication\file1.txt,create,readwrite). SetLength (300MB)

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M00/4D/63/wKiom1RQhrbiiFSxAAMxhf2R7VA379.jpg "title=" QQ picture 20141029141736.jpg "alt=" Wkiom1rqhrbiifsxaamxhf2r7va379.jpg "/>

Copy multiple copies of a file1.txt file

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M02/4D/63/wKiom1RQiC2iKGsfAAReZfwjxIE291.jpg "title=" QQ picture 20141029142349.jpg "alt=" Wkiom1rqic2ikgsfaarezfwjxie291.jpg "/>

Installing the Data deduplication feature

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M02/4D/52/wKiom1RPZp7isvR5AALptST27S0849.jpg "title=" QQ picture 20141028174742.jpg "alt=" Wkiom1rpzp7isvr5aalptst27s0849.jpg "/>

Use the Eanble-dedupvolume e:-usagetyep default command to enable the deduplication of the e volume, or go to the file and storage services-volume in Server Manager and right-click on the e-disk to configure Data deduplication, In the pop-up Settings window, select General purpose file server.

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M00/4D/63/wKioL1RQiqPwf_3HAAQJ3G9PdvY819.jpg "title=" QQ picture 20141029143255.jpg "alt=" Wkiol1rqiqpwf_3haaqj3g9pdvy819.jpg "/>

After data deduplication is enabled, you can delete the duplicate save and deduplication rate items on the Server Manager volume in the e-drive are not empty

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M01/4D/64/wKiom1RQkxLiWnn5AAO74anSx84246.jpg "title=" QQ picture 20141029151019.jpg "alt=" Wkiom1rqkxliwnn5aao74ansx84246.jpg "/>

In the data deduplication settings, we can set a recurring deletion schedule, or use PowerShell command New-dedupschedule to create a schedule

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M02/4D/64/wKioL1RQkCbingb7AAUGnlEZ2G4251.jpg "title=" QQ picture 20141029145625.jpg "alt=" Wkiol1rqkcbingb7aaugnlez2g4251.jpg "/>

After the successful creation, we can open the Windows scheduled task, check deduplication, we can see the plan of the Optimizee we just created

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M02/4D/64/wKiom1RQkZ7SOBmcAAXijpSJv_g364.jpg "title=" QQ picture 20141029150408.jpg "alt=" Wkiom1rqkz7sobmcaaxijpsjv_g364.jpg "/>

To see the effect of data deduplication immediately, we use Start-dedupjob-type optimization-volumn E: To start a duplicate delete task and then run Get-dedupjob to see the status of the task running

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M02/4D/64/wKioL1RQlM-iG8QWAASxTqgI24Q544.jpg "title=" QQ picture 20141029151618.jpg "alt=" Wkiol1rqlm-ig8qwaasxtqgi24q544.jpg "/>

When you see progress as 100% through Get-dedupjob, the repeat delete has been executed, we return to the Server Manager volume to see the data deduplication rate of e-drive and delete the duplicate saved value changes

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M01/4D/64/wKioL1RQlZmgRotFAAO8soe6yLY038.jpg "title=" QQ picture 20141029151940.jpg "alt=" Wkiol1rqlzmgrotfaao8soe6yly038.jpg "/>

Through the above image, the disk space does not change after the deduplication task is executed, because we use the default value for file time in data deduplication, the default is to repeat the file deletion of files 3 days ago, in this experiment because the files are all newly created, so we change this value to 0, Can successfully complete the deduplication task, execute the following command to modify the file time

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M00/4D/64/wKioL1RQlqmTHARIAAJsLHO6DOU718.jpg "title=" QQ picture 20141029152411.jpg "alt=" Wkiol1rqlqmthariaajslho6dou718.jpg "/>

Then we execute the start-dedupjob again, then use Get-dedupjob to view the task execution, and when progress becomes 100%, go back to the Server Manager's volume to see the status of E drive again. The deduplication rate and the deduplication value are no longer 0, the deduplication rate is 71%, and the data deduplication feature is very helpful for our storage space utilization.

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M00/4D/64/wKiom1RQl9OgN6CYAAO_HjSBakM480.jpg "title=" QQ picture 20141029153037.jpg "alt=" Wkiom1rql9ogn6cyaao_hjsbakm480.jpg "/>

We opened the E-drive Deduplication folder and found that the file was not deleted, so this feature is transparent to the user and it does not affect the user's data reading at all.

650) this.width=650; "src=" Http://s3.51cto.com/wyfs02/M02/4D/65/wKiom1RQmG2hdeAjAAPSbcdd8yk665.jpg "title=" QQ picture 20141029153309.jpg "alt=" Wkiom1rqmg2hdeajaapsbcdd8yk665.jpg "/>





This article is from the "Dry Sea Sponge" blog, please be sure to keep this source http://thefallenheaven.blog.51cto.com/450907/1569251

Data deduplication for Windows Server 2012

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.