I often need to look up reference material and save the pieces I like. Cnblogs (Blog Park) is a good place for this: the quality of the material is relatively high, and it is worth following a whole series of articles there. I also write blog posts of my own to collect useful technical material. Last year I designed a data solution, but for various reasons I later abandoned it and took it no further. For more information, see http://www.cnblogs.com/jamesli2015/archive/2011/11.html.
A crawler is an important component of a data solution. It captures articles from a blog and saves them as local files in Doc, PDF, XPS, Epub, and other formats. Recently, members of the group (QQ group: 1637 21037) needed to back up their blog content. I already had this component, so I tidied it up and am making it available for you to download and use.
The whole program has a single interface and requires no third-party runtime libraries. It is compiled against the .NET 4 platform. If it does not run, download and install the .NET 4 runtime first.
There are two ways to download blog articles. The first works from blogger IDs, which I enter in the user ID panel.
The input data format is:
http://www.cnblogs.com/dudu/
http://www.cnblogs.com/JamesLi2015/
Click Start to begin the download.
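The tool itself is a .NET program and its source is not shown here, so the following Python sketch only illustrates how input in this format could be turned into blogger IDs. The function names `blogger_id` and `parse_blogger_ids` are hypothetical, and I am assuming that the ID is simply the first path segment of each URL.

```python
from typing import List, Optional
from urllib.parse import urlparse


def blogger_id(line: str) -> Optional[str]:
    """Return the blogger ID from one input line, or None if the line is not usable.

    Assumption: the ID is the first path segment of a cnblogs.com URL.
    """
    url = line.strip()
    if not url:
        return None
    parsed = urlparse(url)  # urlparse lowercases the scheme for us
    host = parsed.hostname or ""
    if parsed.scheme not in ("http", "https"):
        return None
    if host != "cnblogs.com" and not host.endswith(".cnblogs.com"):
        return None
    segments = [s for s in parsed.path.split("/") if s]
    return segments[0] if segments else None


def parse_blogger_ids(text: str) -> List[str]:
    """Parse one URL per line, skipping bad lines and duplicates, keeping order."""
    ids: List[str] = []
    for line in text.splitlines():
        found = blogger_id(line)
        if found and found not in ids:
            ids.append(found)
    return ids
```

Pasting the two example lines above into `parse_blogger_ids` would give one ID per blogger, in the order they were entered.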
The second method is to download articles from a block of HTML text. For example, if you like a particular series of articles, open the page, find the relevant links, copy them into the text panel, click Analysis to count the links found, select the ones you need, and click Start to download.
As an example of my own, I use this page: http://www.cnblogs.com/AllBloggers.aspx
Copy the text for the top 300 into the text panel, click the Analysis button, click the Select All button, and then start the download.
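As a rough idea of what the Analysis step might do with the pasted HTML, here is a minimal Python sketch that collects the distinct links using only the standard library. The class `LinkCollector`, the function `analyze`, and the default base URL are my own assumptions for illustration, not the tool's actual implementation.

```python
from html.parser import HTMLParser
from typing import List
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect distinct http(s) links from <a href="..."> tags, in document order."""

    def __init__(self, base_url: str) -> None:
        super().__init__()
        self.base_url = base_url
        self.links: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the (assumed) base URL.
                url = urljoin(self.base_url, value)
                if url.startswith(("http://", "https://")) and url not in self.links:
                    self.links.append(url)


def analyze(html_text: str, base_url: str = "http://www.cnblogs.com/") -> List[str]:
    """Return the unique links found in a block of HTML text."""
    collector = LinkCollector(base_url)
    collector.feed(html_text)
    collector.close()
    return collector.links
```

The length of the returned list corresponds to the link count reported after analysis, and selecting entries from it corresponds to ticking items before clicking Start.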
The configuration page holds the settings for the output formats.
The doc format is selected by default, and files are saved in Word 2003 format. Remove Temp File: deletes the intermediate files after the download completes.
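To make the two settings concrete, here is a small Python sketch of how such a configuration could be modeled. Everything in it is an assumption for illustration: the names `ExportConfig` and `clean_up`, the lowercase format keys, and the choice to leave Remove Temp File off by default (the default in the real tool is not stated here).

```python
from dataclasses import dataclass, field
from pathlib import Path
from typing import Iterable, List, Set

SUPPORTED_FORMATS = ("doc", "pdf", "xps", "epub")


@dataclass
class ExportConfig:
    # doc is the only format selected by default.
    formats: Set[str] = field(default_factory=lambda: {"doc"})
    # Assumed default; the real tool's default is not documented here.
    remove_temp_files: bool = False

    def __post_init__(self) -> None:
        unknown = set(self.formats) - set(SUPPORTED_FORMATS)
        if unknown:
            raise ValueError(f"unsupported formats: {sorted(unknown)}")


def clean_up(config: ExportConfig, temp_files: Iterable[str]) -> List[Path]:
    """Delete intermediate files if the option is on; return the paths removed."""
    if not config.remove_temp_files:
        return []
    removed: List[Path] = []
    for name in temp_files:
        path = Path(name)
        if path.exists():
            path.unlink()
            removed.append(path)
    return removed
```

Calling `clean_up` once all downloads have finished mirrors the behavior described for Remove Temp File.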
That is everything. You can use this tool to download your favorite topics or blog articles.
Let's look at the results once the download is complete:
1. Save in doc format if you want to edit, modify, or trim the file. You can then save your favorite clips to your knowledge base.
Personally, I like the portable ("green") version of Evernote 2.2. It is under 8 MB, and I keep it together with its database files.
2. PDF and XPS are read-only. If you do not need them, deselect them in the configuration panel.
3. The Epub format is convenient for reading on mobile phones. A phone can display Office formats, but my experience has not been ideal: the screen is small, and I constantly have to scroll up and down. I have not tested the Epub output on a phone. If the Epub output is incorrect, please report the problem to me.
4. Currently, only this blog community (Cnblogs) is supported. Other popular blog sites will be integrated only after complete testing. I would rather keep the tool stable with fewer features than see frequent crashes or inexplicable problems.
Program: Document Exporter
If you have suggestions or problems to report, join the group or email me. Thank you for your support.
Version 1.1 has been released. Please download the new file.
1. Attachments can be downloaded. If a document contains an attachment, it is downloaded to the same directory as the document. Currently, the supported formats are ZIP and RAR.
2. You can select a directory in the directory text box.
3. Download cancellation is supported.
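The attachment rule from item 1 can be sketched in a few lines of Python. This is only an illustration under my own assumptions: attachments are recognized by the file extension in the link, the helper names `is_attachment` and `attachment_target` are hypothetical, and the example.com URLs in the usage note are placeholders.

```python
import posixpath
from pathlib import Path
from urllib.parse import unquote, urlparse

# Only these two formats are currently supported.
ATTACHMENT_EXTENSIONS = {".zip", ".rar"}


def is_attachment(url: str) -> bool:
    """True if the link's path ends in a supported attachment extension."""
    path = urlparse(url).path  # ignores any query string or fragment
    return posixpath.splitext(path)[1].lower() in ATTACHMENT_EXTENSIONS


def attachment_target(url: str, document_path: str) -> Path:
    """Place the attachment in the same directory as the saved document."""
    file_name = posixpath.basename(unquote(urlparse(url).path))
    return Path(document_path).parent / file_name
```

For a document saved as `out/post.doc`, a link to `http://example.com/files/demo.rar` would be stored as `out/demo.rar`, next to the document.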