Analysis of the impact of original and acquisition on search engine
Source: Internet
Author: User
KeywordsSearch engines cross-stitch through
Acquisition refers to the activity of picking and recording writing materials with definite direction and clear purpose. It mainly refers to the investigation of interviews and access and collection of information. The main function of acquisition is to obtain direct and indirect materials for writing, analyzing and reporting.
Network collector is used to batch collection of Web pages, forums and other content, directly saved to the database or published to the website of a tool, is a target page to pick some data to form a unified local database process.
This data is intended to exist as text in a visible Web page. This process requires more than just web crawler and Web wrapper. A complex data extraction process needs to address a variety of obstacles, such as session recognition, HTML forms, client-side Java scripts, and data consolidation issues such as inconsistencies with datasets and word sets, as well as missing and conflicting data. can automatically collect the original page according to the rules set by the user, obtain the content that the format webpage needs.
Demo Website:
Original station: http://www.58xiu.cn/cross-stitch City
Collecting Station: http://www.miaidao.cn/Island Forum
The data are from the Webmaster Station statistics.
First, the basic introduction
The time difference between station station is about 20 days.
Secret Love Island Station collection data more than 3,000 stickers, at present a total of more than 6,000 stickers. The station built one months after the PR value reached 3. Cross-stitch City by the member posts, reprint accounted for the total number of 15%. The station until 2009 1 Yuan 1st PR value to reach 3
. Second, through the search engine keyword routing analysis
Secret Love Island through Baidu, can bring the posts are members of the site to ask questions, published by the original stickers brought. Those collection stickers basically did not play a role. Every day through Baidu to measure multiplying IP.
Cross-stitch city, because this site many posts are drawings exchange stickers. Some members in the post, in addition to a text title, content is a direct picture or drawing compression package. Can take things Baidu search on those in addition to the text title, there are pictures described posts. Daily through Baidu to measure 1000IP.
Iii. Original and reproduced
Secret Love Island Although there are more than 6,000 stickers, but the current GG included 2170 pages, Baidu included more than 900 pages.
Cross-stitch City Although only more than 5,000 stickers, but the current GG included 2000 pages, Baidu included 3,700 pages.
Iv. Summary
Collection allows members to open a website, they feel the content of this station rich. Only give members a momentary visual impact. Do not recruit search engines like. GG and Baidu contrast, as if GG Good point, at least collect so much, GG very face in one months let my collection station PR value is 3. But this station through Baidu to the amount of too little, every day also has 100 a dozen IP.
Cross-stitch City Although the content is 1.1 points updated. And PR value is also very slow, but from the amount of Baidu is indeed good. This has to say, the original more by Baidu like.
It takes two days to find a sticker on the island of MIDI. And in the Cross Stitch City afternoon post, basic evening can be seen included. Obviously this is related to the update speed of the website.
So, the collection is made for members to see. We can go to collect, but must have original. Collection suitable for the new station. When making a new station looks a certain scale, or to be more original paste, with rapid updates, I think, do a good job of a website is not difficult.
Finished, by the way the friendship link. The PR value is at least 2, so the station is stable. My QQ402825587
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.