How Search Engines Determine Whether Your Content Is Original

Source: Internet
Author: User

People in the group chat often ask how search engines decide which copy of an article is the original. Today Walnut shares some personal views on how search engines determine original content.

First, let's define two concepts: original and pseudo-original.

Original: put simply, content that is published on the web for the first time.

Pseudo-original: an original that has been modified and republished a second or nth time, for example by revising the title, adding a summary, or reprinting only part of the content.

How does a search engine judge originality?

Generally speaking, several factors decide it:

1. Snapshot date.

2. Spider crawl date.

3. The number of external links pointing to the page.

4. The degree to which the article has been modified.

For example, suppose an article titled "How Search Engines Determine Whether Your Content Is Original" is published for the first time on a blog or website at 10 o'clock today. What happens next?

A search engine spider visits the blog or website, finds the page, analyzes its content, and stores it in the database. Because this is the first time the content has been discovered, it is identified as original.

The indexing and judgment process involves several details:

1. Necessary conditions

If the site itself has not been indexed, will the article be considered original?

Of course not, because the article will never appear in the search database!

-- So how can it be recognized as original content?

-- The first condition is that the site must already be indexed by search engines.

What if the site is indexed but not updated often?

-- Simple: if the site is not updated often, a published article is considered original as of the time it gets indexed.

3. Reprinting and Scraping

What if the article is reprinted?

If the article is reprinted, it depends on which site has the faster update cycle: the site that reprinted it or the site that first published it.

-- I'm not quite sure what the update cycle means here.

For example, site A publishes an article and site B reprints it. If the spider visits site A first and finds the article, and only then finds it on site B, the original weight clearly goes to site A.

-- Does the same apply to scraped content?

-- Yes, scraping works the same way. If B scrapes A but B is indexed earlier than A, B may be treated as the original!
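
The "first found wins" rule described here can be sketched in a few lines of Python. This is only an illustration of the reasoning above, not how any real search engine works: the function name `attribute_original`, the site names, the fingerprint string, and the timestamps are all hypothetical.

```python
from datetime import datetime


def attribute_original(crawl_events):
    """Credit each article to the site where the spider found it first.

    crawl_events: iterable of (site, fingerprint, crawl_time) tuples, where
    fingerprint is any value that identifies the article's content.
    """
    first_seen = {}
    for site, fingerprint, crawl_time in crawl_events:
        current = first_seen.get(fingerprint)
        # Keep only the earliest crawl recorded for each piece of content.
        if current is None or crawl_time < current[1]:
            first_seen[fingerprint] = (site, crawl_time)
    return {fingerprint: site for fingerprint, (site, _) in first_seen.items()}


# Hypothetical data: site B scraped site A, but the spider reached B first.
events = [
    ("site-b.example", "article-1", datetime(2024, 1, 1, 10, 30)),
    ("site-a.example", "article-1", datetime(2024, 1, 1, 11, 0)),
]
print(attribute_original(events))
```

With this data the scraper, site B, is credited, which is exactly the risk the dialogue points out.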

4. Access time

What if the spider visits site B first?

Then the weight goes to site B. That is what generally happens!

What if site B's reprint includes a link to the original article page on site A?

-- That case is clear too. Right after indexing, if both results rank together, site B may still rank better.

Of course, the more the article is reprinted and the more links site A collects, the better for site A's copy, and its ranking will gradually move to the front.

-- What if yet another reprint links to site B's page instead?

That would be funny, a joke played on the search engines. But if they did not deal with it, ranking would turn into a link popularity game.

However, if both pages have many external links and the counts are close, the judgment should return to the starting rule: whoever is indexed first is the original.
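
One way to read this tie-break is sketched below. It assumes that a clear lead in external links decides the matter and that near-equal link counts fall back to the first-indexed rule. The function `pick_original`, the dictionary keys, the `link_gap` threshold of 0.5, and the sample figures are hypothetical choices for illustration, not values any search engine publishes.

```python
from datetime import date


def pick_original(page_a, page_b, link_gap=0.5):
    """Pick the page treated as original under the assumed tie-break.

    Each page is a dict with 'site', 'external_links', and 'first_indexed'.
    """
    links_a = page_a["external_links"]
    links_b = page_b["external_links"]
    most = max(links_a, links_b)
    # Relative difference in link counts; 0.0 when neither page has links.
    gap = abs(links_a - links_b) / most if most else 0.0
    if gap >= link_gap:
        # One page clearly has more external links, so it wins.
        return page_a if links_a > links_b else page_b
    # Link counts are close: whoever was indexed first is the original.
    return min(page_a, page_b, key=lambda page: page["first_indexed"])


site_a = {"site": "site-a.example", "external_links": 40,
          "first_indexed": date(2024, 1, 2)}
site_b = {"site": "site-b.example", "external_links": 38,
          "first_indexed": date(2024, 1, 1)}
print(pick_original(site_a, site_b)["site"])
```

Here the link counts are nearly equal, so the earlier indexing date decides and site B is chosen.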

5. Snapshot Date

-- The page whose snapshot shows the earliest date is generally the original, right?

-- Not necessarily. That holds within one update cycle: for example, within a week of publication, the address with the earliest snapshot time will be recognized as the original.

But if the article was published several months ago, the search engine may have re-fetched the snapshot, and the snapshot date will have changed!

-- Is there any other possibility?

-- Yes. A search engine such as Baidu may keep a holding database of crawled pages, and content appears in the search results only after it has been filtered. Problems can arise during this period. Suppose site A publishes first and site B reprints, and the spider visits site A first and then site B. The engine may still release site B's result while site A's page remains in the holding database.

So a page that is missing from the search results has not necessarily been missed by the spider. It may already be recorded in the search engine's store and simply not released when you checked. For example, content released on the 25th may carry a snapshot dated the 20th: that is content the search engine already had in store. This point in time is also the key to testing originality.

This situation generally arises between a new site and an old site: site A publishes and site B reprints, but the search engine's trust in site A is low. As long as site A was visited first, though, the original right still belongs to site A. This is the hardest case to untangle, because we do not know which site the spider visited first unless we have the server logs of both sites, which show when the search engine accessed each of the two pages.
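
If you do have both sites' access logs, comparing the spider's first visit to each page is straightforward. The sketch below assumes the logs use the common "combined" log format and that the spider identifies itself with `Baiduspider` in its user agent. The function `first_spider_visit`, the paths, the IP addresses, and the log lines are made-up examples.

```python
import re
from datetime import datetime

# Matches the timestamp, request path, and user agent of a combined-format line.
LOG_LINE = re.compile(
    r'\[(?P<time>[^\]]+)\] "(?:GET|HEAD) (?P<path>\S+) [^"]*" '
    r'\d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def first_spider_visit(log_lines, path, spider="Baiduspider"):
    """Return the earliest time the named spider requested path, or None."""
    earliest = None
    for line in log_lines:
        match = LOG_LINE.search(line)
        if not match or match["path"] != path or spider not in match["agent"]:
            continue
        visited = datetime.strptime(match["time"], "%d/%b/%Y:%H:%M:%S %z")
        if earliest is None or visited < earliest:
            earliest = visited
    return earliest


BAIDU_UA = ("Mozilla/5.0 (compatible; Baiduspider/2.0; "
            "+http://www.baidu.com/search/spider.html)")
site_a_log = [
    '192.0.2.7 - - [20/Mar/2024:09:00:00 +0000] "GET /post.html HTTP/1.1" '
    '200 5120 "-" "Mozilla/5.0 (ordinary browser)"',
    '192.0.2.10 - - [20/Mar/2024:10:15:32 +0000] "GET /post.html HTTP/1.1" '
    '200 5120 "-" "' + BAIDU_UA + '"',
]
site_b_log = [
    '192.0.2.10 - - [20/Mar/2024:11:40:05 +0000] "GET /copy.html HTTP/1.1" '
    '200 4980 "-" "' + BAIDU_UA + '"',
]

visit_a = first_spider_visit(site_a_log, "/post.html")
visit_b = first_spider_visit(site_b_log, "/copy.html")
print("A first" if visit_a < visit_b else "B first")
```

A user agent string can be forged, so a careful check would also confirm that the requests really come from the search engine; that step is left out of this sketch.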

6. Pseudo-original

-- Will pseudo-original content also be considered original?

Most of the time, a search engine spider is about as smart as a three-year-old child and cannot clearly tell these things apart, because its thinking is too mechanical. If you change the title and rewrite passages of the article, the spider will find it hard to decide whether the article has already been indexed. It may be sure that some of the content is duplicated, but it cannot conclude from that alone that the article is a reprint. Of course, as search engine programs improve, some measure of similarity should emerge: for example, when the text of two pages is similar above a certain percentage, the later one will be considered a reprint.
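
A similarity check of this kind can be sketched with word shingles and Jaccard similarity, a common technique for near-duplicate detection. The article does not say which method search engines use, so treat the technique, the function names, the shingle size of 3, and the 0.5 threshold as assumptions for illustration.

```python
def shingles(text, size=3):
    """Return the set of overlapping word sequences of the given size."""
    words = text.lower().split()
    if len(words) < size:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def similarity(text_a, text_b, size=3):
    """Jaccard similarity of the two texts' shingle sets, from 0.0 to 1.0."""
    a, b = shingles(text_a, size), shingles(text_b, size)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def looks_reprinted(text_a, text_b, threshold=0.5):
    """Flag the pair as a likely reprint when similarity reaches threshold."""
    return similarity(text_a, text_b) >= threshold


original = ("the spider records the first site where it finds an article "
            "and treats that copy as the original")
lightly_edited = ("the crawler records the first site where it finds an article "
                  "and treats that copy as the original")
unrelated = "walnuts are a popular snack in many countries around the world"

print(looks_reprinted(original, lightly_edited))
print(looks_reprinted(original, unrelated))
```

Swapping a single word leaves most shingles intact, so the lightly edited copy is still flagged, while a heavier rewrite would push the score below the threshold. This is why pseudo-original content can slip through.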

After this analysis, I believe you understand the idea. These are only Walnut's own views; take from them what you find useful, and if you disagree, please share your own views!

A few other suggestions:

1. If your site is new and its weight is low, how do you get the spider to find your page first and put it into the database? It is actually simple: use tools such as web bookmarking services and Baidu collection so that spiders find your page faster!

2. Add your own copyright notice and the content page's address to each article. Then you benefit when others scrape it: you may not be indexed first, but in the end you have more links pointing back, and you are still the original.

3. Wait until your article has been indexed on your own site before publishing it on other sites, and include the original address when you do. This approach is very safe, and the chances of being picked up by major sites are high!

Blog: http://www.abseo.cn/

Original article; if you reprint it, please be kind and keep the credit!
