Read the title of the article do not feel surprised, yes, yes. On the search engine crawl process many webmaster friends will encounter Baidu Crawl without WWW domain name situation. A few days ago I do a new station, Lin Dust Secondary school. Now has been found a problem, that is, Baidu is only included without the WWW page, see pictures.
Baidu Address: http://www.baidu.com/s?wd=%C1%D6%B3%BE%D6%D0%D1%A7&pn=10&tn=sitehao123
Again encountered this situation, using a number of simple ways to quickly solve the crawl problem within two days. This article only for new friends, master bypassed. The following specific description:
Problems in the domain name resolution process.
Many friends in the purchase of host, domain name, do not need to record the case directly to the corresponding host, parsing process we are generally two domain names are resolved, namely: www. domain name and. Domain name two the latter without the WWW domain name resolution can also be entered directly without the WWW domain name when the user successfully visited the site. This process is very simple, after parsing is not a problem, not, search engine crawl page is the domain name, which has a process of identification, that is, we identify these two resolved domain name for the preferred domain name, Usually we encounter the capture of the domain name without the WWW is also the search engine that does not take the WWW domain name for our main domain name. Of course, this happens at the same time, more than a few search engines appear at the same time, my web site is Baidu, Google at the same time crawl without WWW domain name, of course, part of the exception.
The above is the general reason for this situation, since the search engine has cognitive errors, then if we want to solve this problem, we need to help the search engine correct cognition, so that the search engine to take the WWW domain name is our main domain name, really want it to crawl.
First, do not take the WWW domain name all do 301 redirects
Novice friends here may be a little puzzled, may say I have never done 301 Redirect, also do not know what is 301, here we do not need to feel very kb, because understand what is 301 is not the key, simple setup problem can be solved. Here to my site for example: Google in the crawl home page, the internal pages continue to show without the WWW, indicating that Google has at this time not with the WWW domain name for the preferred domain name.
Baidu included: 61
Contrast found. Not with www. It's a whole lot more than a collection of www.
Workaround:
After landing the host Control Panel, the general backstage will have the redirect clicks the entrance, enters after chooses 301 redirects here requests that your host must support 301 redirects, but the general host supports this function. All the Web pages that have been crawled by the search engines are redirected to the WWW domain with a 301 redirect to the Internet, which can be solved within 35 days.
Now for a quick example
This can also be an increase in the Web page without WWW information, guide the Web page with www information.
Two, absolute path setting
Understand the web production of friends know the absolute path and relative path settings, we all understand that here is simple, relative path settings page open faster, without considering other issues, we generally recommend to do relative path settings, the advantages of the SEO is self-evident. But what we are saying is that in general, since there is a preference for domain name recognition errors, that is the exception. Modify all relative paths and reset to absolute paths, that is, take our http://www. Domain name. The reason is very simple, but also indirectly told the search engine, without the WWW domain name is actually with the WWW domain name under the page.
Third, page update, attract crawl, re-identify
Since the basic settings are resolved, the next thing is to let search spiders come over, to recognize the preferred domain, this method does not need me to say that we all understand, that is, the content of the site updated, as to update what the best article, that nature is original.
The above three methods basically solve the problem. Finally 60 wish you webmaster Friends 2011 Business is booming, a good luck.
This article from the Forest Dust Secondary school: http://www.lczx188.com welcome your reprint, please keep the link, thank you!