Search engine optimization for dynamic web pages

Source: Internet
Author: User
Dynamically generated webpage:

Human visitors can see dynamic web pages with the naked eye, but to most search engine programs these pages are often invisible. This is why dynamic web pages are difficult for search engine spiders to index. To make your dynamic web pages appear in search results, you need only make sure that your website content is visible to the search engine, that is, that the spider program can read the content of your website.

Dynamic web pages generally refer to pages generated on the fly by ASP, PHP, ColdFusion, CGI, and similar programs. Most of the content on such a page comes from the database connected to the website. The page does not exist as a file in the web space; it is generated and sent to the user's browser only after the server receives the user's request. Because visitors can get the data they want in real time, dynamic web pages tend to make a good impression. They are also easy to maintain and update: to add a new product or adjust a price, for example, the website administrator only needs to make a few simple changes to the database and does not need to modify each static page.

We can see that from the user's perspective, dynamic web pages are really great. However, from the search engine perspective, the conclusion is different.

Problems with dynamic web pages:
The problem is that these dynamic web pages do not actually exist in the web space; they are generated only when a value is supplied for a variable. As we all know, a search engine spider is not intelligent; it is just a retrieval program. It cannot select variables and enter values as a real person does.
A visitor can type a query into a search form, or the query can be included in a link on the home page, making that link a pre-defined directory query. A dynamic script usually needs some information before it can return page content, such as cookie data, a session ID, or an environment variable (query string). A spider, however, has no way of knowing how to use your search function or what to ask. That is why spiders often come to a halt in front of a dynamic website.
In addition, the URL of a dynamic page usually contains a question mark (?) or a percent sign (%). Other symbols such as &, +, and $ are also common in dynamic URLs. The part of such a URL that carries these values is called the "environment variable" (query string). Most spiders, however, cannot interpret the symbol "?". For example, given the URL http://www.americanbooks.com/cgi-bin/items.cgi?Name=naturaldiet, most spiders read only the characters before the "?". To them the "?" in a URL is a "stop flag", so they retrieve only the URL http://www.americanbooks.com/cgi-bin/items.cgi
Obviously, because this truncated URL does not correspond to an actual page, they get nothing. Therefore, if your entire website, or most of its pages, are generated dynamically, you need to modify the site so that spiders can read its content more easily.
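The truncation described above can be pictured with a short Python sketch. It is only an illustration, assuming that a spider simply discards everything from the "?" onward; the function name url_as_seen_by_spider is hypothetical and is not taken from any real crawler.

```python
from urllib.parse import urlsplit, urlunsplit


def url_as_seen_by_spider(url: str) -> str:
    """Return the URL with the query string (and fragment) dropped.

    Hypothetical helper: it mimics a spider that treats "?" as a stop flag.
    """
    parts = urlsplit(url)
    # Keep scheme, host and path; blank out the query and the fragment.
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


dynamic_url = "http://www.americanbooks.com/cgi-bin/items.cgi?Name=naturaldiet"
print(url_as_seen_by_spider(dynamic_url))
# prints: http://www.americanbooks.com/cgi-bin/items.cgi
```

A URL without a query string passes through unchanged, which is why static-looking URLs do not suffer from this problem.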
Some search engines also refuse to retrieve URLs in the cgi-bin directory even when the pages are static (dynamic pages saved as static pages). For example, URLs such as the following may not be retrieved:
http://www.americanbooks.com/cgi/items.html

Why search engines do not read what follows the symbol "?":

Search engine spiders are reluctant to read pages in the cgi-bin directory or URLs that contain the symbol "?". The reason is that if a CGI program can produce an "infinite" number of URLs, the spider may get stuck in an endless loop while crawling these "infinite" pages. This is the so-called spider trap. A database program can create a similar situation for the spider. To avoid such traps, spiders do not read any characters that follow the "?" in a URL.

If a spider is trapped by your server, it is bad not only for the spider itself: the spider will also request your pages over and over again, which can bring your server to a standstill.
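The cautious behaviour described above can be sketched in Python as follows. This is a simplified illustration, not the logic of any actual search engine: the rule in is_potential_trap, the max_pages cap, and the caller-supplied get_links function are all assumptions made for the example.

```python
from collections import deque
from urllib.parse import urlsplit


def is_potential_trap(url: str) -> bool:
    """Flag URLs that a cautious spider might refuse to follow."""
    has_query = "?" in url                      # treat "?" as a stop flag
    in_cgi_bin = "/cgi-bin/" in urlsplit(url).path
    return has_query or in_cgi_bin


def crawl(start_url, get_links, max_pages=100):
    """Breadth-first crawl that skips suspected traps and caps total pages.

    get_links is a hypothetical function that returns the URLs linked
    from a given page.
    """
    visited = set()
    queue = deque([start_url])
    while queue and len(visited) < max_pages:
        url = queue.popleft()
        if url in visited or is_potential_trap(url):
            continue
        visited.add(url)
        queue.extend(get_links(url))
    return visited
```

Skipping suspected traps and capping the number of pages keeps the crawl finite even if a script keeps producing new links, but it also means that every page reachable only through a "?" URL is never visited.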

Optimization schemes for dynamic web pages:
Take http://www.americanbooks.com/cgi-bin/items.cgi?Name=naturaldiet as an example. From the URL we can tell that the website americanbooks.com offers content related to natural weight loss products. Unfortunately, because spiders cannot handle dynamically generated pages, the content on this page cannot be indexed by the search engine. Suppose a competitor has a page with similar content about natural weight loss products, and because that page is static, the search engine indexes it. You can imagine which site users searching for "natural weight loss products" will visit. Even if the content provided by americanbooks.com is better than the competitor's, it does not appear in the search results, so the online sales opportunity goes to the competitor.
The solution is to change the URL of the dynamic page so that it no longer contains "?", "=", and similar symbols and looks like a static URL. The approach differs depending on the technology used to build the dynamic pages:
• CGI/Perl
If your website uses CGI or Perl, you can write a script that picks up all the characters before the environment variable (query string) and assigns the remaining characters in the URL to a variable. You can then use this variable in the URL.
Path information (or the script name) is a variable in a dynamic application that contains the complete URL address, including the environment variable information. The script reads the characters before the environment variable from the path information and assigns the remaining characters to a variable.
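One way to picture the path information approach is the sketch below. It is written in Python rather than Perl, and it assumes a URL layout such as /cgi-bin/items.cgi/Name/naturaldiet, where the web server hands the trailing "/Name/naturaldiet" to the script through the standard CGI variable PATH_INFO. The layout and the function name params_from_path_info are hypothetical choices for illustration.

```python
from urllib.parse import unquote


def params_from_path_info(path_info: str) -> dict:
    """Turn "/name/value/name/value" path information into a dict."""
    # Split on "/" and drop empty pieces caused by leading or doubled slashes.
    segments = [unquote(s) for s in path_info.split("/") if s]
    if len(segments) % 2 != 0:
        raise ValueError("path information must hold name/value pairs")
    # Even positions are names, odd positions are the matching values.
    return dict(zip(segments[0::2], segments[1::2]))


print(params_from_path_info("/Name/naturaldiet"))
# prints: {'Name': 'naturaldiet'}
```

In a real CGI script the argument would typically come from os.environ.get("PATH_INFO", ""), so the script receives the same values it used to read from the query string while the URL itself contains no "?" or "=".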
The major search engines do, however, index web pages that use SSI (server-side includes, that is, content embedded on the server side). Web pages with the .shtml suffix are also parsed as SSI files and are treated like ordinary .html files. However, if these pages use the cgi-bin path in their URLs, they may still not be indexed by the search engine.

ASP

ASP (Active Server Pages, a server-side technology for developing dynamic web pages) is used on Microsoft-based web servers. Web pages developed with ASP generally have the .asp suffix. ASP is favored by many developers because it is simple and practical. With Microsoft's strong support, it can be said to be one of the most popular web development languages today. As long as the URL avoids the symbol "?", most search engines support dynamic web pages developed with ASP.

What should you do if the URL does use "?"? To solve this problem, a company named Exception Digital Enterprise Solutions launched a solution called XQASP. This add-on software converts the symbol "?" in the URL to "/" on the web server. For more information, see http://www.xde.net/products/product_xqasp.php. There are also some relatively inexpensive solutions, such as aspspiderbait and portalpagefilter.

ColdFusion

If you are using ColdFusion, you need to reconfigure it on your server. Web pages developed with ColdFusion generally use .cfm as the file extension. Normally the database uses the symbol "?" to retrieve a page. You can reconfigure ColdFusion so that it replaces the symbol "?" in an environment variable with the symbol "/" and passes the replaced value to the URL. The browser then sees what looks like a static URL. That is, for the URL http://www.americanbooks.com/items.cfm?Item_id=11667, the URL finally returned to the browser has "/" in place of "?". The spider no longer encounters the symbol "?", so it can continue to index the entire dynamic page, and your dynamic page remains readable to the search engine.

Apache server

Apache is one of the most popular HTTP servers. It has a rewrite module called mod_rewrite, which lets you convert URLs containing environment variables into a form that search engines support. However, mod_rewrite is not enabled by default in an Apache installation, so you should check with your hosting company whether this module is available on your server.

Taking a URL like the one above as an example: once mod_rewrite is in use, the URL http://www.americanbooks.com/items.php?Cat=natural_diet takes the following form: http://www.americanbooks.com/natural_diet/index.php
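mod_rewrite itself is configured inside Apache, not in Python, but the mapping it performs can be imitated in a few lines to show the idea. The sketch below assumes the rewriting works in this direction: the visitor or spider requests the static-looking path, and the server internally maps it back to the dynamic one. The regular expression and the function name to_internal_path are hypothetical and only cover the single URL shape used in the example.

```python
import re

# Assumed public URL shape: one category name followed by /index.php.
STATIC_STYLE = re.compile(r"^/([A-Za-z0-9_-]+)/index\.php$")


def to_internal_path(public_path: str) -> str:
    """Map a static-looking path back to the dynamic script and its query."""
    match = STATIC_STYLE.match(public_path)
    if match is None:
        return public_path  # not a rewritten URL; leave it untouched
    return "/items.php?Cat=" + match.group(1)


print(to_internal_path("/natural_diet/index.php"))
# prints: /items.php?Cat=natural_diet
```

The spider only ever sees the path without "?", while the script behind it still receives the category as an ordinary parameter.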

Submitting dynamic pages directly through a paid indexing program:

Submitting a dynamic website directly to AltaVista can increase its chances of being indexed. If you submit it through the paid indexing program of AltaVista or Inktomi, the website will be included in the index.

File Extension:

The suffix of your file name is no longer a problem for search engines. Even if your web page does not use the .html or .htm extension, it can still be indexed as long as the URL does not contain the symbol "?". The search engine Northern Light is particularly flexible in this respect and can index files with the .html, .htm, .shtml, .stm, .asp, .phtml, .cfm, .php3, .php, .jsp, .jhtml, .asc, .text, and .txt extensions. However, it still does not support .cgi files. If your website contains files with an unsupported extension, it will notify you during the website submission process.

Conclusion:
Ask yourself whether you really need dynamic web pages. A database is often used simply as a page generation tool. You can use it to create static pages instead, especially for areas of the website that are not updated frequently. To make sure your content is indexed, also consider creating static web pages for your dynamic content.
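Generating static pages from a database can look roughly like the following Python sketch. It uses the standard-library sqlite3 module as a stand-in for whatever database the site actually runs on; the products table, its slug, title and body columns, and the function name export_static_pages are all hypothetical. It also assumes that slug values are already safe to use as file names.

```python
import html
import sqlite3
from pathlib import Path


def export_static_pages(conn: sqlite3.Connection, out_dir: Path) -> list:
    """Write one static .html file per row of the (assumed) products table."""
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    rows = conn.execute("SELECT slug, title, body FROM products ORDER BY slug")
    for slug, title, body in rows:
        # Escape database text so it cannot break the generated markup.
        page = (
            "<html><head><title>{t}</title></head>"
            "<body><h1>{t}</h1><p>{b}</p></body></html>"
        ).format(t=html.escape(title), b=html.escape(body))
        target = out_dir / (slug + ".html")
        target.write_text(page, encoding="utf-8")
        written.append(target)
    return written
```

Running such an export whenever the database changes leaves plain .html files on the server, so a spider can index them without ever meeting a "?" in the URL.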
Most search engines are still not good at indexing dynamic web pages, because crawling dynamically generated pages can send the spider into an endless "trap" from which it cannot escape. However, the situation is gradually improving. Google was the first search engine to support dynamic web pages: by the end of 2000 it was able to index dynamic pages, including those with "?" in their URLs. Hotbot and other search engines are also beginning to support dynamic pages.
Although these search engines support dynamic web pages, they protect their spiders from endless loops by not following links from dynamically generated pages very deeply. So I suggest you spend a little more time on your dynamic web pages and make sure that each of them can be indexed by the search engine. In the end, you will find that the time you spend is a great investment, and it can also bring more traffic to your website.
