Use a station statistic data to talk about SEO and search engine

Source: Internet
Author: User

Intermediary transaction http://www.aliyun.com/zixun/aggregation/6858.html ">seo diagnose Taobao guest cloud host technology Hall

Do the internet has been 2 years, has been to do technology, in the network operation is a blank, really ashamed to say. A while ago decided to do a standing practice practiced, for the future from the technology to the operation of paved road.

Since never done operation and website promotion, the operation of friends do not know a lot of, so do not have the possibility of exchange links. And I just do the station, so a little bit of traffic exchange also really sorry others. After thinking and thinking, for me the fastest is also the most feasible way to operate the site is SEO, and use SEO to do site operations and site promotion and technology closer to a point, the start is also relatively fast. So read a lot of SEO information, whether it is SEO optimization site, or with the SEO to cheat traffic, can see all see. But the purpose of writing this article is not to tell you how to use SEO optimization is not to teach you how to cheat with SEO, after all, I have just learned, and SEO materials and materials have been quite a lot. I certainly did not write well. I just want to use my rice station some data to search engines and SEO to build some assumptions, and then speculate some conclusions, and finally we discuss, hope that we can make progress together.

OK, the nonsense does not say, first said I rice station basic situation. My station is a novel navigation website (many fan novel House http://www.duomimi.com/), the basic idea is that each big novel website's novel material collects, then classifies and puts on my website, By visiting my website, the user can search for all the novel Materials of several big novel websites and click to watch, and also can see the rank, recommendation, update and other information of each station. Well, not much to say, otherwise people should think I was in AD.

First is the website development, the interface imitates hao123, uses the most simple convenient asp+access, also because my 400mb virtual space space only supports ASP and access, the system altogether only has 4 pages, respectively is index.asp (homepage), list.asp (List page ), Search.asp (Search page) \bookreader.asp (Detail page), one day's time is fixed. The home page also made a template used to generate static pages (because the server space is limited, not all the pages are generated static pages, sorry Ah!) It is important to generate static pages for dynamic pages, and search engines prefer static pages, as indicated in all official search engine statements. Next is the data collection, first selected 5 novel sites, is the beginning of Chinese, novel reading, tea, Xiaoxiang, Sina Reading, and then wrote a program, automatic data collection and save to the SQL Server database, spent 2 days. Probably collected more than 190,000, and finally the data manually imported to access (there is also a small episode, because Access database is a single file Single-user, the function is very limited.) Cannot write the stored procedure, so the paging can only use the ASP's Recordset object, each match the condition the result all to put in the memory, then the paging, my 190,000 data puts each time in the memory, then then takes out 20, the speed and occupies the memory to be imagined. So here's a little trick to teach, is to put each page of data to add a field to indicate the number of pages he appears, so that each time you only need to find the data on this page can be, and later data increase or decrease, only need to use the Biga algorithm to change the page field problem solved. Another headache problem is that access does not have Full-text search. So the basic data search by like, I have done testing, the amount of data over 20,000 there may be memory overflow phenomenon, the solution to this problem is nothing else, only to build inverted index. This is my use access to large data processing encountered two problems and solutions, it is a discussion.

The website completes, the data collection completes, installed 51la free flow statistics, does a search engine tracker, starts to do the experiment. On line more than 20 days, did not do any promotion, only posted in post posts (very lucky to have a post was top up). Traffic, 60% is Baidu search engine, 16% repeat customers, 16% Bar (that is the top of the post), others are other search engines come. Basic statistical information and records are as follows:

(Here is a question to say, my domain name and space is last September before and after the application, but put a garbage system is no longer the tube.) So I am in long before this domain name has been BD and GG included, but only less than 10 pages of the collection, so I did not spend too much time to let search engines included me, but again let the search engine to retrieve my site just again

To show you two data, I recorded the BD and GG Search robot (BOT) every day to take my site number of times. (pictured)

ok! now start analysis, first of all, the design of the Web page, no frame, no need for Ajax, all the links are added title, there is no hidden and stack keywords, no color links, that is, did not carry out SEO cheating. (Note: The following assumptions and analysis only for my current views, not necessarily correct, I hope you can also analyze, point out that I am not the place)


Phenomenon 1
That's what I wrote in the title.--duomimi Novel Home---youth Campus | prose | fantasy novel | Fiction series | Supernatural Horror | short story |

Supernatural Reasoning | Fairy Tales | romance novels | Web fiction | history Martial arts martial arts novels. Search results show me in the "Youth Campus short story" This long tail

The keyword is the first.
That:
Page keywords mainly according to title tags in the key words, but the title of the keyword piling is not used, will only take the first keyword as your homepage of the main keywords and on the search engine index to establish relevance ranking.

Phenomenon 2
The same time BD included 1170,gg included 17. A lot of difference.
That:
Predecessors said is right, BD interested in the new station, and GG to the new station has the test period, the test time is certainly more than 20 days!

Phenomenon 3
BD robot daily search times vary greatly, and GG daily search is more stable. But BD search the number of pages and included the number of pages, and GG Search the number of pages and the difference is very far
That:
BD for the new station is not afraid, how many received. As long as it is climbing down the page, as long as there is no cheating, are included up first. and GG Crawl page will not immediately put up. Throw it or put it somewhere you don't know.

Phenomenon 4
Suppose the depth of the home page is 0, the inner link on the home page is 1, and the inner link on the page with depth 1 is 2, and so on, the more in-depth the page search
Suppose: Search engine is more interested in the depth of the page, may use this method to judge the updated data, when determining the hidden layer of the page data are not

Change to continue to climb down. So do the station must always update, and the newer things put the more dive better, do not hide.

Phenomenon 5
Search engine included in the page of my search page is the largest proportion (Bookreader page is I later added to the previous did not).
Suppose: Search engine does not like the list page, more like the detailed page, as the method of determination is estimated mainly through the number of links within the search page of the few links, basically are outside the link. So it took my search page as a detail page.

Phenomenon 6
I added a page bookreader page, the user clicked on the novel name no longer directly open the novel page, but into my Bookreader page, that is, I put the previous external connection into the link. As a result the next day, almost all search engines had fewer searches.
Let's say that search engines hate the change of links within pages. So try not to arbitrarily change the inner link of the page.

Phenomenon 7
Each time you search the list page, you will search the next page for a long time, and the Bookreader and search pages will be separated for a short time.
Suppose: Because there are many links in the list, and there are many links in Bookreader and search pages, there is a limit to the number of new links that may be included in BD every day. That is, every day to collect you so many links, and this number should be different from the station, I calculate, my station should be in 3000~4000 around

Phenomenon 8
Today, BD's search for me suddenly turned dozens of to more than 1000.
Suppose: now have not figured out what is going on, to see the changes in the future, it is certain that I did not make any changes to the station, just every 5 minutes to update the homepage. Is it the upgrade of my station??

So far to think of these 8 phenomena, I will continue to track and reply to the analysis. I hope we can discuss it together.


Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.