The "Hilltop" theory: exploring Google's new ranking algorithm

Source: Internet
Author: User
Does the Hilltop algorithm run in real time?
Google's server architecture consists of roughly 10,000 Pentium-class servers distributed across the network. Once you understand how the Hilltop algorithm works, it is hard to believe that servers of this class can handle it in real time. Imagine the steps: first find the "expert documents" among thousands of topical documents, then compute a score for the target page from the links in those expert documents, then pass that value to the other ranking systems in Google's algorithm for further processing. All of this would have to finish in about 0.07 seconds to sustain Google's world-renowned search speed. That seems implausible.
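As a rough illustration of the steps just described, here is a minimal Python sketch. It is not Google's implementation: the names `ExpertDoc`, `find_experts`, and `hilltop_score`, the toy data, and the scoring rule (one point per expert document that links to the target) are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class ExpertDoc:
    """Hypothetical expert document: a page on some topics that links to others."""
    url: str
    topics: set
    links: set = field(default_factory=set)


def find_experts(docs, query_topic):
    # Step 1: pick the expert documents that cover the query topic.
    return [d for d in docs if query_topic in d.topics]


def hilltop_score(target_url, experts):
    # Step 2: score the target by counting the experts that link to it.
    # (Assumed scoring rule; the real weighting is not given in the article.)
    return sum(1 for d in experts if target_url in d.links)


docs = [
    ExpertDoc("dir-a.example", {"cameras"}, {"shop1.example", "shop2.example"}),
    ExpertDoc("dir-b.example", {"cameras", "lenses"}, {"shop1.example"}),
    ExpertDoc("dir-c.example", {"gardening"}, {"shop2.example"}),
]

experts = find_experts(docs, "cameras")
# Step 3: these scores would be handed on to the other ranking systems.
scores = {t: hilltop_score(t, experts) for t in ("shop1.example", "shop2.example")}
print(scores)
```

Even in this toy form, every query requires a pass over the document collection followed by a pass over the experts' links, which is why doing it per query at full scale looks expensive.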
How often the Hilltop algorithm runs, and what it covers
We believe that, to preserve its consistently "lightning-fast" search speed, Google runs batch processing periodically for frequently used (popular) query terms (the so-called "commercial word" blacklist) and stores the results for later use. Google's database holds a large number of frequently queried terms, collected from the keywords used in actual searches and in its AdWords self-service advertising system. Google may set a threshold on search frequency: every query term whose search frequency exceeds the threshold is included in the Hilltop system, which then periodically runs batch processing over all the collected terms, perhaps once a month. Smaller incremental batches may run more often.
Likewise, the databases on Google's ten thousand servers would be updated in sync after each monthly Hilltop batch run, while updates from the smaller batches would be more frequent.
For terms that users do not query frequently, and that therefore never make it into the Hilltop system, Google will still use the original algorithm and show the original rankings. Highly specific or specialized keywords can therefore be expected to keep their original rankings, because they fall outside the scope of the new algorithm.
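The hypothesis in this section (a frequency threshold, a periodic batch run, and a fallback to the original ranking) can be sketched as follows. This is only an illustration of the idea under stated assumptions: `FREQUENCY_THRESHOLD`, `select_popular`, `run_batch`, `serve`, and the two stand-in ranking functions are hypothetical names, and the threshold value is arbitrary.

```python
from collections import Counter

FREQUENCY_THRESHOLD = 3  # hypothetical cutoff; the real value is unknown


def select_popular(query_log, threshold=FREQUENCY_THRESHOLD):
    # Keep only the query terms whose search frequency exceeds the threshold.
    counts = Counter(query_log)
    return {q for q, n in counts.items() if n > threshold}


def run_batch(popular_queries, hilltop_rank):
    # Periodic (for example monthly) batch: precompute and store the results.
    return {q: hilltop_rank(q) for q in popular_queries}


def serve(query, cache, original_rank):
    # At query time, use the stored Hilltop result if there is one;
    # otherwise fall back to the original algorithm.
    if query in cache:
        return cache[query]
    return original_rank(query)


def hilltop_rank(query):
    return [f"hilltop:{query}"]  # stand-in for the expensive computation


def original_rank(query):
    return [f"original:{query}"]  # stand-in for the original algorithm


query_log = ["cheap flights"] * 5 + ["rare orchid fungus"]
cache = run_batch(select_popular(query_log), hilltop_rank)

print(serve("cheap flights", cache, original_rank))
print(serve("rare orchid fungus", cache, original_rank))
```

The design point is that the expensive work happens offline, so serving a popular query costs only a lookup, while rare queries never touch the Hilltop path at all.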

Why did it take so long to put the Hilltop algorithm into use?
Google obtained this patent as early as February 2003. Before putting it into use, however, Google first had to make sure the new method was fully compatible with the PageRank and page-relevance systems it used at the time. That means extensive compatibility testing, then evaluating the results produced by the integrated algorithm, then fine-tuning, then further complex testing... I think all of this takes a lot of time.
Disadvantages and defects of Google's new algorithm
On further analysis, we found that the algorithm has several defects and shortcomings:
Hilltop assumes that every expert document is completely impartial and free of deception or manual manipulation. In practice, the situation may not be so ideal. A small flaw in the expert documents can have a large negative impact on rankings.
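A toy example makes the sensitivity concrete. The sketch below assumes a simple vote-counting rule (each expert document gives one vote to every page it links to); that rule, the function name `rank_by_expert_votes`, and the data are assumptions for illustration, not the actual algorithm.

```python
def rank_by_expert_votes(expert_links):
    # Count one vote per expert link, then order targets by votes (ties by name).
    votes = {}
    for links in expert_links.values():
        for target in links:
            votes[target] = votes.get(target, 0) + 1
    return sorted(votes, key=lambda t: (-votes[t], t))


# Three impartial experts: site-a gets two votes, site-b gets one.
honest = {
    "expert-1": {"site-a"},
    "expert-2": {"site-a"},
    "expert-3": {"site-b"},
}

# One expert is manipulated and now links to site-b instead of site-a.
manipulated = {**honest, "expert-2": {"site-b"}}

print(rank_by_expert_votes(honest))
print(rank_by_expert_votes(manipulated))
```

When only a handful of expert documents cover a topic, changing a single one is enough to reverse the order of the results.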
