Looking at the ASP service of search engine from Chinese net

Source: Internet
Author: User
Keywords Search engines work

Intermediary transaction SEO diagnosis Taobao guest Cloud host technology Hall

Abstract: According to the Sixth China Internet Information Center Statistics report, search engine is the second largest Internet application after email. 55.91 of the Internet users in our country use search engines to provide search services. An excellent search engine is essential to attract users and increase the stickiness of the site. But search engine is a very high technology content of the Internet application, development and operation need to spend a lot of manpower and material resources, very expensive.

The
Chinese web site (www.chinaren.com) is a portal for the global Chinese generation. Chinese web site successfully entered the top 10 in the Sixth China Internet Impact Survey conducted by China Internet Information Center in July 2000. As a portal, its main goal is to build brands and attract customers.   And to attract users, the most important means is to provide users with the best service. Chinese web sites from the outset understand that the advantages of the portal is not to do search engine development, they have been looking for partners to search engine cooperation.


Baidu is a company specializing in search engines and related software development of the Internet companies, it is ASP way to provide technical services to the portal site. When Baidu Company launched in May this year Baidu search engine, the Chinese website decided to cooperate with Baidu. In the joint efforts of both sides, only in a short period of two weeks, the launch of China's first query-style search engine-"Monkey King search Engine", to provide users with a good Internet search service.


High Reliable service


Internet Search service requires uninterrupted service for 24 hours a day, 7 days a week. The Chinese website requires the system to provide search services within 99.9% of the time. Baidu Company as a search engine services to provide ASP, using a number of technical means to ensure that the Chinese web search service high stability and reliability.


1, high-availability architecture

Each part of the
system adopts the redundancy design of N. When an exception occurs in a work module, the redundant module can be started immediately. The system can automatically detect the exception of the module, and switch. Redundant modules are configured in two ways:


Standby redundancy (Standby redundancy) redundant modules are not used when the working module is normal, only when the work module is abnormal, once the original work module is restored to normal, the system will automatically cut back to the working module, and the redundant module is returned to standby state. Active redundancy (active redundancy) redundant modules and work modules work together to backup each other. Under normal circumstances, the redundant module and the work module share the system load, when a module is abnormal, the work that the module undertakes is assigned to other modules, when the abnormal module returns to normal, it will be automatically put into work.


High Availability architecture guarantees high reliability services for the system.


2, System anomaly alarm system


when the system module is abnormal, in addition to the redundant module immediately put into use, but also to the engineer alarm, so through manual intervention, as soon as possible to repair the abnormal system. To this end, Baidu company developed a system of abnormal alarm system.


system of abnormal alarm system is: The system monitoring program regularly detects the working status of the monitored objects, if detected any abnormal state, immediately to the system engineers and related personnel mobile phone send short message, or to the pager to send messages to the alarm.

The
system anomaly alarm system runs on a machine that is different from the object being monitored for remote monitoring. System exceptions can be detected within half a minute.


3, System abnormal intervention


system engineers in the system after receiving abnormal alarm, to intervene immediately, in the shortest possible time to restore the normal work of the system. Baidu to provide ASP services to the machine hosted in China Telecom. During normal working hours, Baidu uses the company's local area network to surf the Internet; Telephone dial-up Internet access can be through the ISP, but at the peak time the ISP's phone is very busy, may not be successful, for this reason, Baidu in the server's hosting application for a dedicated phone, specifically for dial-up Internet. In addition, if the company's LAN can not be connected to the Internet, this phone can also be used as an Internet backup.


Flexible Customer Strategy


is a subject that ASP faces. Portals have high quality Internet search service needs, but different customers have their own different characteristics needs. For the Chinese web site, because it is targeted at young people's website, so the pursuit of new, strange, cool. In response to this requirement, Baidu designed a set of flexible customer strategy.


Baidu to mention "content clustering" and "site clustering" and other rich features for users to choose. Users can control the display style and display content of the search results page. In the display content, in addition to search results from Baidu search engine returned by Baidu Control, all other content (including advertising, links to other pages, etc.) are controlled by the customer. The way and content of the search results can also be customized, such as the properties of the page (title, abstract, encoding type, date, length, URL), the color of the matching keyword, the link between the different pages can be determined according to customer preferences.


Baidu from the technical work closely with the realization of customer customization requirements. Baidu currently has three models to exchange search results with customers:


Service Application Protocol mode (Kyoto model) This mode employs the customer/waiter model. Customer's program and Baidu's search server for communication, communication protocol using Baidu definition of BSP (Baidu search Kyoto). This way completely by the Customer Control results page display mode, Baidu only returns the retrieved URL and its properties. The most flexible mode, the highest degree of customer control, the disadvantage is to achieve more complex, the customer needs a certain degree of programming capabilities.


Service Application Template mode (Template model) user defined in advance the results page display content and display, the results of the page entirely by the Baidu search engine construction. The user does nothing but work with the user's web interface. The advantage of this model is simple, easy to implement, the disadvantage is that the customer control system is low, the template changes the frequency and flexibility is very poor.


Service Application Protocol template mode (Kyoto Template model) This pattern is a combination of the above two modes. The customer uses the display template to define the result page part content (often is the retrieval result) the display way, the remaining part is decided by the customer, the entire result page construction by the customer completes. Search (if there is no results, there is no "related search" characteristics such as the results, the number of services are normal) to follow the Baidu and customer predefined agreement. This model is both easy to implement and provides users with good flexibility.

The
Chinese website uses the Service Application protocol template pattern. Chinese websites want to provide a Chinese search engine that can understand natural questions, so the site needs to know the retrieval of each query and handle it accordingly. If a search has results, the site will be Baidu search engine returned the URL and their own ads integrated together to display to the user. If the query has no results, the site will be cut word processing, with the word processing to construct a query string, again to Baidu search engine submission, get the results of the search.


Baidu search engine Technical characteristics


Baidu search engine has the advantages of fast response, accurate search results, strong timeliness, less invalid links, consistent with Chinese language characteristics and Chinese use habits.


1, using intelligent Chinese language processing technology


Baidu search engine uses unique Chinese language processing technology to skillfully solve the Chinese information Understanding problem, information index based on words and words, better solve the simple word based or simple based on the shortcomings of the word, combined the advantages of both, more in line with the Chinese user's search habits. Baidu search engine supports the mainstream Chinese coding standards (including GBK, GB2312, BIG5) and can be converted between different encodings.


2, using scalable search technology


Baidu Search uses intelligent web Spiders (Spider) to automatically search for information on the Internet, customizable, highly scalable scheduling algorithm allows the searcher to collect the largest amount of Internet information in a very short time, covering most Mandarin-speaking areas and parts of North America and Europe.


3, using intelligent correlation evaluation Algorithm


Baidu search engine based on content and link analysis based on the method of relevance evaluation, can objectively analyze the information contained in the Web page, so as to maximize the results of the search and the user query string has a high correlation.


4, using efficient search algorithm


uses advanced technology such as multithreading, efficient search algorithm and stable UNIX platform, so that the average response time of each retrieval is less than 1 seconds.
Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.