Crystallization of Technology and Humanities: A Discussion on Search Engine Technology

Recreation

Faced with a vast ocean of information, people are often at a loss. The Internet search engine arrived like a boat that carries us freely across that ocean, and search engines quickly became a powerful tool for acquiring knowledge.

As an essential tool for the Internet, search engines are becoming increasingly popular. In addition, as Internet applications continue to deepen, search engines are becoming an important network infrastructure.

As infrastructure, the search engine has three characteristics. First, it is essential: without search engines, half of the world's web pages would have no practical value. Second, it covers a wide range of technologies: search engine technology involves system technology, network technology, multimedia technology, language processing technology, artificial intelligence technology, and more. Third, it is becoming less and less visible: as the number of specialized search engine vendors grows and new search technologies emerge, search engines take many different forms, and users may not even notice them when using them.

Technical nature

Technology comes from demand. Diverse needs lead to diverse technical implementations, and that diversity gives our world its harmonious beauty.

The earliest search engines broke down the barriers between directories and simply listed the results. Subsequently, technologies such as relevance ranking of results, logical (Boolean) queries, and searching within results improved search accuracy; ranking documents by importance and analyzing user behavior brought results more in line with user psychology. Today, natural language understanding, intelligent query, vertical search, and other technologies make searching easier, more valuable, and more attractive to users.

Differences in requirements lead to differences in technical applications, and differentiation is the foundation of new products. Segmented demand and varied technical means have produced a flourishing field of search engine products.

Traditional and Modern

Search engines existed and played a role even before the Internet, in traditional fields such as intelligence retrieval, book retrieval, and news publishing. There, the search scope evolved from simple text to large-capacity databases, and search technology evolved from keyword searching to full-text retrieval.

The rapid development of the Internet has changed everything, and the new network search engine represents a qualitative leap over the traditional search engine. In terms of data volume, traditional search engines face slowly growing, limited data (tens or hundreds of thousands of records are most common), whereas network search engines face fast-growing and almost unlimited data: Google can search 2 billion pages. Changes in quantity bring about changes in quality.

The algorithms used by traditional search engines become very clumsy in the face of massive data, and the data structures used by traditional search engine technologies cannot represent data at that scale. Traditional search engines mainly run in a standalone structure, whereas network search engines work in a distributed environment. Therefore, modern network search engine technology is fundamentally different from traditional search engine technology in terms of algorithms, computing environments, and theoretical models. The application of many technologies in combination, together with a concern for the human user, has brought network search engine technology to a new height.
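As a rough illustration of the move from a standalone structure to a distributed one, the Python sketch below splits an index across several shards and answers a query by asking every shard and merging the answers. The shard count, the CRC32-based assignment, and the names shard_for, build_shards, and scatter_gather are assumptions made for this example, not details from the article; a real system would place each shard on a separate machine.

```python
import zlib
from collections import defaultdict

NUM_SHARDS = 3  # assumed number of index partitions, chosen for illustration


def shard_for(page_id):
    """Pick a shard deterministically from the page id (hypothetical scheme)."""
    return zlib.crc32(page_id.encode("utf-8")) % NUM_SHARDS


def build_shards(pages):
    """Build one small term -> page ids index per shard."""
    shards = [defaultdict(set) for _ in range(NUM_SHARDS)]
    for page_id, text in pages.items():
        local_index = shards[shard_for(page_id)]
        for term in text.lower().split():
            local_index[term].add(page_id)
    return shards


def scatter_gather(shards, term):
    """Send the term to every shard and merge the partial answers."""
    hits = set()
    for local_index in shards:
        hits |= local_index.get(term.lower(), set())
    return hits


pages = {
    "p1": "distributed search",
    "p2": "search engine",
    "p3": "web cache",
}
shards = build_shards(pages)
matches = scatter_gather(shards, "search")
```

Because each page lives in exactly one shard, the merged answer is the same as a single-machine index would give, while the indexing work and storage are spread across shards.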

Despite the leap and the differences, modern and traditional search engines share the same goal: queries that are complete and accurate. The new environment, however, gives the new technology more meaning. In terms of structure, traditional search engines have two main parts, index and query, while modern search engines have four: collection (the role of the robot or spider), index, query, and result processing. In terms of core technologies, modern search engines remain inseparable from traditional indexing and word segmentation technologies. Advances in traditional search engine technology are quickly applied to modern search engines, and the development of modern search engine technology has in turn greatly promoted the development of traditional technologies. Whenever a new technology is integrated into search engine technology, a new search engine is born.
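The four parts named above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a description of any real engine: the in-memory pages dictionary stands in for what a spider would collect, the index is a simple inverted index, tokenize is a naive stand-in for word segmentation, and all function names are hypothetical.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Naive word segmentation: lowercase, then split on non-alphanumerics."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Index stage: map each term to the set of page ids containing it."""
    index = defaultdict(set)
    for page_id, text in pages.items():
        for term in tokenize(text):
            index[term].add(page_id)
    return index


def search(index, query):
    """Query stage: return ids of pages that contain every query term."""
    terms = tokenize(query)
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


def process_results(page_ids):
    """Result processing stage: here, just a stable alphabetical order."""
    return sorted(page_ids)


# Collection stage: a fixed dictionary stands in for pages fetched by a spider.
pages = {
    "a": "Search engines index web pages",
    "b": "A spider collects web pages",
    "c": "Users query the index",
}
index = build_index(pages)
hits = process_results(search(index, "web pages"))
```

The query stage reuses the same tokenize function as the index stage, so a query term is matched in exactly the form in which it was indexed.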

As times change, traditional technologies can suddenly take on a new role in a new environment and become new technologies, just as the clothing styles of decades ago can become fashionable again tomorrow.

The original Internet directory classification was considered simply not "technical" because it was too "manual." However, after several cycles, many people will demand more of directory classification and more of the "manual" work, because although it is done by hand, human knowledge is more valuable; the knowledge economy era will create "knowledge workers."

Integrated Technology

The times are evolving and new demands are constantly emerging, prompting the continuous emergence and integration of technologies.

Modern search engine technology draws on the theory and techniques of information retrieval, databases, data mining, system technology, multimedia, artificial intelligence, computer networks, distributed processing, digital libraries, natural language processing, and many other fields, which makes it a comprehensive technology.

From the perspective of the collection process, hyperlink analysis is a core technology. Faced with an almost infinitely broad Internet, deciding how to obtain the required links and index them takes a great deal of thought, and analyzing the "value" behind each link requires even more intelligence. This analysis is a form of mining massive data. Compared with the wide range of static web pages, dynamic web pages contain more valuable information, but dynamic web page technologies (such as ASP, JSP, and CGI) are numerous and constantly evolving. Coupled with a complex network environment, this makes the collection process exceptionally heavy.
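One widely known way to estimate the "value" behind links is PageRank-style iteration, in which a page is worth more when valuable pages link to it. The article does not name a specific algorithm, so the Python sketch below is only an assumed example; the link graph, the damping factor of 0.85, the iteration count, and the function name pagerank are illustrative choices.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank over a dict of page -> list of linked pages."""
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Every page starts each round with the same small baseline share.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Pass this page's rank on equally to the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its rank over all pages.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical link graph: "a" and "b" both link to "c", and "c" links to "a".
graph = {"a": ["c"], "b": ["c"], "c": ["a"]}
ranks = pagerank(graph)
best = max(ranks, key=ranks.get)
```

In this small graph, "c" ends up with the highest score because two pages link to it, and "b" ends up lowest because nothing links to it. A crawler could use such scores to decide which links to fetch and index first.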

From the perspective of the index process, network search engines use not only traditional search engine technology but also database technology, web cache technology, multimedia technology, and distributed storage and computing technology. In addition to indexing web pages, they also need to index various media, including text, animation, audio, video, and other special file types (such as PDF and XML).

Technically, the query is the inverse of the index process: the index exists to be queried. However, queries also involve user input, proxies, word segmentation, and natural language processing. Applying these technologies realizes the value of the index and makes search engines simpler and more useful to users.

Presenting the best query results to users is the final goal of the search engine. In general, result ordering relies on relevance ranking. It also uses techniques such as removing duplicate web pages and analyzing user behavior, and it may use caching to serve users copies of web pages that have since expired.
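A minimal sketch of result processing in Python, assuming two simplifications that the article does not specify: duplicates are detected by hashing the normalized text (so only exact copies after normalization are caught), and relevance is scored with a basic TF-IDF formula. The function names and sample pages are hypothetical.

```python
import hashlib
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it on non-alphanumeric characters."""
    return re.findall(r"[a-z0-9]+", text.lower())


def deduplicate(pages):
    """Keep the first page seen for each distinct normalized token sequence."""
    seen = set()
    unique = {}
    for page_id, text in pages.items():
        normalized = " ".join(tokenize(text))
        fingerprint = hashlib.sha1(normalized.encode("utf-8")).hexdigest()
        if fingerprint not in seen:
            seen.add(fingerprint)
            unique[page_id] = text
    return unique


def rank(pages, query):
    """Order matching pages by a simple TF-IDF relevance score, best first."""
    docs = {page_id: Counter(tokenize(text)) for page_id, text in pages.items()}
    n = len(docs)
    scores = {}
    for page_id, counts in docs.items():
        total = sum(counts.values())
        score = 0.0
        for term in tokenize(query):
            df = sum(1 for other in docs.values() if term in other)
            if df == 0 or total == 0:
                continue
            idf = math.log((1 + n) / (1 + df)) + 1.0  # smoothed inverse document frequency
            score += (counts[term] / total) * idf
        if score > 0:
            scores[page_id] = score
    # Sort by descending score; break ties by page id for a stable order.
    return sorted(scores, key=lambda page_id: (-scores[page_id], page_id))


pages = {
    "a": "search engine search",
    "b": "Search engine search!",  # same content as "a" after normalization
    "c": "engine room",
    "d": "web cache",
}
unique_pages = deduplicate(pages)
ordered = rank(unique_pages, "search engine")
```

Here page "b" is dropped as a duplicate of "a", page "d" is left out because it matches no query term, and "a" ranks ahead of "c" because it matches both terms.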

The above describes the four components of the network search engine. In practice, system and distributed technologies such as cluster technology, network cache technology, and distribution technology also support the search engine and keep it running stably. More importantly, to show concern for the human user, the network search engine must use intelligent and personalized technologies in the human-machine interface.

Technologies in other fields will inevitably drive the development of search engine technology. New standards and applications also promote the development of modern search engines. For example, with the emergence and widespread use of XML, search engines will certainly provide full support. The development of P2P and grid computing will also enable more applications for search engines.

Customer first

New technologies emerge one after another, and the development of technology will never end, but there is no such thing as pure technology. Over-commercialization once led technology away from its nature. Now that enterprises proclaim "customer-centric," technology is also returning to its essence.

What is the best search engine technology?

User satisfaction is the first level. The direct purpose of using a search engine is to find the information one needs. As long as the search engine is "comprehensive" and "accurate," the user will be satisfied. If the results are further optimized to make them more useful, the user will become highly loyal to the search engine. For users, the technology itself does not matter. To achieve this goal, the technology will keep improving from lower to higher levels, and customers will be satisfied as long as their needs continue to be met.

User happiness is the second level. Technology comes from needs and serves the needs of users. If technology can discover the needs behind users' stated needs, or needs they did not expect, and fulfill them, users can enjoy the pleasure that technology brings. When a search engine gives a user not just a search result but the "authoritative" result he is most interested in, he is happy. In the tide of the knowledge economy, every search can satisfy his desire to "learn."

However, technology cannot implement itself, and it cannot be used without capital. To collect more web pages and respond faster, a search engine requires a very large number of servers, and funding temporarily limits the use of technology. Without a market, good technologies will be abandoned. Here, proper commercialization promotes the development of technology. For example, commercial techniques such as advertising and paid ranking enrich search engines and meet the needs of some users. But naked commercialization will also drive users away. Therefore, in implementing technology, a user-oriented strategy is the best technical strategy.

Development and Future

The Internet makes technology ever-changing. As part of the infrastructure of the knowledge economy, search engines will surely receive more attention and development, and search engine technology is full of opportunities and challenges.

"User-centric" is a constant principle. To meet user needs, user segmentation is the key. Industry users, enterprise users, and individual users have different needs. Industry users need search engines to connect information islands and achieve professional information sharing. Enterprise users place higher demands on knowledge management as they become "learning enterprises," and the role of search engines there will be very prominent. Although individual needs vary, individual users need a key to open the door when facing the huge Internet, and the search engine is a golden key that can satisfy people's desire to learn. Therefore, "knowledge" becomes the key to search engine technology.

In terms of development direction, search engines have two paths: one is the pursuit of quality, and the other is winning through the model. People's pursuit of quality is endless. New search engine technologies will be "faster" (faster updates and faster responses), "bigger" (more data capacity), and "stronger" (more intelligent, with more satisfying results). New things always have unparalleled advantages. The key to winning through the model lies in exploring demand and subdividing it to meet people's deeper needs. For example, specialized search for various kinds of multimedia and vertical search for various professional fields will have broad markets.

The application and integration of more new technologies, such as wireless networks and P2P, will bring new impetus to the search engine technology. The search engine technology will have a bright future.

(Computer World News 25th B10 and B11)

 

To: http://www2.ccw.com.cn/02/0225/ B /0225b03_1.asp

 

 

 
