Intermediary transaction http://www.aliyun.com/zixun/aggregation/6858.html ">seo diagnose Taobao guest cloud host technology Hall
This article first quotes a few words:
1. "To understand the user's intention, the need to return users." ”
2. "Portals are all thinking about how to save money, not how to spend money to buy technology." ”
3. "Search engine is not everyone can do the field, the threshold of entry is relatively high." ”
4. "Just being excellent is not enough, the best way is to put one thing to the extreme." "(Google Ten Truths)
5. "Do search engines need to focus on" "for a line to the fourth business, the portal is difficult to focus." ”
6. "The user can't describe what he's looking for unless he sees what he's looking for," he said. ”
7. "The so-called wedge-shaped, in fact, is an inverted triangle, the cutting-edge part of the triangle to represent the search technology, the middle is based on the technology of product application platform, the top is the entire search engine users of the culture of the knowledge and understanding, as well as modern company competition is the most important and most "Wedge" implication is another meaning: the wedge to hit the wall, cutting-edge is very important, but the damage of the wedge is how strong, on the wall can squeeze out how much space, which end, the back end of the calm and thick is the key.
The technology and idea of search engine need time and experience accumulation
It is necessary to improve the long-term continuous improvement, absolutely do not think that can be achieved overnight, to achieve a relatively mature leading search engine from the beginning to the leading cycle of the general need is four years. Anxious not to. The reason is that the search engine is too complex and "the user can't describe what he's looking for unless he sees what he's looking for." "Everything needs to be groped, try, the problem needs a solution, the user needs a little bit of digging."
A search engine is a product that provides services to users
The need for long-term continuous improvement of the upgrade adjustment to continue to mention the user experience, needs to meet the growing and changing needs of users, need to adapt to the network changes. This is because the network environment is constantly changing, the needs of netizens are constantly changing. Do not take the search as a project to do, finished that let the user to use that you certainly have no. In the Search engine field is to talk about experience, the new engine if the user experience once the overall lead to more than one year gap and lasted 2 years, the early leadership of the advantages of the total disappeared, because the search engine's user transfer cost is relatively low and word-of-mouth is the best way to spread. If a search engine does not continue to innovate the concept of technological innovation, it is tantamount to death for this search engine. We generally describe the search engine as a lead in terms of time. For example: Search from Baidu overall gap X years, Baidu from Google's overall gap of x years, ... As long as you can in the user experience to maintain a year's leading edge of 2 years, do not need hype, all over the world. In front of the user experience, any hype appears very small.
As a vertical search engine, Spadger, but spite.
No matter the idea culture, product management, application, technology and the search engine's wedge-shaped theory is no different. So to do a vertical search must solve these aspects.
Wedge-shaped Tip: Vertical search technology.
Vertical search technology is mainly divided into two levels: template level and Web page library level.
The template level is the way to extract data for template setting or automatic template generation for a Web page. The collection of Web pages is also targeted collection, suitable for small scale, less information sources and stable demand, the advantage is the rapid implementation, low cost, flexibility, the disadvantage is the late maintenance costs, information sources and small amount. The Web page library level is in the information source quantity, the data capacity searches the capacity, the stability reliability on all is the web search engine level request, and the template way biggest difference is to the concrete webpage does not rely on, may for any normal webpage to enter the information collection information to extract .... This leads to a qualitative difference in data capacity and template mode, but with poor flexibility and high cost. Of course, the template and the Web page library level is not antagonistic, both for vertical search engine is complementary, because technology is only means, the purpose is to cut the needs of users. This article discusses the technology mainly refers to the Web page library level vertical search engine technology.
Search engine is indeed a relatively high technical requirements of the application, a few years ago the relevant talent is relatively small. Now there are more search technicians, and the relevant technology and technology applications are more mature than before, but the competition is more intense.
Vertical search generally requires the following technologies:
1. Information Acquisition Technology
2. Web Information Extraction Technology
3. Information processing technology, including: Repeat recognition, repeat recognition, clustering, comparison, analysis, corpus analysis, etc.
4. Semantic correlation analysis
5. Participle
6. Index
Information acquisition technology, vertical search engine spider and web spider compared to the more professional, 17813.html "> Customizable." The accessibility of the collection and the vertical search range related to the Web page ignores irrelevant pages and unnecessary pages, selection of content-related and suitable for further processing of the page depth first acquisition, the selection of the page to adjust the frequency of updates ..., the collection can be manually set the URL and Web page analysis of the URL.