In the image reading era, the construction methods of professional image search engines are discussed.

Source: Internet
Author: User

The title of "Search Engine" may be too big, but in this age of information explosion, professional search engines are increasingly necessary. I have made pictures for more than 10 years. From the time when I first purchased material CDs to the time when I used search engines to search for image materials, they both have their own advantages and disadvantages, the purchased materials and discs are organized in a very standardized and orderly manner. The disadvantage is that the materials capacity is limited and you still need to pay for them. However, most of the materials purchased by the search engine are free of charge, however, in today's information age, the time cost for obtaining authentic and effective image materials is getting higher and higher under the interference of massive junk information. Because the current image search mainly relies on the relevant descriptions of the image, it is often difficult for us to find the image information that we use text keywords to search, turning dozens of pages, you still cannot find your desired image.

Baidu's image search image.baidu.com faces a wide range of users, and the complexity of the problem is too high, making it impossible for him to give a professional reply. For example, if a user searches for a cow, baidu will surely pick out millions of photos of all kinds of cattle for users. But in the face of professional designers, all they need is a piece of material with a single background. Therefore, Baidu serves all mankind, and it is impossible to provide accurate search results for designers, because he does not know that he is sitting in front of the computer as a designer looking for materials, therefore, our designers need a professional image search engine.

How to construct a professional image search engine? As I said at the beginning, this title is a little too big. Maybe it cannot be solved by my personal abilities. Here I can only talk about it in general, talk about some ideas, and share it with you, I hope to resonate with many of my friends who want to solve this problem. Since it is a search engine, we must also use the search engine mechanism to construct our system. We need our own web page collection Spider Program, but our collection scope can be reduced, we don't need to accept images like Baidu image search. We only need high-quality image materials, so our target database can be locked on design websites that provide image materials, regularly monitor and collect the latest image materials and index the pages in our database. Since the reprinting and plagiarism of various major websites are very common, there is no doubt that many duplicate information is collected for the first time, and such duplication is not in the traditional sense, however, the content of the image subject is the same, but the description of the image is different. Therefore, the computer cannot identify the duplicate content immediately with the naked eye, this is a very complex category of image recognition, and there is no effective solution at present. However, we may be able to solve this problem in the background using the semi-automated approach of Human + computer. After careful preliminary deduplication, our stored information will be more concise and effective, and the search content will be faster. Another important problem for image search engines is that users may not know which keywords are used as the fastest retrieval method. Image search engines must be able to guide users in efficient and quick search, the keyword structure in the background is also a very profound learning. All kinds of such problems can be encountered only by hands-on.

The old man said: the journey of a thousand miles begins with a single step. I personally started a project: www.23pic.com, which is used to construct and study this professional image search engine. I want to make it a similar open-source website, I have studied with a large number of search engine enthusiasts. If you are interested, please join me in QQ, and. Haina baichuan, has a high capacity, I hope to be the Don Quixote, use my sword to defeat the monster like a windmill search engine.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.