yioop! is a PHP search engine. Yioop! can be configured as a generic entire http://www.aliyun.com/zixun/aggregation/10412.html "> Web search engine, it can also configure search results for URLs or domain names." It supports a variety of indexed file formats, such as HTML, PDF, DOC, PPT, RTF, RSS, XML, SVG, PNG, JPG, BMP, GIF, and Sitemaps. yioop! Crawlers can be deployed on one or more machines of low-end and wired internet hardware. Yioop! can crawl the storage Web Archive format and move easily. Crawling can be done on a machine and other places as a result of deployment. Yioop! supports a hybrid crawl, equipped with the required GUI and can be localized to a search front-end. This GUI supports RTL languages. Managed crawlers can also use this GUI, configurable using memcache if there is a simple way.
yioop! 0.80 This version supports starting, stopping, and viewing log file queue servers from the Web interface fetchers. It is now possible to inject a new URL through the active crawl of a web interface. This yioop version! Supports a fixed number of days after the page crawls again. In addition, the file name extension being crawled, the number of bytes downloaded per page, and how to yioop! The weight of different page components is now available through a web interface rather than just config.php file control. Improvements have also been made to how the HTML processor extracts text for indexing.
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.