Characteristic Analysis of Six Common Free Data Collectors in China

Source: Internet
Author: User

Currently, there are several popular free collectors on the Internet: Locomotive, Haina, ET, Sanrenxing (Three-Person Walk), Madman, and Octopus. "Free" here is relative. For an individual doing routine collection, the free version is generally enough; enterprise users are generally charged. After all, the people who build the collectors also need to eat!
Okay. Let's take a look at the features of these free collectors!

1. Locomotive collector
Locomotive is probably one of the most successful pieces of collection software in China, and its user base, paid users included, is probably the largest.
Advantages: Full-featured and relatively fast. It is aimed mainly at CMS sites and can collect a large amount of content in a short time. Filtering and replacement are good and fine-grained, and the interface is relatively complete. Extensions are supported and fairly easy to use: if you can code, you can develop extensions in PHP or C# for any function. Attachment collection is complete as well.
Disadvantages: Writing collection rules is no small difficulty for many users, especially those who cannot read code. It consumes memory and CPU while running and does not release resources well. In addition, the license is bound to a single computer, which is sometimes inconvenient.
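To make the idea of a "collection rule" more concrete, here is a minimal Python sketch of a marker-based rule with filter-and-replace steps. It is not Locomotive's actual rule format or API (its own extensions are written in PHP or C#); FieldRule, apply_rules, and the sample page are hypothetical names invented purely for illustration.

```python
import re
from dataclasses import dataclass, field


@dataclass
class FieldRule:
    """Hypothetical rule: capture the text between two literal markers,
    then apply (pattern, replacement) regex filters in order."""
    name: str
    start: str
    end: str
    replacements: list = field(default_factory=list)

    def extract(self, html: str) -> str:
        begin = html.find(self.start)
        if begin == -1:
            return ""  # start marker missing: the rule does not match this page
        begin += len(self.start)
        stop = html.find(self.end, begin)
        if stop == -1:
            return ""  # end marker missing
        value = html[begin:stop]
        for pattern, repl in self.replacements:
            value = re.sub(pattern, repl, value, flags=re.S)
        return value.strip()


def apply_rules(html, rules):
    """Run every rule against one page and return a field-name -> text dict."""
    return {rule.name: rule.extract(html) for rule in rules}


# Illustrative page, defined inline so the sketch runs without network access.
SAMPLE_PAGE = (
    '<html><body><h1 class="title">Six free collectors compared</h1>'
    '<div id="content"><p>First paragraph.</p><script>track();</script>'
    '<p>Second <a href="/ad">paragraph</a>.</p></div></body></html>'
)

RULES = [
    FieldRule("title", '<h1 class="title">', "</h1>"),
    FieldRule(
        "body",
        '<div id="content">',
        "</div>",
        replacements=[
            (r"<script.*?</script>", ""),  # drop scripts together with their contents
            (r"</p>", "\n"),               # turn paragraph ends into newlines
            (r"<[^>]+>", ""),              # strip any remaining tags
        ],
    ),
]

if __name__ == "__main__":
    record = apply_rules(SAMPLE_PAGE, RULES)
    print(record["title"])
    print(record["body"])
```

Even in this toy form, the user has to read the page source to pick the markers and write the regular expressions, which is roughly why rule writing is hard for people who do not understand code.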
2. Haina
Advantages: It can crawl a website for large numbers of articles matching given keywords. It seems well suited to building special-topic sections on a website, especially from articles and blogs.
Disadvantages: The classification function is incomplete, and manual classification is easy to get wrong. With certain interfaces, the content that can be collected is limited. Only one item can be collected at a time; batch collection is not supported. The interface must be connected to the website's admin back end. Installation requires on-site technical support from Haina staff, which is troublesome. It is a paid product, and the free features are too limited, like "chicken ribs": too little to be useful, yet a pity to throw away.
3. ET collector
Advantages: It runs unattended and updates automatically; its user base is mainly long-time, low-profile webmasters. The interface is clear, the necessary functions are all there, and the software is free of charge.
Disadvantages: Support for forums and CMS is only mediocre. There are few help files, and it is not easy to get started.
4. Sanrenxing (Three-Person Walk) collector
Advantages: Moving and migrating content from the major forum systems is fast and accurate, which makes it well suited to forums.
Disadvantages: Extremely complicated, hard to get started with, and poor support for CMS.
5. Madman collector
Feature: It can give a new forum a large number of members right from the start.
Advantages: Well suited to collecting from Discuz forums.
Disadvantages: Too specialized, with poor compatibility.
6. Octopus collector

Advantages: It has complete functions and is easy to operate, with no need to write rules. Collection tasks can run in the cloud, so they keep going even when your own computer is shut down.

Disadvantages: As a newer product, it has a relatively short track record.


Conclusion: If you want simplicity together with complete functions, choose the Octopus collector. If you are a technical person who is comfortable writing rules and wants the fullest feature set, choose the Locomotive collector. Both can collect a large amount of content quickly and can be applied in many scenarios. This article covers only the six major free collectors; there are many others that are not described here.

 
