A complete walkthrough of the collection module in the Knight Station Group collection system

Source: Internet
Author: User


First, let me introduce the workflow of the Knight Station Group system. My goal is to write a collection module for game introductions and publish the results to my own site, which runs on the Dream CMS. Second, a note on preparation: this article mainly covers the collection module. The release module will be covered another time if the opportunity arises, since including it would make this article too long. The official site already offers many release modules, and Knight has prepared fully functional release modules for all kinds of CMS. Third, it is best to watch the learning videos that Knight provides. Official learning site: Http://www.xiake5.com/demo. The rule test tool can be downloaded from the official Knight forum.

I use the release module with id=1173, which you can get online. OK, let's start.

We can create new modules for both crawling and releasing. Click to open the module-building interface. First, fill in the module information; do not be lazy here, because it makes your own management easier. Then choose the crawl mode you need from the four collection modes. As for module parameters, the custom and keyword crawl modes have three processes, while the spider and synchronous tracking modes have two.

A few more notes. 1. Knight lets you save your own modules locally and supports import and export; saving locally is recommended. 2. The crawl modes work as follows. Custom crawl, as the name suggests, lets you freely collect the content you need; learning regular expressions is recommended for it. Keyword crawl collects according to a predefined keyword library, so you get content on related topics. Spider crawl imitates a search spider: give it an entry address and it can crawl the whole site without hindrance. Synchronous tracking follows the target station and crawls promptly whenever the target station updates. Automatic corpus reorganization automatically produces high-quality original articles. This section is for publishing content to third-party websites.

Process 1. Choose the encoding of the pages you crawl and fill in the site you want to crawl, that is, the target station. Note that the encoding must be consistent everywhere it is set.
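To see why the encoding has to match, here is a minimal Python sketch. It is not part of Knight; the function name decode_page and the sample text are assumptions made for illustration. It decodes the same fetched bytes once with the matching encoding and once with a mismatched one.

```python
def decode_page(raw: bytes, declared_encoding: str) -> str:
    """Decode fetched bytes using the encoding configured for the target station.

    Undecodable bytes are replaced rather than raising, so a mismatch
    shows up as garbled text instead of a crash.
    """
    return raw.decode(declared_encoding, errors="replace")


# Hypothetical page content, served by the target station as GBK.
raw = "游戏介绍".encode("gbk")

good = decode_page(raw, "gbk")    # encoding matches the page
bad = decode_page(raw, "utf-8")   # encoding does not match the page

print(good == "游戏介绍")  # the matching encoding round-trips the text
print(bad == "游戏介绍")   # the mismatched encoding does not
```

If the collected articles come out garbled, a mismatch of this kind between the module setting and the target page is the first thing to check.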

Step one: fill in the test URL used for testing the rules. Step two: there are two extraction methods. The first is visual and suits those who do not know regular expressions; we use the second. Step three: select extraction by rule. Step four: add a rule to the panel. The rules that get added differ depending on the choice made in step one.

Description: Extracting pagination with a regular expression. Once you have found the pagination markup, test the rule with regextest (download address above). Note: \d matches a digit. The second process: extracting content links.
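As an illustration of the pagination rule from the first process, here is a small Python sketch. The HTML fragment and the URL pattern /games/list_N.html are assumptions, not taken from the real target station, and Python's re module stands in for whatever regex engine Knight uses.

```python
import re

# Hypothetical pagination markup from a list page.
list_html = '''
<div class="pages">
  <a href="/games/list_1.html">1</a>
  <a href="/games/list_2.html">2</a>
  <a href="/games/list_13.html">13</a>
</div>
'''

# \d matches a single digit; \d+ matches one or more digits,
# so the same rule covers page 1 as well as page 13.
page_rule = re.compile(r'href="(/games/list_\d+\.html)"')

page_urls = page_rule.findall(list_html)
print(page_urls)
```

Testing the rule against a saved copy of the page like this mirrors what the regextest tool does before the rule goes into the module.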

Description: We locate the section of code that holds the content links and write the collection rule. I have provided two rules; the second one includes the rule description, which you can use as a reference. Here I choose regular-expression extraction, which takes regular-expression rules. The third process: getting the specific content.
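For the content-link extraction in the second process, a rule might look like the following Python sketch. The list markup and the /games/NNNN.html link pattern are assumed for the example; the point is that anchoring the rule on the surrounding list item keeps unrelated links out.

```python
import re

# Hypothetical list-page fragment: two article links and one unrelated link.
list_html = '''
<ul class="list">
  <li><a href="/games/1001.html" title="Game A">Game A</a></li>
  <li><a href="/games/1002.html" title="Game B">Game B</a></li>
</ul>
<a href="/about.html">About</a>
'''

# Requiring the <li> prefix and a numeric file name skips the "About" link.
link_rule = re.compile(r'<li><a href="(/games/\d+\.html)"')

content_links = link_rule.findall(list_html)
print(content_links)
```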

Description: Fill in the basic information. There are two extraction modes, rule-based and intelligent. To illustrate the point, we use rule-based extraction so that everyone can learn some regular expressions. You can also extract pagination here; the process is similar to the list-page pagination settings, so I will not repeat it.

Note: The title is extracted with a regular expression in the same way. We find that it contains a B tag, which will be filtered out in post-extraction processing. I was going to use the visual engine to extract the title; that will have to wait until next time.
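The title step can be sketched in Python as two passes: extract, then strip the B tag. The h1 wrapper and the sample title are assumptions for illustration; the real page may mark up its title differently.

```python
import re

# Hypothetical content-page fragment with a <b> tag inside the title.
page_html = "<h1><b>Sample Game</b> Introduction</h1>"

# Pass 1: capture everything between the title markers.
title_rule = re.compile(r"<h1>(.*?)</h1>", re.S)
raw_title = title_rule.search(page_html).group(1)

# Pass 2: post-processing filter that removes <b> and </b> only.
title = re.sub(r"</?b>", "", raw_title, flags=re.I).strip()

print(raw_title)
print(title)
```

Keeping extraction and filtering as separate passes matches the tool's workflow, where the filter is configured as a post-extraction step.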

Description: For body content extraction, find the beginning and end of the body and write a regular expression that captures what lies between them. For learning regular expressions in detail, see the Knight video tutorial mentioned at the top of this article.
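A minimal Python sketch of the begin-and-end approach follows. The start marker (a div with id "content") and the end marker (a closing comment) are assumptions; on a real page you would pick whatever unique strings surround the body.

```python
import re

# Hypothetical content page with header, body, and footer.
page_html = '''<div id="header">nav</div>
<div id="content">
<p>First paragraph.</p>
<p>Second paragraph.</p>
</div><!-- end content -->
<div id="footer">footer</div>'''

# Non-greedy capture between the start marker and the end marker.
# re.S lets "." match newlines, since the body spans several lines.
body_rule = re.compile(r'<div id="content">(.*?)</div><!-- end content -->', re.S)

match = body_rule.search(page_html)
body = match.group(1).strip() if match else ""
print(body)
```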

After extraction, we filter the body text. Several tag filters are important. Description: Tag filtering. Links, scripts, and other tags that affect the page layout or carry information about the source site are filtered out with regular expressions.
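The tag filters described above might be sketched in Python like this. The function name filter_body and the sample markup are hypothetical; the two patterns show one common way to drop script blocks entirely and to unwrap links while keeping their text.

```python
import re


def filter_body(html: str) -> str:
    """Remove script blocks and unwrap links from extracted body HTML."""
    # Drop <script> blocks together with their contents.
    html = re.sub(r"<script\b[^>]*>.*?</script>", "", html, flags=re.S | re.I)
    # Remove opening and closing <a> tags but keep the anchor text.
    html = re.sub(r"</?a\b[^>]*>", "", html, flags=re.I)
    return html


sample = (
    '<p>Play <a href="http://example.com/x">this game</a> now.</p>'
    "<script>track();</script>"
)
cleaned = filter_body(sample)
print(cleaned)
```

The word boundary \b after the tag name keeps the link filter from touching other tags that merely start with the letter a, such as abbr.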

Process four: Now we save the crawl rules, create a site, add a task, and test it.

Description: A site can have several tasks; each task uses one collection module and corresponds to one release module.
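The relationship between sites, tasks, and modules can be pictured with a small Python data model. The class and field names are assumptions made for this sketch and do not reflect how Knight stores its settings; only the release module id 1173 comes from this article.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Task:
    name: str
    collection_module: str   # one collection module per task (name is hypothetical)
    release_module_id: int   # one release module per task


@dataclass
class Site:
    domain: str
    tasks: List[Task] = field(default_factory=list)  # a site can hold several tasks


site = Site("example.com")
site.tasks.append(Task("game intros", "game-intro-collector", 1173))
site.tasks.append(Task("game news", "game-news-collector", 1173))

for task in site.tasks:
    print(task.name, task.collection_module, task.release_module_id)
```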

Description: Collection has started! It gets the list first, then the content.
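The list-first, content-second order can be sketched in Python as follows. To keep the example self-contained, an in-memory dictionary stands in for the target station; the URLs, markup, and function names are all assumptions.

```python
import re

# Stand-in for the target station: URL -> HTML. No network access is used.
PAGES = {
    "/games/list_1.html": (
        '<li><a href="/games/1001.html">A</a></li>'
        '<li><a href="/games/1002.html">B</a></li>'
    ),
    "/games/1001.html": "<h1>Game A</h1>",
    "/games/1002.html": "<h1>Game B</h1>",
}


def fetch(url: str) -> str:
    """Return the stored HTML for a URL (a real crawler would do an HTTP GET)."""
    return PAGES[url]


def collect(list_url: str) -> list:
    """Phase 1: read the list page for links. Phase 2: visit each link."""
    links = re.findall(r'<a href="(/games/\d+\.html)"', fetch(list_url))
    articles = []
    for link in links:
        found = re.search(r"<h1>(.*?)</h1>", fetch(link), re.S)
        if found:
            articles.append({"url": link, "title": found.group(1)})
    return articles


articles = collect("/games/list_1.html")
print(articles)
```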

Description: This is the article library. We check the quality of the collected articles; if it is poor, we can apply replacement or filtering to the library, or revise the collection rules and collect again. Site settings: if the collection quality is acceptable, we do not need to come back here. The following are the specific settings for publishing:

Description: There are three parts. The first is the basic library, the second is the module setup, and the third is the test release. For the test, log in first, then get the categories, then publish; if the publish succeeds, we are almost done. If it does not succeed, we can modify the release module or obtain a different release module.

Description: Test Login

Description: Test Get Category

Description: Test publishing an article. If everything is normal, a Knight test article appears.

Description: The test article was published successfully.
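The order of the three release tests (login, get category, publish) can be sketched in Python as a sequence that stops at the first failure. This only illustrates the order of checks; the function run_release_test and the stand-in steps are hypothetical and do not call Knight or any CMS.

```python
from typing import Callable, List, Tuple


def run_release_test(steps: List[Tuple[str, Callable[[], bool]]]) -> List[str]:
    """Run the steps in order, stop at the first failure, and return a log."""
    log = []
    for name, step in steps:
        ok = step()
        log.append(f"{name}: {'ok' if ok else 'failed'}")
        if not ok:
            break  # later steps depend on earlier ones, so stop here
    return log


# Stand-in checks that always succeed; real checks would talk to the site.
steps = [
    ("login", lambda: True),
    ("get category", lambda: True),
    ("publish test article", lambda: True),
]

release_log = run_release_test(steps)
print(release_log)
```

If login fails, the later steps never run, which matches the advice to fix or swap the release module before trying again.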

Description: The Knight release process!

Description: The web page after it has been published successfully.

This tutorial has taken you step by step through the whole Knight collection process. Knight has other powerful functions, and what I have shown is only the tip of the iceberg. I welcome your guidance and valuable advice. Thank you!
