Python crawls online OJ questions

Source: Internet
Author: User

School work needs, need to set up an intranet OJ server, the use of open source Hustoj. The question was downloaded from the Hustoj freeprblem XML file. There are many errors when importing, for unknown reasons. In addition to the calendar year Noip to add to the test, but the Noip in the past years the XML file only 3, 4. Cogs on almost all of the calendar year Noip then thought of using python+pyquery to crawl into XML. As for not choosing BeautifulSoup and choosing pyquery is feeling PQ syntax close to jquery, it is more convenient to use, and the speed may be faster!

ver0.9 has been completed, but due to the COGS format is not unified, their own experience, found a lot of errors, pending further improvement!

Ver1.0 intends to rectify these errors and try to make the questions crawl as correctly as possible. Data fetching can be considered later, import problem

Python crawls online OJ questions

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.