International - English

Cart Console

Topic Center

Contact Sales

Home > Developer > Python

Python combat: Beautiful picture downloader, a huge picture of you download

Last Update:2016-10-07 Source: Internet

Author: User

Tags python web crawler

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

Python applications are now in full swing with a wide range of applications. Fast access to the top of the programming language rankings due to its rapid development and high efficiency. This series of articles is dedicated to a comprehensive and systematic introduction of Python language development knowledge and related knowledge summaries. I hope you can get started quickly and learn the language of Python.

This article is based on Python in the previous part of Python combat: Python crawler Learning tutorial, get the movie leaderboard, again upgrade the Python web crawler Combat course.

1. Project Overview.

The use of XPath and requests module for Web page crawl and analysis, to achieve the effect of Web page image download.

Grab and crawl pictures address: http://www.2cto.com/meinv/

Development environment: Python 2.7, Pycharm 5 Community

Required Knowledge: Artifact XPath, requests module, Python basic syntax.

2. Introduction and installation of the required modules

Xpath

Description: XPath is actually a language that can be used to find and extract information in XML through the attributes of an element. It supports HTML.
Simpler than regular expressions. More powerful
Installation: Download the lxml library for installation operations. : http://www.lfd.uci.edu/~gohlke/pythonlibs/#lxml. Download the corresponding version of lxml
Open Library Directory Run command to install

After the download is complete, please change the suffix name WHL to zip.
Unzip the file to put the lxml folder in the Python installation directory of the Lib folder.

Requests Module Installation

For detailed installation steps see: Python Combat: Python crawler learning tutorial for requests installation in the movie leaderboard.

3.Xpath extract Find content in detail:

Language is no exception, XPath also has a certain syntax.

Locating the root node

/down Level Search

/text () Extract text content

/@xxx Extract Attribute Contents

4. Project Principal Code

From lxml import etree

selector = etree. HTML (Web page source code)

Selector.xpath (XPath syntax)

Import requests

Requests.get (URL)

5. Code Demo:

Effect Show:

Tip: XPath simple get: Developer Tools-Locate the label you want to extract-right-click to copy the XPath path.

But still need to modify OH.

Welcome to the Headlines Today: Be the full-stack siege lion. Python actual combat: Beautiful picture downloader, a huge amount of images you download.

QQ Technology Group: 538742639

Project source code please pay attention to the public platform: fullstackcourse do all-stack siege lion. Reply: "Beautiful picture downloader" gets.

Next: Python Learning Primer Tutorial, String function expansion

Python combat: Beautiful picture downloader, a huge picture of you download

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

Related Keywords:

picture downloader chrome picture downloader chrome picture of triangular prism picture of roulette wheel picture of arc picture of dvorak keyboard picture of metric ruler

Python design mode-UML-Package diagrams (Package Diagram) 09-09

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

What's Trending

Top 10 Tags

datastax versions naming convention zookeeper client class definition md5 microsoft sql server 2005 data structures exception handling error handling

Top 10 Keywords

microsoft download center down wordpress address url site address url wordpress address url windows installer 4 0 download 302 not found web address url definition site address url wordpress db2 integer mac os installation step by step pdf abbreviation for return

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Python combat: Beautiful picture downloader, a huge picture of you download

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

Trending Topic

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support