I. meaning of the URL
URL, the Uniform Resource Locator, which is what we call the URL, the Uniform Resource Locator is a concise representation of the location and access methods of resources available from the Internet, and is the address of standard resources on the Internet. Each file on the Internet has a unique URL that contains information that indicates the location of the file and how the browser should handle it.
The format of the URL consists of three parts:
① The first part is the protocol (or service mode ).
② the second part is the host IP address (and sometimes the port number) where the resource is stored .
③ The third part is the specific address of the host resource, such as directory and file name.
Crawling data must have a target URL to get the data, so it is the basic basis for the crawler to , accurate understanding of its meaning for the crawler to learn a lot of help.
Meaning of the URL