Php curl module collects page instances after Simulated Logon

Source: Internet
Author: User
Tags http cookie

In php, the curl module is a multi-thread module that can easily simulate logon, such as post and get, the following example uses the curl module to simulate the collection page program after logon.

My homework today is to get product inventory from a website, but this website needs to be logged on. I used fsockopen to pass the entire header, and it was useless. I had to turn to curl for help.

The following describes how to enable the curl module:

(1) Copy libeay32.dll and ssleay32.dll from the php Directory to the windows directory.
(2) Open php. ini, find "extension_dir = xxxxx", and check that the php_curl.dll file is in the file directory.
(3) in the same way as php. ini, find "extension = php_curl.dll" and confirm that it is not commented out (no ';' above ';').
(4) Restart apache. If curl_init () is used and an error message is displayed in the statement, the installation is successful.

Example

The Code is as follows: Copy code

$ Curl = curl_init ();
$ Cookie_jar = tempnam ('./tmp', 'cooker ');
Curl_setopt ($ curl, CURLOPT_URL, 'HTTP: // www. bKjia. c0m/checkUser. jsp '); // enter the logon processing interface here.
Curl_setopt ($ curl, CURLOPT_POST, 1 );
$ Request = 'user = xxx & password = XXX ';
Curl_setopt ($ curl, CURLOPT_POSTFIELDS, $ request); // transfers data
Curl_setopt ($ curl, CURLOPT_COOKIEJAR, $ cookie_jar); // Save the returned cookie information in the $ cookie_jar file.
Curl_setopt ($ curl, CURLOPT_RETURNTRANSFER, 1); // sets whether the returned data is automatically displayed.
Curl_setopt ($ curl, CURLOPT_HEADER, false); // you can specify whether the header information is displayed.
Curl_setopt ($ curl, CURLOPT_NOBODY, false); // you can specify whether to output the page content.
Curl_exec ($ curl); // return the result
Curl_close ($ curl); // close

$ Curl2 = curl_init ();
Curl_setopt ($ curl2, CURLOPT_URL, 'HTTP: // www. bKjia. c0m/aaa. php'); // The page on which you want to obtain information after logging on.
Curl_setopt ($ curl2, CURLOPT_HEADER, false );
Curl_setopt ($ curl2, CURLOPT_RETURNTRANSFER, 1 );
Curl_setopt ($ curl2, CURLOPT_COOKIEFILE, $ cookie_jar );
$ Content = curl_exec ($ curl2 );

The obtained data is passed as a string to $ content. Then process the string and delete unnecessary parts.
I only deleted unnecessary parts of the page front-end:

The Code is as follows: Copy code

$ Content = strstr ($ orders, '<div class = "products">'); // search for the first appearance

<Div class = "products">

And delete

CURL parameters:

Bool curl_setopt (int ch, string option, mixed value)

The curl_setopt () function sets options for a CURL session. The option parameter is the setting you want, and the value is the value given by this option.

The values of the following options will be used as long integer (specified in the option parameter ):

* CURLOPT_INFILESIZE: When you upload a file to a remote site, this option tells PHP the size of the file to be uploaded.
* CURLOPT_VERBOSE: If you want CURL to report every unexpected event, set this option to a non-zero value.
* CURLOPT_HEADER: If you want to include a header in the output, set this option to a non-zero value.
* CURLOPT_NOPROGRESS: if you do not display a process entry for CURL transmission in PHP, set this option to a non-zero value.

Note: PHP automatically sets this option to a non-zero value. You should change this option only for debugging purposes.

* CURLOPT_NOBODY: if you do not want to include the body in the output, set this option to a non-zero value.
* CURLOPT_FAILONERROR: If you want PHP not to be displayed when an error occurs (HTTP code returns a value greater than or equal to 300), set this option to a non-zero value. By default, a normal page is returned, ignoring the code.
* CURLOPT_UPLOAD: If you want PHP to prepare for upload, set this option to a non-zero value.
* CURLOPT_POST: If you want PHP to create a regular http post, set this option to a non-zero value. This POST is a common application/x-www-from-urlencoded type, most of which are used by HTML forms.
* CURLOPT_FTPLISTONLY: set this option to a non-zero value. PHP will list the FTP directory names.
* CURLOPT_FTPAPPEND: set this option to a non-zero value. PHP overwrites the Remote Application file.
* CURLOPT_NETRC: set this option to a non-zero value. PHP will be in your ~. In the/netrc file, find the username and password of the remote site you want to establish a connection.
* CURLOPT_FOLLOWLOCATION: set this option to a non-zero value (like "Location:") header. The server will send it as part of the HTTP header (note that this is recursive, PHP will send the header like "Location ).
* CURLOPT_PUT: sets this option to upload a file over HTTP as a non-zero value. To upload this file, you must set the CURLOPT_INFILE and CURLOPT_INFILESIZE options.
* CURLOPT_MUTE: set this option to a non-zero value. PHP will be completely silenced for the CURL function.
* CURLOPT_TIMEOUT: specifies the maximum number of seconds for a long integer.
* CURLOPT_LOW_SPEED_LIMIT: sets the number of long integers to control the number of bytes transmitted.
* CURLOPT_LOW_SPEED_TIME: sets the number of long integers and controls the number of seconds to transmit the number of bytes specified by CURLOPT_LOW_SPEED_LIMIT.
* CURLOPT_RESUME_FROM: transmits a long integer parameter containing the byte offset address (the start form you want to transfer ).
* CURLOPT_SSLVERSION: transmits a long parameter containing the SSL version. By default, PHP will be determined by its own efforts. You must set it manually in more security scenarios.
* CURLOPT_TIMECONDITION: transmits a long parameter to specify how to process the CURLOPT_TIMEVALUE parameter. You can set this parameter to TIMECOND_IFMODSINCE or TIMECOND_ISUNMODSINCE. This is only used for HTTP.
* CURLOPT_TIMEVALUE: the number of seconds from January 1, to the present. This time will be used by the CURLOPT_TIMEVALUE option as the specified value, or by the default TIMECOND_IFMODSINCE.

The values of the following options will be used as strings:

* CURLOPT_URL: the URL you want to retrieve with PHP. You can also set this option when initializing with the curl_init () function.
* CURLOPT_USERPWD: transmits a string in the format of [username]: [password] to connect to PHP.
* CURLOPT_PROXYUSERPWD: transmits a string in the format of [username]: [password] to connect to the HTTP proxy.
* CURLOPT_RANGE: transmits a range you want to specify. It should be in the "X-Y" format, X or Y is excluded. HTTP shipping also supports several intervals separated by sentences (X-Y, N-M ).
* CURLOPT_POSTFIELDS: transmits a string of all data for the HTTP "POST" operation.
* CURLOPT_REFERER: a string containing the "referer" header in an HTTP request.
* CURLOPT_USERAGENT: a string containing the "user-agent" header in an HTTP request.
* CURLOPT_FTPPORT: transmits an IP address that contains the IP address used by the ftp "POST" command. This POST Command tells the remote server to connect to the specified IP address. This string can be an IP address, a host name, a network interface Name (under UNIX), or '-' (using the default IP address of the system ).
* CURLOPT_COOKIE: transmits a header connection containing the HTTP cookie.
* CURLOPT_SSLCERT: transmits a string containing the PEM format certificate.
* CURLOPT_SSLCERTPASSWD: pass a password that includes the password required to use the CURLOPT_SSLCERT certificate.
* CURLOPT_COOKIEFILE: a string that transmits the name of a file containing cookie data. This cookie file can be in the Netscape format or heap containing the HTTP header in the file.
* CURLOPT_CUSTOMREQUEST: When an HTTP request is sent, a character is used by GET or HEAD. It is helpful to perform DELETE or other operations. Pass a string to be used instead of GET or HEAD when doing an HTTP request. this is useful for doing or another, more obscure, HTTP request.

Note: Do not do this before confirming that your server supports commands.

The following options require a file description (obtained by using the fopen () function ):
 
* CURLOPT_FILE: This file will be the output file you placed for transfer. The default value is STDOUT.
* CURLOPT_INFILE: this file is the input file you sent.
* CURLOPT_WRITEHEADER: This file contains the header of your output.
* CURLOPT_STDERR: this file is written incorrectly, not stderr.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.