PHP learning-parsing URL, php-url
There are two methods in PHP to parse the URL: parse_url and parse_str.
Parse_url
Parse the URL and return its components
Mixed parse_url (string $ url [, int $ component =-1])
This function parses a URL and returns an associated array, which contains various components in the URL.
This function is not used to verify the validity of a given URL, but is used to separate it into the parts listed below. Incomplete URLs are also accepted, and parse_url () tries to parse them correctly as much as possible.
Parameters
The url to be parsed. Invalid characters are replaced.
Component specifies one of PHP_URL_SCHEME, PHP_URL_HOST, PHP_URL_PORT, PHP_URL_USER, PHP_URL_PASS, PHP_URL_PATH, PHP_URL_QUERY, or PHP_URL_FRAGMENT to obtain the string of the specified part in the URL. (Besides PHP_URL_PORT, an integer value is returned ).
Return Value
For URLs that are seriously unqualified, parse_url () may return FALSE.
If the component parameter is omitted, an associated array is returned. At present, at least one element is included in the array. Possible keys in the array include:
- Scheme-such as http
- Host
- Port
- User
- Pass
- Path
- Query-question mark? After
- Fragment-after the hash symbol #
If the component parameter is specified, parse_url () returns a string (or an integer if PHP_URL_PORT is specified) instead of an array. If the specified component in the URL does not exist, NULL is returned.
Instance
<?php$url = 'http://username:password@hostname/path?arg=value#anchor';print_r(parse_url($url));echo parse_url($url, PHP_URL_PATH);?>
The above routine will output:
Array( [scheme] => http [host] => hostname [user] => username [pass] => password [path] => /path [query] => arg=value [fragment] => anchor)/path
Parse_str
Parses a string into multiple variables.
Void parse_str (string $ str [, array & $ arr])
If str is the query string passed by the URL, It is parsed as a variable and set to the current scope.
Obtain the current QUERY_STRING. You can use the $ _ SERVER ['query _ string'] variable.
Parameters
Str string.
If arr sets the second variable arr, the variable will be saved to this array as an array element, as an alternative. ,
Instance
<?php$str = "first=value&arr[]=foo+bar&arr[]=baz";parse_str($str);echo $first; // valueecho $arr[0]; // foo barecho $arr[1]; // bazparse_str($str, $output);echo $output['first']; // valueecho $output['arr'][0]; // foo barecho $output['arr'][1]; // baz?>
Some time ago, I was reading the source code of php-resque and saw the application of these two methods. I felt very useful and used to parse the redis link settings.
Redis link format: redis: // user: pass @ host: port/db? Option1 = val1 & option2 = val2, is it the same as the URL, so it is easy to parse with the above two methods.
Address: https://github.com/chrisboulton/php-resque/blob/master/lib/Resque/Redis.php
The Code is as follows:
/** * Parse a DSN string, which can have one of the following formats: * * - host:port * - redis://user:pass@host:port/db?option1=val1&option2=val2 * - tcp://user:pass@host:port/db?option1=val1&option2=val2 * * Note: the 'user' part of the DSN is not used. * * @param string $dsn A DSN string * @return array An array of DSN compotnents, with 'false' values for any unknown components. e.g. * [host, port, db, user, pass, options] */ public static function parseDsn($dsn) { if ($dsn == '') { // Use a sensible default for an empty DNS string $dsn = 'redis://' . self::DEFAULT_HOST; } $parts = parse_url($dsn); // Check the URI scheme $validSchemes = array('redis', 'tcp'); if (isset($parts['scheme']) && ! in_array($parts['scheme'], $validSchemes)) { throw new \InvalidArgumentException("Invalid DSN. Supported schemes are " . implode(', ', $validSchemes)); } // Allow simple 'hostname' format, which `parse_url` treats as a path, not host. if ( ! isset($parts['host']) && isset($parts['path'])) { $parts['host'] = $parts['path']; unset($parts['path']); } // Extract the port number as an integer $port = isset($parts['port']) ? intval($parts['port']) : self::DEFAULT_PORT; // Get the database from the 'path' part of the URI $database = false; if (isset($parts['path'])) { // Strip non-digit chars from path $database = intval(preg_replace('/[^0-9]/', '', $parts['path'])); } // Extract any 'user' and 'pass' values $user = isset($parts['user']) ? $parts['user'] : false; $pass = isset($parts['pass']) ? $parts['pass'] : false; // Convert the query string into an associative array $options = array(); if (isset($parts['query'])) { // Parse the query string into an array parse_str($parts['query'], $options); } return array( $parts['host'], $port, $database, $user, $pass, $options, ); }