Baidu engineers talk about the implementation principle and performance analysis of PHP functions (III.) _php skills

Source: Internet
Author: User
Tags rand strcmp urlencode alphanumeric characters

Implementation and introduction of common PHP functions

Count
Count is a function we often use, and its function is to return the length of an array.
Count, what is the complexity of this function? A common saying is that the count function traverses the entire array and then gets the number of elements, so the complexity is O (n). Is that actually the case? We go back to the implementation of count to see, through the source can be found, for the array count operation, the function of the final path is zif_count-> php_count_recursive-> zend_hash_num_elements, And Zend_hash_num_elements's behavior is return ht->nnumofelements, visible, this is an O (1) instead of O (n) operation. In fact, the array at the bottom of PHP is a hash_table, for the hash table, Zend has a special element nnumofelements record the number of the current element, so for general count actually directly returns this value. Thus, we conclude that count is the complexity of O (1) and is independent of the size of the specific array.
Non-array type of variable, what about the behavior of count? Returns 0 for a variable that is not set, whereas an int, double, string, and so on, returns 1

Strlen
Strlen is used to return the length of a string. So what is the principle of his implementation? We all know that in C strlen is a function of O (n), which traverses the string sequentially until it encounters a, and then a length. Is this how it is in PHP? The answer is no, PHP. Strings are described in a composite structure, including pointers to specific data and string lengths (similar to string in C + +), so strlen directly returns the length of the string, which is a constant-level operation. In addition, for a variable that is not a string type, it will first cast the variable to a string and then strlen the length, which requires attention.

Isset and Array_key_exists
The most common use of these two functions is to determine whether a key exists in an array. But the former can also be used to determine whether a variable has been set. As mentioned earlier, Isset is not a real function, so it is much more efficient than the latter. It is recommended to replace Array_key_exists.
Array_push and array[]
Both are appending an element to the tail of the array. The difference is that the former can push multiple at a time. Their biggest difference is that one is a function and one is a language structure, so the latter is more efficient. Therefore, if it is just an ordinary append element, we recommend that you use array [].

Rand and Mt_rand
Both are provided with the ability to produce random numbers, the former using the LIBC standard of Rand. The latter uses the known characteristics of the Mersenne twister as a random number generator, which can produce a random numerical velocity of four times times faster than the libc provided by Rand (). Therefore, if the performance requirements are high, you can consider replacing the former with Mt_rand. As we all know, Rand produces pseudo random numbers, and in C you need to display the specified seeds with Srand. But in PHP, Rand will help you invoke the Srand by default, and in general you don't need to display it yourself. It is important to note that if you need to invoke Srand in special cases, you must support the call. That is srand for rand,mt_srand corresponding Srand, must not mix use, otherwise is invalid.

Sort and Usort
Both are used for sorting, but the former can specify a sort strategy, similar to the qsort in our C and the sort of C + +. In the sort of both are using the standard of the fast row to achieve, for a sort of demand, such as special circumstances called PHP provided by these methods can be, do not have to do it again, the efficiency will be much lower. The reasons for this are as compared to the analysis of the user function and the built-in function.

UrlEncode and Rawurlencode
Both are used for URL encoding, except for-_ in the string. All non-alphanumeric characters are replaced with a percent semicolon (%) followed by a two-bit hexadecimal number. The only difference is that for spaces, UrlEncode is encoded as +, and Rawurlencode is encoded as%20. In general, in addition to search engines, our strategy is the space code for%20. Therefore, the latter is mostly used. Note that the encode and decode series must be used in matching.

STRCMP Series functions
This series of functions include strcmp, STRNCMP, strcasecmp, and strncasecmp, which are implemented in the same function as the C function. But there are also different, because the PHP string is allowed to appear, so in the judgement of the bottom of the use of the MEMCMP series rather than strcmp, theoretically faster. In addition, because PHP can get the string length directly, so it will first check this aspect, in many cases the efficiency will be much higher.

Is_int and Is_numeric
The functions of these two functions are similar and not exactly the same, and they must be noticed when they are used. Is_int: To determine whether a variable type is an integer type, PHP variable has a special field representation type, so direct judgment of this type can be an absolute O (1) Operation Is_numeric: To determine whether a variable is an integer or a numeric string, That is, in addition to the integer variable returning true, for a string variable, if the shape "1234", "1e4", and so on, will also be sentenced to true. This time will traverse the string to judge.

Summary and Suggestions

Summarize:
Through the principle analysis and performance test of the function realization, we summarize the following conclusions
1. PHP has a relatively large function call overhead.
2. Function-related information is stored in a large hash_table, each time by the function name to find in the hash table, so the function name length has a certain effect on performance.
3. function return reference has no practical meaning
4. Built-in PHP functions are much more performance-efficient than user functions, especially for string-class operations.
5. class method, normal function, static method efficiency is almost the same, there is not much difference
6. To remove the effect of empty function calls, built-in functions and the same function of the C function is basically the same performance.
7. All parameter passes are shallow copies with reference counts, at a very low price.
8. The performance impact of the number of functions can be almost negligible

Suggestions:

Therefore, for the use of PHP functions, there are some suggestions
1. A feature can be done with built-in functions, using it rather than writing PHP functions yourself.
2. If a feature is highly performance-specific, consider using extensions to implement it.
3. PHP function calls are expensive, so don't encapsulate them too much. Some features, if you need to call the number of times and only 1, 2 lines of code on the line to implement, the recommendation does not encapsulate the call.
4. Don't be overly obsessed with design patterns, as the previous article describes, excessive encapsulation can lead to performance degradation. The trade-off between the two needs to be considered. PHP has its own characteristics, must not parody, too follow the Java model.
5. Functions should not be nested too deep, use caution recursively.
6. Pseudo function performance is very high, the same function to achieve the priority given. Like using Isset instead of array_key_exists.
7. The function return reference does not make much sense, nor does it have any practical effect, and the recommendation is not considered.
8. Class member methods are less efficient than ordinary functions, so there is no need to worry about performance loss. It is recommended that you consider static methods more readable and secure.
9. Parameter passing, if not a special requirement, suggests using a pass value rather than a reference. Of course, reference passing can be considered if the parameter is a large array and needs to be modified.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.