This condition statement is used to determine whether the start mark and the closed Mark are equal. if they are not equal, the mark is not closed. Finally, use in_array ($ start_tags [1] [$ I],
1. use a regular expression to check whether HTML is disabled
Code snippet
PHP code
- Function check_html ($ html ){
- Preg_match_all ("/<([a-zA-Z0-9] +) \ s * [^ \/>] *>/", $ html, $ start_tags );
- Preg_match_all ("/<\/([a-zA-Z0-9] +)>/", $ html, $ end_tags );
- If (count ($ start_tags [1])! = Count ($ end_tags [1]) return false;
- For ($ I = 0; $ I <count ($ start_tags [1]); $ I ++ ){
- If (! In_array ($ start_tags [1] [$ I], $ end_tags [1]) return false;
- }
- Return true;
- }
Explanation:
/<([A-zA-Z0-9] +) \ s * [^ \/>] *>/this pattern is used to match HTML tags (such:,
And so on,
And keep the label in $ start_tags.
(Such as head and div ). And/<\/([a-zA-Z0-9] +)>/this pattern is used to match closed HTML
Mark (for example:, Etc. And keep the closed tag name in $ end_tags. Then we use
Count ($ start_tags [1])! = Count ($ end_tags [1]) This condition statement is used to determine the starting tag and closed
Mark whether it is equal. if it is not equal, it indicates it is not closed. Finally, use in_array ($ start_tags [1] [$ I], $ end_tags [1]).
To determine whether the start mark is equal to the closed mark. So far, we have completed HTML matching!
2. match the email format
Code snippet
PHP code
- Function check_email ($ email ){
- If (preg_match ("/^ [\ w \ d! # $ % & '* +-\/=? ^ '{|} ~] + (\. [\ W \ d! # $ % & '* +-\/=? ^ '{|} ~] +) * @ ([A-z \ d] [-a-z \
- D] * [a-z \ d] \.) + [a-z] [-a-z \ d] * [a-z] $/", $ eamil) return true;
- Return false;
- }
Explanation:
Do not be
/^ [\ W \ d! # $ % & '* +-\/=? ^ '{|} ~] + (\. [\ W \ d! # $ % & '* +-\/=? ^ '{|} ~] +) * @ ([A-z \ d] [-a-z \ d] * [az \
D] \.) + [a-z] [-a-z \ d] * [a-z] $/
This mode is actually quite simple. Previous part
[\ W \ d! # $ % & '* +-\/=? ^ '{|} ~] + (\. [\ W \ d! # $ % & '* +-\/=? ^ '{|} ~] +) * Just match to conform to the RFC-2882 label
The quasi-email address can contain English letters, numbers, and symbols. if you are interested, you can query
RFC-2882 manual, and ([a-z \ d] [-a-z \ d] * [a-z \ d] \.) + is matching the HOST. Last [a-z] [-a-z \ d] * [a-z]
That is, it matches top-level domain names (such as. com and. org ).
3 non-greedy mode
When using regular expressions, you may find the following problems:
Code snippet
PHP code
- Preg_match ('/". *"/', 'Tony say: "hello", Jack say: "Hi" ', $ matches );
- Print_r ($ matches );
You are surprised to find that the matching content is "hello", Jack say: "Hi", not "hello" and
"Hi ". This is caused by greedy matching. In greedy match, regular expressions match the most words as much as possible.
So there is the first case, but what we want is the second case. what should we do?
We can use non-greedy match to change the pattern /".*? "/Then we can see what we want
The result is displayed. *? This is the non-greedy match mode. Another type is + ?.
? *? : The preceding characters can appear any number of times, *? The next character stops matching.
? +? : The preceding character can appear once or multiple times, but the +? The next character stops the match.
Configuration.
4. check whether a user password is secure.
Code snippet
PHP code
- Function is_good_pw ($ pw ){
- If (preg_match ('/(? =. * [0-9]) (? =. * [A-z]) (? =. * [A-Z]). {8}/', $ pw )){
- Return true;
- }
- Return false;
- }
Explanation:
In this example, we use /(? =. * [0-9]) (? =. * [A-z]) (? =. * [A-Z]). {8, 16}/pattern to match our
Password. In this mode, we use the forward-looking mode (? = ). (? =. * [0-9]) This is
There are numbers in the password, and (? =. * [A-z]) indicates that the password matches lowercase letters (? =. * [A-Z]) is to match the password
The code contains uppercase letters. The. {8, 16} password is a string of 8 to 16 characters. Then we
The password must contain numbers. a password consisting of uppercase and lowercase letters is a secure password ~
5. match all links in a website
Code snippet
PHP code
- Function get_links ($ link ){
- $ Html = file_get_contents ($ link );
- $ Html = str_replace ("\ n", "", $ html );
- $ Html = preg_replace ('/
- $ Html = preg_replace ('/<\/a>/', "\ n", $ html );
- Preg_match_all ('/.*? <\/A>/', $ html, $ matches );
- Return ($ matches );
- }
In this example, we want to use file_get_contents to obtain the content of a webpage. Then use
Str_replace ("\ n", "", $ html) removes all line breaks. Use preg_replace ('/<\/a>/', "\ n", $ html) to start a new line of all the... modes.
Use preg_match_all ('/.*? <\/A>/', $ html, $ matches) matches the link mode.
/.*? <\/A>/is the regular expression that matches the pattern. Why?
Do you want to add another line of link ?? Because in /.*? <\/A>/mode ,.*
It cannot match the line feed, so it cannot match if it is not in the same line !! So we need
Do it!