Topic Center

Contact Sales

Home > Developer > Python

Python Regular Expression _re module _ using compile acceleration

Last Update:2015-02-10 Source: Internet

Author: User

Developer on Alibaba Coud: Build your first app with APIs, SDKs, and tutorials on the Alibaba Cloud. Read more ＞

Use compile acceleration compile (rule [, flag])

Compiles regular rules into a Pattern object for the next use.
The first parameter is regular, and the second parameter is the rule option.
Returns a Pattern object
Use findall (rule, target) directly to match the string, two times a day nothing, if it is repeated use, because the regular engine each time the rules to explain the rule, and the interpretation of the rules is quite time-consuming, so the efficiency is very low. If you want to use the same rule more than once to make a match, you can use the re.compile function to precompile the rule, using the Regular Expression Object or the Pattern that was compiled to return object to be searched.
Cases
>>> s= ' 111,222,aaa,bbb,ccc333,444ddd '
>>> rule=r ' \b\d+\b '
>>> compiled_rule=re.compile (rule)
>>> Compiled_rule.findall (s)
[' 111 ', ' 222 ']
It is seen that using compile rules is similar to using non-compiled uses. The compile function can also specify some rule flags to specify some special options. Multiple options with '|' (bit or) to connect together.
I IGNORECASE ignore case differences.
L LOCAL Character Set localization. This feature is designed to support multiple language versions of the character set using the environment, such as the escape character \w, which stands for [a-za-z0-9] in English, that is, so English characters and numbers. If used in a French environment, the default setting does not match "é " or " C". Plus this L option and you can match it. However, this does not seem to work for the Chinese environment, it still does not match the characters.
M MULTILINE multi-line matching. In this mode ' ^ ' ( representing the beginning of the string ) and ' $ ' ( representing the end of the string ) will be able to match the case of multiple lines, becoming the beginning and end of the line mark. Like what
>>> s= ' 123 456\n789 012\n345 678 '
>>> rc=re.compile (R ' ^\d+ ') # matches a number at the beginning without using the M option
>>> Rc.findall (s)
[' 123 '] # results can only be found at the first beginning of the ' 123 '
>>> rcm=re.compile (R ' ^\d+ ', re. m) # using the m option
>>> Rcm.findall (s)
[' 123 ', ' 789 ', ' 345 '] # found three numbers at the beginning of the line
Similarly, for ' $ ' , without the M option, it will match the last line at the end of the number, i.e. ' 678 ', plus later, it will be able to match three end of the number 456 012 and 678 up .
>>> rc=re.compile (R ' \d+$ ')
>>> rcm=re.compile (R ' \d+$ ', re. M
>>> Rc.findall (s)
[' 678 ']
>>> Rcm.findall (s)
[' 456 ', ' 012 ', ' 678 ']
S Dotall '. ' Will match all the characters. By default '. ' Match all characters except the newline character ' \ n ' , after using this option,'. ' Can match any character that includes ' \ n ' .
U unicode \w , \w \b \b \d \d \s and will use Unicode.
X VERBOSE This option ignores whitespace in the regular expression and allows you to use ' # ' to guide a comment. This will allow you to write the rules more beautifully. Like you can put the rules

>>> rc = Re.compile (r "\d+|[ a-za-z]+ ") #匹配一个数字或者单词

Use the X option to write:

>>> rc = Re.compile (R "" "# Start a rule

\d+ #

| [A-za-z]+ # Word

"" ", Re. VERBOSE)

In this mode, if you want to match a space, you must use the form ' \ ' (followed by a space)

Python Regular Expression _re module _ using compile acceleration

This article is an English version of an article which is originally in the Chinese language on aliyun.com and is provided for information purposes only. This website makes no representation or warranty of any kind, either expressed or implied, as to the accuracy, completeness ownership or reliability of the article or any translations thereof. If you have any concerns or complaints relating to the article, please send an email, providing a detailed description of the concern or complaint, to info-contact@alibabacloud.com. A staff member will contact you within 5 working days. Once verified, infringing content will be removed immediately.

Related Keywords:

Python design mode-UML-Package diagrams (Package Diagram) 09-09

Python abstract class (ABC module) 09-18

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

What's Trending

Top 10 Tags

datastax versions naming convention zookeeper client class definition md5 microsoft sql server 2005 data structures exception handling error handling

Top 10 Keywords

microsoft download center down wordpress address url site address url wordpress address url windows installer 4 0 download 302 not found web address url definition site address url wordpress db2 integer mac os installation step by step pdf abbreviation for return

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

Get Started for Free

Sales Support

1 on 1 presale consultation

Chat Contact Sales
After-Sales Support

24/7 Technical Support 6 Free Tickets per Quarter Faster Response

Open a Ticket
Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.

Learn More

Python Regular Expression _re module _ using compile acceleration

Contact Us

What's Trending

Top 10 Tags

Top 10 Keywords

Trending Topic

A Free Trial That Lets You Build Big!

Sales Support

After-Sales Support