How to create a regular expression in ABBYY FineReader

Source: Internet
Author: User

ABBYY FineReader is OCR (optical character recognition) software. When recognizing text, you can create a custom language so that FineReader recognizes strings that follow a fixed pattern, such as code numbers. This article describes how to write a regular expression for a custom language dictionary in ABBYY FineReader.

If you have any questions, you can visit: http://www.abbyychina.com/FRshiyongjiqiao/abbyyfr-zhengzebds.html


Note:

1. To use a regular expression symbol as a normal character, put a backslash before it. For example, [t-v]x+ stands for tx, txx, txxx, etc. and ux, uxx, etc., while \[t-v\]x+ stands for [t-v]x, [t-v]xx, [t-v]xxx, etc.

2. To group regular expression elements, use parentheses. For example, (a|b)+|c stands for c or any combination such as abbbaaabbb or ababab (a word of any non-zero length containing any number of a's and b's in any order), while a|b+|c stands for a, c, and b, bb, bbb, and so on.
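The two notes above can be tried out in a short sketch. It assumes Python's `re` module as a stand-in for FineReader's own matcher; the two syntaxes agree on these particular patterns, but they are not identical in general. The helper `matches` is a hypothetical name used only for this illustration.

```python
import re

# Assumption: Python's re module stands in for FineReader's matcher.
unescaped = re.compile(r"[t-v]x+")   # t, u or v followed by one or more x
escaped = re.compile(r"\[t-v\]x+")   # the literal text "[t-v]" followed by x's
grouped = re.compile(r"(a|b)+|c")    # any non-empty mix of a and b, or c
ungrouped = re.compile(r"a|b+|c")    # a, or one or more b, or c


def matches(pattern, word):
    """Return True only when the whole word matches the pattern."""
    return pattern.fullmatch(word) is not None


if __name__ == "__main__":
    samples = [
        ("unescaped", unescaped, "txx"),
        ("escaped", escaped, "[t-v]xx"),
        ("grouped", grouped, "abbbaaabbb"),
        ("ungrouped", ungrouped, "ab"),
    ]
    for name, pattern, word in samples:
        print(name, repr(word), matches(pattern, word))
```

Whole-word matching (`fullmatch`) is used because a dictionary pattern is meant to describe a complete word, not a fragment inside one.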

Example:

Suppose you are recognizing a table that contains three columns: dates of birth, names, and e-mail addresses. In this case you can create two new languages, Data and Address, and specify the following regular expressions for them:

Regular Expressions for dates:

The number representing the day can consist of one digit (1, 2, etc.) or two digits (02, 12), but it cannot be zero (00 or 0). The regular expression for the day should look like this: ((|0)[1-9])|([12][0-9])|(30)|(31)

The regular expression for the month should look like this: ((|0)[1-9])|(10)|(11)|(12)

The regular expression for the year should look like this: ((19)[0-9][0-9])|((20)[0-9][0-9])|([0-9][0-9])

Next, combine these parts and separate the numbers with periods (such as 1.03.1999). The period is also a regular expression symbol, so you must put a backslash (\) before it. Each part must also be enclosed in its own pair of parentheses, otherwise the | alternatives would extend across the periods. The full date regular expression should look like this: (((|0)[1-9])|([12][0-9])|(30)|(31))\.(((|0)[1-9])|(10)|(11)|(12))\.(((19)[0-9][0-9])|((20)[0-9][0-9])|([0-9][0-9]))
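As a rough check of the date pattern described above, the sketch below builds it from separate day, month, and year parts and tests a few strings. It assumes Python's `re` module rather than FineReader itself, and the names `DAY`, `MONTH`, `YEAR`, `DATE`, and `is_date` are hypothetical, chosen only for this example.

```python
import re

# Assumption: Python's re module stands in for FineReader's matcher.
# Each part is one parenthesized group so that "|" cannot span a period.
DAY = r"((|0)[1-9]|[12][0-9]|30|31)"             # 1-31, optional leading zero
MONTH = r"((|0)[1-9]|10|11|12)"                  # 1-12, optional leading zero
YEAR = r"(19[0-9][0-9]|20[0-9][0-9]|[0-9][0-9])"  # 19xx, 20xx or two digits

# The period is a regular expression symbol, so it is escaped.
DATE = re.compile(DAY + r"\." + MONTH + r"\." + YEAR)


def is_date(text):
    """Return True when the whole text has the form day.month.year."""
    return DATE.fullmatch(text) is not None


if __name__ == "__main__":
    for sample in ["1.03.1999", "31.12.99", "00.03.1999", "1.13.1999"]:
        print(sample, is_date(sample))
```

The pattern only checks the shape of the digits. It does not know how many days each month has, so a string such as 31.02.1999 is still accepted.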

Regular expression for e-mail addresses:

[a-zA-Z0-9_\-\.]+\@[a-z0-9\.\-]+
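The e-mail pattern can be exercised the same way. This is a minimal sketch that again assumes Python's `re` module in place of FineReader; `EMAIL` and `is_email` are hypothetical names. The pattern is deliberately loose: it describes the characters that may appear on each side of the @ sign and does not validate real addresses.

```python
import re

# Assumption: Python's re module stands in for FineReader's matcher.
# Local part: letters, digits, underscore, hyphen, period.
# Domain part: lowercase letters, digits, period, hyphen.
# The backslash before "@" mirrors the article and is harmless in Python.
EMAIL = re.compile(r"[a-zA-Z0-9_\-\.]+\@[a-z0-9\.\-]+")


def is_email(text):
    """Return True when the whole text fits the e-mail pattern."""
    return EMAIL.fullmatch(text) is not None


if __name__ == "__main__":
    for sample in ["john.doe_1@example.com", "John@Example.com", "example.com"]:
        print(sample, is_email(sample))
```

Because the domain part lists only lowercase letters, an address with capital letters after the @ sign is rejected by this pattern.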

These are the steps for creating a regular expression for a custom language dictionary. We hope this description helps you create and apply regular expressions in custom language dictionaries in ABBYY FineReader.


