ABBYY FineReader is an OCR word recognition software that, when identifying text, can be used to create a custom language to identify text characters that contain many very useful elements (such as code numbers), which are listed today for everyone in ABBYY FineReader A specific method for creating a regular expression for a custom language dictionary.
If you have any questions, you can visit:http://www.abbyychina.com/FRshiyongjiqiao/abbyyfr-zhengzebds.html
Note:
1. To use regular expression symbols as normal characters, add a backslash before the symbol. For example, [t-v]x+ represents TX, TXX, TXX, etc. and UX, UXX, etc., while \[t-v\]x+ represents [T-v]x, [t-v]xx], [t-v]xxx, etc.]
2. To group the regular expression elements, use parentheses. For example, (A|b) +|c represents C or any combination of similar abbbaaabbb, Ababab (any length other than 0, which may have any number of words in any order of A and B), while A|b+|c represents a, C and B, BB, BBB, and so on.
Example:
Suppose you recognize a table that contains three columns: Date of birth, name, e-mail address, in which case you can create two new languages, data, and addresses, and specify the following regular expression for them:
Regular Expressions for dates:
The number representing the day can be made up of one digit (1, 2, etc.) or two digits (02, 12), but not 0 (00 or 0), and the day's regular expression should appear as : ((| |) [1-9]) | ([1|2][0-9]) | (30) | (31);
The regular expression for the month should appear as : ((| | |) [1-9]) | (10) | (11) | (12);
The regular expression for the year should appear as : ([19][0-9][0-9]| ( [0-9] [0-9]) | ([20][0-9][0-9]| ([0-9][0-9]).
Next, you need to combine all of these together with a period to separate numbers (such as 1.03.1999), the period is also a regular expression symbol, so you must precede the backslash (\), so the full date regular expression should appear as: ((| |) [1-9]) | ([1|2][0-9]) | (30) | (31) \. ((|) [1-9]) | (10) | (11) | (12) \. (19) [0-9][0-9]) | ([0-9][0-9]) | ([20][0-9][0-9]| ([0-9][0-9])
Regular Expressions for e-mail addresses :
[a-za-z0-9_\-\.] +\@[a-z0-9\.\-]+
These are the specific methods used to create a regular expression for a custom language dictionary, and it is hoped that the description of the same content can help you create and apply a regular expression to the abbyy FineReader in the custom language dictionary. There is a more detailed understanding and application.
How to create a regular expression in abbyy FineReader