Correction of OCR recognition error by flexible searching and replacing in WPS

Source: Internet
Author: User

Friend small A in the collation of a material, often use the scanner will be used to convert the existing paper materials into electronic documents, but the use of OCR software to identify the text, the total "primer" to identify the word "bow 1" or "Bow I", "Bow l". Using the substitution function in the WPS text, he chose to use the wildcard character to replace the "bow *" with "lead", and the result was to replace the word "bow", and the letter in the back was not replaced, so he asked for advice from the "Brick House".

Haha, find the "brick home" even if you find the right person. The "Brick house" bluntly told him: with the search and replace function to achieve, the direction is right, but the replacement technique is not mastered.

In the method of small A, "Look up" object is "Bow *", will make WPS very embarrassed, because "*" as a wildcard character, is generally to be placed in the middle of the search content, that is, before and after the content. Otherwise, because "*" means "There are any number of arbitrary characters," there is no specific character limit, will cause the system do not know what you are looking for, the result only found the "bow" (any number of course include "0"), of course, replaced only "bow" word, followed by the "I", "L", "1" Such characters are not always replaced.

So, what should we do? In addition to "*", there is a wildcard character "?" (Half-width question mark), unlike "*", a "?" Represents only one character, if the lookup content is set to "bow", you can find "bow 1" or "Bow I", "Bow l". However, this also leads to another problem, that is, although OCR identifies the wrong "bow 1" or "Bow I", "bow l" are replaced with "primer", but the original correct "bow" and "frightened" in the "bow" and so are replaced as "primer", which can not!

Looking at small a that surprised and admire the eyes, my vanity got a great satisfaction, nothing to say, put my ace one or two pieces to teach him.

Enter "Bow [1il]" in "Find what" and enter "Primer" in "Replace with", click "Replace All" (see picture), OK, "bow 1" or "Bow I", "bow l" all replaced with "lead", and "frightened" but did not become "startled bird"!

Here, the square bracket function is "as long as any one character match", will be found, so that both find all the content to find, but also effectively avoid the "wrong kill 10,000" problem.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.