How to use ABBYY FineReader to identify text in a picture

Source: Internet
Author: User

As an OCR optical character recognition software, ABBYY FineReader is able to quickly and conveniently convert scanned paper documents, PDF files and digital camera images into editable, searchable text, making computer processing more efficient, getting rid of previous troubles, and saying goodbye to time-consuming manual input and file editing. Today I would like to share with you a case of others using ABBYY FineReader to identify the text in the picture, to see how others are using ABBYY finereader to improve efficiency:

Yesterday in the micro-blog collection of several picture format of the color spectrum, later translation may be used, so think of OCR (Optical character recognition) to identify processing after the introduction of the cat standby. Before in micro-blog often see you guys recommend abbyy FineReader, mention it unparalleled recognition effect, today, a small test, excited, the effect is really good, the Chinese character recognition is higher, not long-winded, the above picture illustrates Satan.

Preparation: Locate the two pre-saved JPG images and install the latest version of ABBYY FineReader 12 software.

Objective: To extract the English and Chinese columns in the picture and to export the text in Excel format.

Original picture

Operation 1, because the text in the picture is displayed, so after opening abbyy finereader 12, select Microsoft Excel entry;

Note: In this window you can set the language to identify (Simplified Chinese and English), as well as color mode, where you can choose full color and black and white mode, black and white mode to read a little faster.

2, then select "Image or PDF file to Microsoft Excel", add to identify two pictures, open the software automatically began to recognize, you can also click on "File", a new document, and then directly to identify the picture dragged to the left side of the software, the same can be opened for identification;

3, considering that the picture text may appear blurred, text skew and turn, so choose to cancel the recognition, the image first edit, click on the above toolbar "edit image", the right to open the list of editing tools;

4, first of all, to the image skew correction, if the scanned picture is not regular, after the scan will be prompted to correct the need for correction of the picture, where you can select "All Pages", and then click "Skew Correction", if the picture is rotated 90 degrees or inverted after the picture, you can rotate it here or flip processing;

5, the next, but also the most important, is to adjust the resolution of the picture, some pictures blurred, will affect the software recognition effect, here can be the resolution of the picture to scan the resolution of the image, that is 300dpi, this value can be generally recognized, can also be customized resolution. With this option, you can set the resolution of the picture separately, you can also select odd or even pages and all pages, in order not to affect the identification, here you can choose "All pages";

6, then you can exit the image editor;

7. Since we only need two-column text in Chinese and English, other irrelevant content can not be identified, therefore, you can select the area to identify, that is, click on the middle of the upper left corner of the "a" button, you can choose two columns to identify the text;

8, the selected text is light green, and then click on the selected area, in the Pop-up toolbar Select button "A", found inside the "form" item, so that the text identified after the two columns of the text of the comparison;

9, then, click on the above Toolbar "read" option, start to identify;

10, the following figure for the recognition of the effect map, the rightmost column is the identification of the text content, in the column head, you can identify the text format to set, such as setting font, font size, tilt, bold, etc.

11, after the recognition of the text, green shows that there may be spelling or recognition errors or lower confidence in the characters, if not done directly export processing, may affect future use. At this point, you can select the "validation Text" in the toolbar to edit and confirm the green tag section.

12, in the course of operation, it will be found that the text marked green is not misspelled, may only be improper font settings, in this case only need to ignore skip, there is the identification of the wrong text, make changes to replace, FineReader's own dictionary will be prompted to identify the correct variable, choose the correct text, Click "Replace" or "replace All", then "confirm";

13, the above image is verified after the text, is not more beautiful?

14, and then output text, click on the toolbar "save", that is, save the file in Excel format, the default state, the saved files will automatically open;

15, this is the exported file, again on the font and font size to adjust to make it look more beautiful. Then you can import it into a variety of cat (computer-aided translation) software, later translation, if such terminology, cat can automatically prompt, is not to save Google asked query distress?

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.