Recently I was wondering whether there was a tool I could use to recognize text in images. That made me think of OCR, which is quite well developed in China. Can it be implemented in Python? After some searching I found a fun program called PyTesser. I'm sharing it here for discussion:
PyTesser is an optical character recognition module for Python. It works together with the Tesseract OCR engine to extract the text from an image or image file and return it as a string.
To use PyTesser, you do not need to install the Tesseract OCR engine separately, since a Windows executable is provided with the scripts, but you must first install the PIL module (Python Imaging Library).
Official introduction:
PyTesser is an Optical Character Recognition module for Python. It takes as input an image or image file and outputs a string.
PyTesser uses the Tesseract OCR engine, converting images to an accepted format and calling the Tesseract executable as an external script. A Windows executable is provided along with the Python scripts. The scripts should work in other operating systems as well.
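To make the "external script" idea concrete, here is a minimal sketch of that approach in Python 3. It is not PyTesser's own code. The function names build_tesseract_command and image_file_to_text are made up for illustration, and the sketch assumes a tesseract executable is on the PATH and that the image is already in a format Tesseract can read, so it skips the PIL conversion step. It relies only on the standard command form "tesseract IMAGE OUTPUTBASE [-l LANG]", which writes the recognized text to OUTPUTBASE.txt.

```python
import os
import subprocess
import tempfile


def build_tesseract_command(image_path, output_base, lang=None, exe="tesseract"):
    """Build the argument list for one Tesseract run (hypothetical helper)."""
    cmd = [exe, image_path, output_base]
    if lang:
        # -l selects the language data to use; it must be installed separately.
        cmd += ["-l", lang]
    return cmd


def image_file_to_text(image_path, lang=None, exe="tesseract"):
    """Run Tesseract as an external program and return the recognized text."""
    with tempfile.TemporaryDirectory() as tmp_dir:
        output_base = os.path.join(tmp_dir, "ocr_result")
        subprocess.run(
            build_tesseract_command(image_path, output_base, lang, exe),
            check=True,  # raise CalledProcessError if Tesseract fails
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
        # Tesseract appends ".txt" to the output base name.
        with open(output_base + ".txt", encoding="utf-8") as result_file:
            return result_file.read()
```

With Tesseract installed, image_file_to_text("scan.png") would return the text found in a hypothetical file scan.png. Passing lang="chi_sim" asks Tesseract for its Simplified Chinese language data, which only works if that data is installed.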
PyTesser official download page: http://code.google.com/p/pytesser/downloads/list
PIL library home page: http://www.pythonware.com/products/pil/
However, in my tests it only recognized English text reasonably well and could not recognize Chinese content at all!
If you are interested, try it out.