Baidu's encyclopedia defines OCR (Optical Character Recognition) as the process in which an electronic device (such as a scanner or digital camera) examines characters printed on paper, determines their shapes by detecting patterns of dark and light, and then uses character recognition methods to translate those shapes into computer text. In other words, for printed characters, the text in a paper document is optically converted into a black-and-white bitmap image file, and recognition software then converts the text in that image into a text format that word processing software can edit further.
As an engineer, you will sometimes need to read the text in an image from your own program, and that calls for OCR. Since I develop in PHP, PHP was my first choice. I found a PHP OCR extension and tested it, but it turned out to be unusable (address: Http://sourceforge.net/projects/phpocr.berlios). I also looked at quite a few demos that people had posted online. The principle is basically the same in all of them: decompose the image into a matrix of 0s and 1s, then convert it into the corresponding string according to its features. I tested several and none of them worked. Later I saw people say that PHP is rarely used for OCR and is not well suited to it: these algorithms need to be very efficient, and the language is too slow. Their suggestion was to try OCR in a language such as C or MATLAB; apparently a lot of people use MATLAB for this kind of specialized algorithm.
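To make the matrix principle behind those demos concrete, here is a minimal sketch in plain PHP. It is only an illustration under my own assumptions, not the code of any of the demos I tested: the function names `binarize`, `matchScore` and `recognize`, the threshold of 128, and the tiny 3x3 templates are all hypothetical. It thresholds a grid of grayscale values into a 0/1 matrix and then picks the template that agrees with it in the most cells.

```php
<?php
// Hypothetical helper: turn a grid of grayscale values (0-255) into a 0/1 matrix.
// Dark pixels (below the threshold) become 1, light pixels become 0.
function binarize(array $gray, int $threshold = 128): array
{
    $bits = [];
    foreach ($gray as $row) {
        $bits[] = array_map(fn($v) => $v < $threshold ? 1 : 0, $row);
    }
    return $bits;
}

// Hypothetical helper: count the cells where two equally sized matrices agree.
function matchScore(array $a, array $b): int
{
    $score = 0;
    foreach ($a as $y => $row) {
        foreach ($row as $x => $bit) {
            if ($bit === $b[$y][$x]) {
                $score++;
            }
        }
    }
    return $score;
}

// Hypothetical helper: return the character whose template matches best.
function recognize(array $bits, array $templates): string
{
    $best = '';
    $bestScore = -1;
    foreach ($templates as $char => $template) {
        $score = matchScore($bits, $template);
        if ($score > $bestScore) {
            $bestScore = $score;
            $best = (string) $char;
        }
    }
    return $best;
}

// Toy 3x3 templates, invented for this example only.
$templates = [
    'I' => [[0, 1, 0], [0, 1, 0], [0, 1, 0]],
    'L' => [[1, 0, 0], [1, 0, 0], [1, 1, 1]],
];

// A noisy "L": the bottom-right pixel is too light to pass the threshold.
$gray = [
    [20, 240, 250],
    [30, 235, 245],
    [25,  40, 200],
];

echo recognize(binarize($gray), $templates), "\n"; // prints "L"
```

Real images need far more than this (segmenting individual characters, normalizing their size, tolerating noise), which is probably part of the reason the simple demos did not hold up in my tests.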
Unfortunately my skills are limited and I don't know C. Then I happened to discover that Baidu provides an OCR API: Http://apistore.baidu.com/apiworks/servicedetail/146.html.
So I wrote a little something to try it out: