The Chinese New Year is approaching, and the new application of the blogger-screen word-taking-the-old image coding work is also in full swing. Next we will share with you the core feature ocr in this application, that is, the image recognition function. Let's take a look at my implementation results. It is an English page that is randomly cut off on the Internet. It is the Implementation Effect of my application.
2. Implementation
(1) first downlo
In a recognition software found Asprise OCR "Figure", the last officer network to check the relevant information, found that the function is very powerful, the identification of the printed body should be good, unfortunately it seems not to recognize Chinese, but do not know how it is distorted after the English recognition ability, otherwise it can be used to identify the verification code. Download the C + + demo on the website and record it here.1.
ocr,optical character recognition abbreviation, that is, optical recognition system, is a branch of graphics recognition, OCR is for the printing character, the use of optical way to convert the document data into the original data black and white lattice image file, The recognition software identifies the text in the image as Chinese and English, and then converts it into a text format. So the computer can
Linux Installation TESSERACT-OCRPreparatory work:one. Compilation Environment :1. GCC gcc-c++ make (this environment is a common machine, can be ignored)Yum Install GCC GCC Make2. Dependent packages: autoconf automake libtool libjpeg-devel libpng-devel libtiff-devel zlib-devel Leptonica (more than 1.67)autoconf automake libtool libjpeg-devel libpng-devel libtiff-devel Zlib-devel can be installed via Yum:Yum Install autoconf automake Libtool Yum Install libjpeg-devel libpng-devel libtiff-devel z
1,pil or pillow (Python Imaging Library) image processing librariesprinciple: The image class is a very important class in the PIL library, through which the instance can be loaded directly into the image file, read the processed graphthree ways to get images like and through crawlingsteps to install PIL and Pillow (Window edition)Prerequisites: Before installing PIL, you need to install Pip (Pip is a tool for installing and managing Python packages, a replacement for Easy_install) 1. First foun
Tags: orm bin OCR oracle10g ror crs_stat cluster get backVerify that the cluster status is normal before operation1. Replace OCR#./ocrconfig-replace ocrmirror/dev/rhdisk11#./ocrcheckStatus of Oracle Cluster Registry is as follows:Version:2Total Space (Kbytes): 1048300Used Space (Kbytes): 4628Available Space (Kbytes): 1043672id:2049932717Device/file Name:/dev/rhdisk2Device/file Integrity Check SucceededDevic
The following links contain the jar packages that are required for the installation package and the program to run, and the Chinese resource pack.How to use the Chinese package: Find the Tessdata installation directory (my Local: C:\Program Files (x86) \tesseract-ocr\tessdata), replace Eng.traineddata with Chi_sim.traineddata , and rename the Chi_sim.traineddata to Eng.traineddataResource Bundle: HTTP://PAN.BAIDU.COM/S/1DFC0EM1Code please refer to: ht
Please download the http://asprise.com/product/ocr/index.php first? The lang = csharp sdk provides detailed OCR methods, as follows: The text in the image frame picbveryfycodeis the same as the text in the image frame .txt veryfycode. Text. The digital recognition rate I encountered was almost 100%. Http://blog.csdn.net/crabo/ [Dllimport ("aspriseocr. dll")]Static extern string craboocr (string file, in
Warehouse Address: Https://github.com/RobinDavid/PytesserInstall tesseract sudo Install Opencv-pythonAfter installation, you need to download the identification file, because my environment isTesseract 3.02.02leptonica-1.70Zlib 1.2.11So I downloaded 3.02 of the Chinese recognition training data, the address ishttps://sourceforge.net/projects/tesseract-ocr-alt/files/Need to extract to/usr/local/share/tessdataThen write the script test.pyImport= pytesse
Before using the Fast OCR text recognition software, let's take a look at the software. Fast OCR text recognition software is a well-known text recognition software, specifically for users to provide image text recognition services, is a PDF file processing function of the OCR software, it has a variety of features:
1. High recognition accuracy and fast recognit
on the paper, by detecting dark, bright mode to determine its shape, and then use the character recognition method to translate the shape into computer text process, that is, the text data scanned, and then the image file analysis processing, access to text and layout information process. It is the most important subject for OCR to improve the recognition accuracy by using auxiliary information, and the Intelligent Character recognition noun is produ
If we want to convert part of the hand and paper into an electronic file, in addition to scanning the computer with a scanner, we also need to use OCR text recognition software, to convert the scanned image to a text file. Google now supports OCR!
Official Website: http://docs.google.com/
Step
1. Open your browser, connect to http://docs.google.com, log on to your Google account, and click upload in the
TESSERACT-OCR tools in the Text2image.exe, download other people compiled by the win system will not work properly. It took a long time to finally compile a normal operation.--font= "Font name" Specifies the font name with double quotes, Cannot use single quotation marks.--text= "input File" The text file to be entered needs to be in UTF-8 format. The name of the font needs to run the Text2image--list_available_fonts command first to see the valid fo
As usual. First, send the picture.And then the speed of the link to everyoneBaidu Software Center OCR free SoftwareThis software currently supports local import and scanner capture images. Two different ways.Language support both simplified and plain English.Export supports PDF, txt.Because it is a free trial version, large quantities of tests are not supported for the time being.You can test our recognition rate. Because the core is the same.Well, do
Tesseract-ocr is an open-source optical character recognition engine that supports Google and supports recognition in many languages, next I will talk about the installation steps in Ubuntu. In fact, the official documentation is very detailed. The commands listed below are sudoapt-getinstallautoconfautomakelibtoolsudoapt-tesseract-ocr, which is an open-source optical character recognition engine, supported
Then the previous OCR said. The previous article introduced the simple use of tesseract in the command line, of course, to inherit into our program, or need code implementation. Here's a sample of the Java implementation that you'll share.watermark/2/text/ahr0cdovl2jsb2cuy3nkbi5uzxqvbg1qnjiznty1nzkx/font/5a6l5l2t/fontsize/400/fill/i0jbqkfcma==/ Dissolve/70/gravity/center "/>Take the code to scan the image above, and then output the results.The main id
automatic distinction between computer and human public Turing Test (Completely Automated Public Turing test to Tell Computers and Humans Apart)Abbreviation CAPTCHA, commonly known as verification codeWindows1. Install Tesseract,Installation path join path, set tessdata_prefix environment variableotherwise error:' Error opening data file \\exe\\tesseract-ocr\\tessdata/eng.traineddataPlease make sure the TESSDATA_PREFIX environment variable are set to
Send one last year to do a removal of the ticket interference item demo, the core processing is to remove the Red seal, do not destroy the red print superimposed text.It's just a little demo that hasn't been specifically optimized yet.And I don't know when we'll have the energy to continue optimizing it.Now let's give you a trial effect.Put on:Demo downloadHttp://files.cnblogs.com/files/tntmonks/GImage.zip(note: May be killed by anti-virus software, added as a trust can be)My contact information
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.