Atitit. OCR Framework Library Daquan Attilax Summary

Source: Internet
Author: User
Tags tesseract ocr

Atitit. OCR Framework library Daquan attilax Summary

tesseract asprise? Java OCR

Free To do nothing, found that Baidu has a OCR Text Recognition interface, it feels very interesting, take to study.        

Baidu Service Introduction: Word recognition is the natural scene of Baidu OCR Services, relying on Baidu's industry-leading OCR algorithm , provides full-text detection, recognition, integer image recognition, text line positioning and image recognition and other functions.

don't say more, just look at the demo !

java4less

The J4L OCR tools are set of components of the can being used to include OCR capabilities in Java applications. That's means you can receive faxes, PDF files or scan documents and extract business information from the images. The main 3 components is:

a Java wrapper for the tesseract OCR engine. The OCR engine tesseract itself is delivered under the Apache 2.0 license and we support a version compiled for Windows on Ly.

A PDF to Text converter.

A text document parser.

The document recognition process can therefore is divided in 2 steps:

The component takes an image file (TIF, PNG, jpg ...) or a PDF file and returns the text contained in it. The Java wrapper would perform this operation by using Tesseract. Alternatively can use any other OCR engine. If You is however using a PDF file, you'll use the PDF to Text converter.

In the second step, your Java application needs to understand the text returned by the OCR engine or PDF converter. This was done by the document parser. The document parser uses as input as text string (the data) and a XML file that describes the structure of the document an D The ouput is a business document either as a Java object or as a XML file

JAVA Implementation of Baidu OCR character recognition function - Zhang Rongjin column - Blog channel -CSDN. Net.html

author::  Nickname :Old Wow's claws( Full Name::AttilaxAkbar Al Rapanui Attilaksachanui) 

Kanji Name: Etila ( Ayron) , email:[email protected]

reprint Please indicate source: http://www.cnblogs.com/attilax/

Atiend

Atitit. OCR Framework Library Daquan Attilax Summary

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.