PDF File Processing (Converting to Small Text Files)

Source: Internet
Author: User

Question:

Most PDF files are produced by scanning books, so they are very large. I want to convert them into text files that are small enough to store on a mobile phone. The question is how to do the conversion.

Solution

1. There is no panacea: no dedicated tool converts a scanned PDF directly into text. Any tool that claims to do so is lying.

2. PDF supports two kinds of security: low-grade ordinary password encryption and high-grade security-certificate encryption.

3. Check which kind of security the PDF file uses and handle it accordingly. For simple password encryption, a common PDF password removal tool can strip the password and lower the security level. Certificate encryption has to be removed at the source, where the file was encrypted, because popular PDF password removal tools report an error such as: [This document was created with the "Adobe.PubSec [adbe.pkcs7.s5] 128-bit security V.4" encryption handler. This protection method is not supported.]

4. If the PDF file still carries a watermark after its security has been removed, that is the simplest case: use Adobe Acrobat to remove the watermark and save the file.

5. Adobe Acrobat has a very handy function: exporting to TIFF (.tif) format, which is the key to this whole method. It exports every page to its own TIFF file. Do not underestimate this format. First, the files are small. Second, Windows XP and later can display them directly as images, and if Microsoft Office is installed, Office opens them with its own tools. Third, certain OCR software can only convert files in TIFF format into text.

6. Anyone who has bought a Tsinghua Ziguang scanner knows that it comes with OCR software that recognizes scanned book pages and converts them into text. The software is easy to use: it can recognize a whole page automatically, or you can manually select the regions to recognize. Note that a page containing images must not be recognized automatically as a whole page; otherwise the output is a pile of garbled characters.

7. After recognizing each page, copy its text and images into a Word document one page at a time. You can then put your own Word files on the mobile phone. You decide what goes into them and how large they are, so the phone does not spend half a day loading them.

8. OCR software can be found on the Internet, or on the CD that comes with a scanner when you buy one. I will not go into how to remove a PDF security certificate here; respecting intellectual property rights is what matters most.
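For step 3, it can help to find out which kind of security a file uses before trying any tool. The sketch below is only a heuristic and is not part of the original method: it scans the raw bytes of the file for an /Encrypt entry and for the name of the security handler (/Standard for password security, /Adobe.PubSec for certificate security). The function names are made up for illustration, and the check only detects encryption; it does not remove it.

```python
import re
from pathlib import Path


def classify_pdf_security(data: bytes) -> str:
    """Guess the security type from raw PDF bytes (heuristic, not a parser)."""
    # An encrypted PDF references an encryption dictionary via /Encrypt.
    if b"/Encrypt" not in data:
        return "none"
    # Certificate (public-key) security names the Adobe.PubSec handler.
    if re.search(rb"/Filter\s*/Adobe\.PubSec", data):
        return "certificate"
    # Ordinary password security names the Standard handler.
    if re.search(rb"/Filter\s*/Standard(?![A-Za-z])", data):
        return "password"
    return "unknown"


def classify_pdf_file(pdf_path) -> str:
    """Read a PDF from disk and classify it with the heuristic above."""
    return classify_pdf_security(Path(pdf_path).read_bytes())
```

A result of "certificate" corresponds to the case where common password removal tools report the unsupported-handler error quoted in step 3.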
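Steps 5 and 6 can also be scripted once the pages exist as TIFF files. The sketch below swaps in the open-source Tesseract command-line tool for the scanner's bundled OCR software, which is an assumption and not what the original author used. It puts the exported pages in page order and builds one `tesseract` command per page. The file names (`book_Page_1.tif` and so on) and the function names are hypothetical, and Tesseract must be installed separately for `ocr_pages` to run.

```python
import re
import subprocess
from pathlib import Path


def page_number(path):
    """Return the last run of digits in the file name, or -1 if there is none."""
    numbers = re.findall(r"\d+", Path(path).stem)
    return int(numbers[-1]) if numbers else -1


def ordered_pages(paths):
    """Sort page images numerically so that page 10 comes after page 2."""
    return sorted(paths, key=lambda p: (page_number(p), str(p)))


def ocr_command(image_path, language="eng"):
    """Build the tesseract command line for one page image."""
    image = Path(image_path)
    # tesseract takes an output base name and appends ".txt" itself.
    return ["tesseract", str(image), str(image.with_suffix("")), "-l", language]


def ocr_pages(paths, language="eng"):
    """Run OCR on every page in order; raises if tesseract fails."""
    for image in ordered_pages(paths):
        subprocess.run(ocr_command(image, language), check=True)
```

As step 6 warns, pages that contain pictures tend to produce garbled output when recognized as a whole, so it is worth reviewing those pages by hand.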
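Step 7 is about keeping each file on the phone small. As a rough illustration, assuming the recognized text is saved as plain text instead of Word documents, the sketch below splits a long text into pieces of a chosen maximum size, breaking only at blank lines between paragraphs. The 200,000-byte default and all names are arbitrary choices for illustration.

```python
from pathlib import Path


def split_text(text, max_bytes=200_000, encoding="utf-8"):
    """Split text into chunks of at most max_bytes, breaking between paragraphs.

    A single paragraph larger than max_bytes is kept whole in its own chunk.
    Every paragraph, including the last, ends with a blank line in the output.
    """
    chunks, current, size = [], [], 0
    for paragraph in text.split("\n\n"):
        piece = paragraph + "\n\n"
        piece_size = len(piece.encode(encoding))
        # Start a new chunk when adding this paragraph would exceed the limit.
        if current and size + piece_size > max_bytes:
            chunks.append("".join(current))
            current, size = [], 0
        current.append(piece)
        size += piece_size
    if current:
        chunks.append("".join(current))
    return chunks


def write_chunks(chunks, out_dir, stem="book"):
    """Write chunks as book_001.txt, book_002.txt, ... in out_dir."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for index, chunk in enumerate(chunks, start=1):
        (out / f"{stem}_{index:03d}.txt").write_text(chunk, encoding="utf-8")
```

Smaller chunks open faster on the phone, at the cost of more files to manage.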
