Data format conversion (I) PDF conversion technology, format conversion pdf

Source: Internet
Author: User
Tags xsl xsl file

Data format conversion (I) PDF conversion technology, format conversion pdf

The PDF (Portable Document Format) Format is an electronic file Format developed by Adobe. This file format has nothing to do with the operating system platform, which makes it an ideal document format for electronic document distribution and digital information dissemination on the Internet. More and more e-books, product descriptions, company announcements, online materials, and emails are beginning to use PDF files. PDF files have become an industrial standard for digital information.

PDF has many advantages that cannot be compared with other electronic document formats. PDF file formats can encapsulate text, fonts, formats, colors, and image files independent of devices and resolutions in one file. This format file can also contain electronic information such as hypertext links, sound and dynamic images. It supports special files and provides high integration and security reliability. PDF files are becoming more and more widely used.

 

Sakura Maple software company can provide the following data format to PDF format file conversion service.

 

  1. Microsoft Word file → PDF file conversion

 

(1) language and version of Microsoft Word Files

★Microsoft Word 97/2013/XP

★Traditional Chinese Microsoft Word 97/2013/XP

★Microsoft Word 97/2013/XP

 

(2) Basic functions of generating PDF files

Generate a PDF file of the Word files provided by the user according to the standards in Article 1.3. When generating a PDF file, set the conversion conditions according to your needs.

★Font Embedding settings

According to user requirements, embed the contour data of the font in the source Word file into the PDF file. This option is only intended for TrueType and Type1 font embedding. When the font cannot be embedded, use blank text instead.

★Set the password for opening a PDF file

You can set a password to open a PDF file as required. The password can contain up to 32 characters. 0x20 ~ 0x7E, 0xA1 ~ Text and Chinese characters outside the 0 x DF range cannot be used as passwords. This application uses 40-bit RC4 to encrypt the PDF file.

★Security password settings

You can set a secure password based on your requirements. The password can contain up to 32 characters. 0x20 ~ 0x7E, 0xA1 ~ Text and Chinese characters outside the 0 x DF range cannot be used as passwords.

★Print the settings that are not allowed

The content of the PDF file cannot be printed.

★Modify settings not allowed in the document

The content of the PDF file cannot be changed.

★Select settings that are not allowed for text and graphics

The PDF file cannot be copied.

★Add or modify settings not allowed by the annotation and form field

PDF file comments and form fields cannot be added or modified.

★Image Type

According to user requirements, you can specify the compression method for images stored in PDF files. There are three image types: [automatic], [BMP], and [JPEG. When [automatic] is specified, select a smaller SIZE compression method after image compression. JPEG or ZLIB.

★Image Quality

According to user requirements, you can specify the image quality, the specified range is 1 ~ 100. The larger the number, the better the image quality. However, the generated PDF file is large.

 

  1. Conversion of various image and image files → PDF files

 

(1) Types of image files

★BMP, GIF, TIFF, WMF, EMF, JPG, JPEG

 

(2) Types of graphic files

★CGM, SVG

 

  1. Text File → PDF file conversion

 

(1) Source Text File Language

Simplified Chinese, Japanese, English

 

(2) text set used by the source text file

★Simplified Chinese: GB2312, GBK

★Japanese: Jis, Shift_Jis, EUC_JP, EUC_JP_Fix, ISO2022_JP, Window31J

★ISO8859-1.

★Unicode (UTF8, UTF16, UCS4, UCS8)

 

(3) basic functions of generating PDF files

★Language settings

Simplified Chinese, Japanese, English

★Font setting

Based on the font set by the System

★Font modification settings

Underline, remove line, bold, italic

★Font size setting

8, 9, 10 ,...... 72

★Font color settings

48 basic colors and 16 custom colors

★Page settings

Paper: A4, A3, B4, B5 or custom paper length and width

Direction: vertical and horizontal

Whitelist:

 

  1. XML + XSL file → PDF file conversion

 

 

Based on the Extensible Stylesheet Language (XSL) pattern planned and promoted by W3C, our company developed XML formatter software products for displaying, printing, and converting PDF files. Using the self-developed mongocreator library, the system outputs XML + XSL typographical results as PDF files without installing Acrobat.

 

(1) features of XSLFormatterV3.0

★This product has a powerful typographical function that allows you to typeset XML files with headers and footers on pages. Automatically generate directories and indexes.

★Supports text length scaling.

★With the self-developed mongocreator library, the system outputs the typographical result as a PDF file without installing Acrobat.

★High-speed and large-capacity layout.

★Drawing High-Resolution portraits in PDF using self-developed SVG painting software.

★Supports multiple languages

 

(2) technical features of the PDF Creator Library

★Supports text, images, graphics, and other complex functions, such as functions and shadows; supports image formats such as BMP and JPEG; supports graphical betiller curves; utilizes functions, shadow and other functions beautify graphics and images to create a strong three-dimensional effect.

★Supports multiple languages, including Chinese, Japanese, Korean, and Spanish, and can handle special Thai and Arabic languages.

★Supports the PDF compression function, which can compress bmp images into Jpeg formats and save them to PDF files. It can also compress text in Flate mode, thus greatly reducing the file Size.

★Supports PDF encryption.

★Supports PDF bookmarks and comments (including link comments, text comments, line comments, and underline comments.

★Supports font embedding.

★Supports image cutting.


How can I convert a pdf file into an excel file and edit the data in it?

It is not directly transferred. We recommend that you convert the pdf file into word and then paste it into excel.

Can PDF documents be electronic documents? How to convert?

1. How to extract several pages from a multi-page PDF file to become JPG Images
The most convenient way is to use Adobe Acrobat. Click "Export" in Adobe Acrobat and select "JPG". All pages of the PDF file are automatically converted into JPG files.
If you have Photoshop software, you can open a PDF file in Photoshop. Photoshop will let you choose to open the page, select the page, open it, and save it as a JPG image file. The disadvantage of this method is that only one page can be extracted at a time and repeated operations are required, which is inefficient.

Ii. How to convert a PDF document into a Word document
Open the PDF document to be converted in adobe acrobat, choose "file"> "Save as" from the menu, and select "Microsoft word" as the storage type to convert it into a word document. Alternatively, click "Export" in Adobe Acrobat and select "Word" as the export file type. The result is the same.
A software named e-PDF To Word Converter is used To convert PDF documents into Word documents. The Chinese special edition is available for download on the Internet.
Bytes.
In particular, if a PDF document is generated by scanning a paper file and converted into a Word document, every page obtained after conversion is an image inserted in the Word document and cannot be edited. Even PDF documents generated by other electronic documents such as Word are converted into Word documents, the text is also distributed in text boxes. If the layout is slightly more complex, text may overlap, and the workload for re-editing is heavy.

3. How to export text from a PDF file
If the PDF document is generated by other electronic documents such as Word that can be edited, Use adobe reader or adobe acrobat's "selection tool" to select the text and press "Ctrl" + "C, you can copy the text to the clipboard and paste it there. Adobe reader is invisible in the default status after installation. You need to choose tools> Custom toolbar from the menu to hook up before selecting tools, display it.
To export the text of the entire PDF file, you only need to operate "file"> "Save As text" in adobe reader, And the exported text is saved to a text file. In adobe acrobat, choose "file"> "Save as" in the Operation menu, select "plain text" as the storage type, or click "Export, select "More Formats"> "plain text" for the exported file type ".

4. Use OCR technology to export text
If a PDF file is generated by scanning a paper file, you cannot use the preceding simple method to export text. However, you can also use OCR text recognition technology to export text.
In adobe acrobat, you can choose documents> OCR text recognition, you can copy the selected text with the "select tool.
Adobe reader does not have its own OCR text recognition function. However, you can choose File> print from the menu, and select Microsoft Office Document Image Writer as the printer name ", this is a virtual printer installed on your computer with Microsoft Office 2003. It prints PDF files to files suffixed with "mdi" and automatically opens them, in the opened "mdi" file, operate the menu "... remaining full text>

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.