Introduction to kettle
Kettle is an ETL (Extract, transform and load extraction, conversion, and loading) tool. It is frequently used in data warehouse projects. Kettle can also be used in the following scenarios:
Integrate data between different applications or databases
Export data from the database to a text file
Load large volumes of data into the database
Data cleansing
Integration of application-related projects is a use
Kettle is very simple to use. It does not need to write code to implement the business through the graphic interface design. Therefore, kettle is designed for metadata;
Kettle supports many input and output formats, including text files, data tables, and commercial and free database engines. In addition, kettle's powerful conversion function allows you to easily manipulate data.
The following is a simple "Hello World" example. This tutorial will show you how to use kettle easily, so that you can learn more complex conversion functions.
2. Getting started with kettle 2.1
Download kettle from the official website;
Demand Environment:
Kettle requires jre1.5 and later versions, which can be downloaded from the Oracle official website for free;
Kettle Installation
Kettle can directly decompress the ZIP file to the specified folder without installation. On Unix-like operating systems, you need to execute the following script:
CD kettle
Chmod + x *. Sh
Run
A graphical user interface in kettle is spoon. Spoon can design conversion and job, and can also run conversion and job. The following content will continue to introduce them.
2.2 Introduction to kettle design tool spoon
Spoon is a graphic design tool used to design and test the data exchange and processing process. It can also be executed through the command line (terminal.
Resource library and files
Design jobs and conversions in spoon. Kettle provides two storage methods: resource library and file;
If you select a resource library, you need to create a resource library when spoon is started for the first time. Select the file method. If the job saves the file, the extension is kjb and the file extension is Ktr. To simplify learning, the latter is used in the following tutorial.
Start spoon
Run spoon. BAT in windows and spoon. Sh in Unix-like systems. At startup, a dialog box is displayed, prompting you to select a resource library and enter connection information. Click the cancel button.
Then, you can see the welcome window. Click "options" under the "Tools" menu. In the pop-up window, you can perform some global settings, such as language, log, and other information. After the settings, You need to restart the settings to take effect.
For the following content, see kettle series tutorial 2.
2.3. Hello world example 2.4. redesign the Hello world example too many bytes too many