Kettle series tutorial 1

Source: Internet
Author: User
1. kettle introduction kettle is an ETL (Extract, TransformandLoad Extraction

1. kettle introduction kettle is an ETL (Extract, Transform and Load Extraction

Zookeeper
1. kettle is an ETL (Extract, Transform and Load extraction, conversion, and loading) tool, which is frequently used in data warehouse projects, kettle can also be used in the following scenarios:

Integrate data between different applications or databases

Export data from the database to a text file

Load large volumes of data into the database

Data cleansing

Integration of application-related projects is a use

Kettle is very simple to use. It does not need to write code to implement the business through the graphic interface design. Therefore, kettle is designed for metadata;

Kettle supports many input and output formats, including text files, data tables, and commercial and free database engines. In addition, kettle's powerful conversion function allows you to easily manipulate data.

The following is a simple "Hello World" example. This tutorial will show you how to use kettle easily, so that you can learn more complex conversion functions.

Install kettle

Introduction to kettle design tool spoon

Hello world example

Redesign the helloworld example

2. Getting started with kettle 2.1

Download kettle from the official website;

Demand Environment:

Kettle requires jre1.5 and later versions, which can be downloaded from the oracle official website for free;

Kettle Installation

Kettle can directly decompress the zip file to the specified folder without installation. On unix-like operating systems, you need to execute the following script:

Cd Kettle

Chmod + x *. sh

Run

A graphical user interface in kettle is spoon. spoon can design conversion and job, and can also run conversion and job. The following content will continue to introduce them.

2.2 Introduction to kettle design tool spoon

Spoon is a graphic design tool used to design and test the data exchange and processing process. It can also be executed through the command line (terminal.

Resource library and files

Design jobs and conversions in spoon. kettle provides two storage methods: resource library and file;

If you select a resource library, you need to create a resource library when spoon is started for the first time. Select the file method. If the job saves the file, the extension is KJB and the file extension is KTR. To simplify learning, the latter is used in the following tutorial.

Start spoon

Run spoon. bat in windows and spoon. sh in unix-like systems. At startup, a dialog box is displayed, prompting you to select a resource library and enter connection information. Click the cancel button.

Then, you can see the welcome window. Click "options" under the "Tools" menu. In the pop-up window, you can perform some global settings, such as language, log, and other information. After the settings, You need to restart the settings to take effect.

There are already too many other

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.