Kettle series tutorial 1

Source: Internet
Author: User

Introduction to kettle

Kettle is an ETL (Extract, Transform, Load) tool. It is frequently used in data warehouse projects, and it can also be used in the following scenarios:

    • Integrate data between different applications or databases

    • Export data from the database to a text file

    • Load large volumes of data into the database

    • Data cleansing

    • Data integration as part of application-related projects

Kettle is simple to use. Business logic is designed in a graphical interface instead of being written as code; in that sense, kettle is metadata-driven.

Kettle supports many input and output formats, including text files, data tables, and both commercial and free database engines. In addition, kettle's powerful transformation features let you manipulate data easily.

The rest of this tutorial walks through a simple "Hello World" example. It shows the basics of using kettle so that you can go on to learn more complex transformations.

  • Install kettle

  • Introduction to kettle design tool spoon

  • Hello world example

  • Redesign the helloworld example

 

2. Getting started with kettle

2.1 Install kettle

Download kettle from the official website.

Requirements:

Kettle requires JRE 1.5 or later, which can be downloaded for free from the official Oracle website.
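The version requirement can be checked before unpacking kettle. The following is a minimal sketch, not part of the original tutorial: `java_version_ok` is a hypothetical helper name, and it assumes `java -version` prints the version in double quotes on its first line, which is common but not guaranteed.

```shell
#!/bin/sh
# Hypothetical helper: succeeds when a Java version string is 1.5 or newer.
# Handles the old scheme ("1.5.0_22", "1.8.0_292") and the new one ("11.0.2", "17").
java_version_ok() {
    version=$1
    major=${version%%.*}            # text before the first dot
    major=${major%%[!0-9]*}         # keep leading digits only
    [ -n "$major" ] || return 1     # empty or non-numeric input is rejected
    if [ "$major" -eq 1 ]; then
        rest=${version#*.}          # drop the leading "1."
        minor=${rest%%.*}           # second component, e.g. "5" or "8"
        minor=${minor%%[!0-9]*}
        [ -n "$minor" ] && [ "$minor" -ge 5 ]
    else
        [ "$major" -ge 5 ]
    fi
}

# Usage: read the quoted version from `java -version` (it is printed on stderr).
if command -v java >/dev/null 2>&1; then
    installed=$(java -version 2>&1 | sed -n '1s/.*"\(.*\)".*/\1/p')
    if java_version_ok "$installed"; then
        echo "Java $installed is new enough for kettle"
    else
        echo "Java '$installed' was not recognized as 1.5 or later"
    fi
fi
```

If no `java` command is on the PATH, the script prints nothing; in that case install a JRE first.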

Kettle Installation

Kettle needs no installer: unzip the archive into a folder of your choice. On unix-like operating systems, you also need to make the shell scripts executable:

cd Kettle
chmod +x *.sh

 

Run

Kettle's graphical user interface is called spoon. Spoon can be used to design transformations (conversions) and jobs, and also to run them. The following sections introduce it in more detail.

 

2.2 Introduction to kettle design tool spoon

Spoon is a graphical design tool used to design and test data exchange and processing flows. These flows can also be executed from the command line (terminal).

Repository and files

Jobs and transformations are designed in spoon. Kettle provides two ways to store them: in a repository (resource library) or in files.

If you choose a repository, you need to create one the first time spoon starts. If you choose files, jobs are saved with the extension KJB and transformations with the extension KTR. To simplify learning, the rest of this tutorial uses files.
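Because the extension tells a job from a transformation, it can also be used to pick a command-line runner. The sketch below rests on assumptions that go beyond this tutorial: that the kettle folder ships `pan.sh` for transformations and `kitchen.sh` for jobs, and that both accept a `-file=` option. The function names and the file names `hello.ktr` and `nightly_load.kjb` are hypothetical. The script only prints the command; it does not run it.

```shell
#!/bin/sh
# Sketch: choose a Kettle command-line runner from the file extension.
# Assumption: pan.sh runs transformations (.ktr), kitchen.sh runs jobs (.kjb).
kettle_runner_for() {
    case "$1" in
        *.ktr|*.KTR) echo "pan.sh" ;;
        *.kjb|*.KJB) echo "kitchen.sh" ;;
        *) echo "unknown file type: $1" >&2; return 1 ;;
    esac
}

# Print (do not execute) the command line that would run the given file.
kettle_command_for() {
    runner=$(kettle_runner_for "$1") || return 1
    echo "./$runner -file=$1"
}

kettle_command_for "hello.ktr"
kettle_command_for "nightly_load.kjb"
```

Check the options supported by your kettle version before relying on the printed command.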

 

Start spoon

Run spoon.bat on Windows or spoon.sh on unix-like systems. At startup, a dialog box prompts you to select a repository and enter connection information. Since this tutorial uses files, click the Cancel button.
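The choice of launcher can be written down as a small lookup. This is only an illustrative sketch: `spoon_launcher_for` is a hypothetical helper, and the list of `uname -s` prefixes treated as Windows environments is an assumption, not something kettle defines.

```shell
#!/bin/sh
# Sketch: map an operating-system name (as printed by `uname -s`)
# to the spoon launcher named in this tutorial.
spoon_launcher_for() {
    case "$1" in
        CYGWIN*|MINGW*|MSYS*|Windows*) echo "spoon.bat" ;;   # Windows-style environments
        *) echo "spoon.sh" ;;                                # Linux, macOS, other unix-like systems
    esac
}

launcher=$(spoon_launcher_for "$(uname -s)")
echo "Start spoon with: $launcher"
```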

You then see the welcome window. Click "Options" under the "Tools" menu. In the pop-up window you can change global settings such as the language and logging. After changing the settings, restart spoon for them to take effect.


 

The remaining sections, 2.3 Hello world example and 2.4 Redesign the hello world example, are covered in kettle series tutorial 2.

Who has used kettle (spoon), the open source ETL tool? Is there a detailed tutorial?

There are many online resources. If you already have some background, you can learn while building projects, and you can get started within a month.

This type of tool is easy to pick up, but to use it well you need a solid database foundation, some development ability, and a thorough understanding of the project and where it is heading.

We recommend finding a QQ group. Of course, you still need the basics and the ability to learn and research on your own.

Kettle and SSIS in SQL Server 2005 are the same type of tool.

Kettle is widely used now and is quite easy to use.

Who has a kettle video tutorial?

Pan.baidu.com/..hird4244
