Data migration using kettle

Source: Internet
Author: User

Scenario

There are five databases, two of which are SQL servers and three are Oracle10g.

Target

Migrate the business data of the two SQL servers to the three Oracle databases according to the specific logic.

Kettle usage (basic)

Kettle installation and configuration

  1. You can download kettle installation on the official website.
  2. A database is required for kettle configuration. Kettle database stores metadata descriptions, tasks, and conversions required by kettle. Kettle's default login information is admin/admin.
  3. Follow the prompts to create an Oracle database connection.
    Server connection.

Note the difference between Database Name and instance for SQL Server connection
Name.

 

Basic concepts of kettle

  1. A job is responsible for organizing [conversion] together to complete a job. Generally, we need to split a large task into several logically isolated jobs, when these jobs are completed, the task is completed.
  2. Conversion: defines the container for data operations. Data operations are a process from input to output of data. It can be understood as a container with a lower granularity than a job. We divide tasks into jobs, then, you need to split the job into one or more transformations, and each transformation completes only part of the work.

Kettle basic example

  1. Kettle's error handling requires error logging in many scenarios, for example, if the migration prompts data problems, primary/foreign key errors, or violation of constraints, the current scenario should be recorded in one place for special processing.

Example

Main Process

Error message Configuration

 

  1. When the data volume is large, you can add filter parameters for processing.

 

 

  1. Value ing

 

 

 

  1. Connection

Note that the raw data used for the connection must be sorted in ascending order.

 

 

References

  1. Baidu Library # bi-pentaho authoritative guide
  2. Http://www.pentaho.com/
  3. Http://blog.csdn.net/ybygjy/article/details/7478113

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.