Pentaho biserver Community edtion 6.1 is a kettle component that can run kettle program scripts. However, since Kettle does not publish directly to the BISERVER-CE service, the ETL scripts (. KTR &. KJB) that are developed in the local (Windows environment) through a graphical interface need to be uploaded to the Biserver-ce managed repository. Can be run and dispatched by Biserver-ce.
Focus: Kettle Repository and BISERVER-CE resource pool establish a consistent directory management ETL script file.
The first part Kettle script development
1 Create a resource library and log in
Here's how to use the MySQL repository.
2 new MySQL library sync job
Basic logic: Reading from the source database all indicates that, in the result set, each row of data in the result sets is the name of each table, and the synchronization task is performed. Where the synchronization task is to clear the target database to specify the table, the source table data output to the target table.
The entire MySQL library synchronization job consists of five files. are stored in the/home/spads/directory of the Repository, two jobs:
JOB_ALL_TABLE_EXTRACT.KJB, JOB_SINGLE_TABLE_EXTRACT.KJB
Three trans:
TRANS_ACQUIRE_TABLENAMES.KTR, TRANS_SINGLE_TABLENAME_SET.KTR, TRANS_SINGLE_TABLE_EXTRACT.KTR
3 whether the local test program can run
Executes the JOB_ALL_TABLE_EXTRACT.KJB, verifying that the program executed successfully.
You can select the online test run and select it as needed.
The second part publishes the Kettle script to Biserver-ce
Biserver-ce using the Repository Manager file, log in Http://IP:8080/pentaho , you can click Browse files to view the program file.
Because the kettle resource pool and the BISERVER-CE resource pool table structure are independent, you cannot publish kettle scripts directly from the database. You need to export the *.KTR and *.KJB files in kettle, and upload them to the Biserver-ce repository via the Resource Management page.
Note here that there is a directory reference relationship between the 5 script files in the Kettle repository, so you need to set up the corresponding file directory in the Biserver-ce repository and publish the script files everywhere to the specified directory for the script file to run successfully.
1 Exporting Kettle Scripts
Note file name
2 Create a file directory that is consistent with the kettle repository in the Biserver resource pool
The kettle script's storage path in the kettle repository is the default root of the/home/spads,biserver-ce repository as/home so create a new directory under the/home directory Spads
3 Upload kettle script to the corresponding file directory
The third part runs, dispatches the kettle script
Select the job in Resource Explorer, you can select the action run in background on the right, and you can view the execution log through open. You can also choose schedule for scheduling processing.
Pentaho biserver Community edtion 6.1 Tutorial third post and Schedule kettle (Data integration) script Job & Trans