Website Link: http://wiki.pentaho.com/display/EAI/Pan+User+DocumentationPanA Pan is a program that can perform a transformation that is edited using spoon.The decompression of PDI Software.zip has been pan.batcommand line using pan to execute transformationThe official website mainly introduces the commands under the Linux platform, I mainly introduce the commands under the Windows platformOptions optionFormat/option: "Value"Parameters parametersFormat "-param:name=value"Repository Warehouse Sel
This is the last Sunday. I think it is better to record it. After all, my memory is not very good and it is easy to forget.
On Saturday, I started to discuss with Northwest China and Zhang Wei where to go for dinner. I thought annie said on
The data in the database may be transferred in different databases for replacement. Because different databases may use different character sets, the resulting data may be garbled. This time, we ran data in a job. After the data was run, the
Two months ago, the process of a workbook, this is not a tutorial not too much to tell the specific painting methods and specific software techniques, mainly when I draw this picture
Think of some analysis of the image of the way, so strong
1. It can only have one primary query result set.
2. Place the $ {parameter name} parameter in the SQL statement at the bottom of the DATA page in the upper-right corner. The parameter must have a default value. Otherwise, no DATA is displayed in
This example is simple and the difficulty lies in the installation of your Hadoop2.20 plugin (my previous blog post). The steps to implement are as follows:
1. Create a job
Create a kettle job to achieve the following effects.
2. Configure
Kettle requires a JDK environment. You can download it on the Oracle official website. In addition, JDBC or ODBC is required to use kettle. I prefer JDBC. I hate to understand the concept and knowledge of JDBC.
"What is JDBC?
JDBC (Java Data Base
I. Extracting data from HDFS to an RDBMS1. Download the sample file from the address below.Http://wiki.pentaho.com/download/attachments/23530622/weblogs_aggregate.txt.zip?version=1&modificationDate =13270678580002. Use the following command to place
The layout of the dashboard is very troublesome. This CSS layout is a virtue. This mainly involves layout and compontens layout size settings.
Layout layout includes row objects and column objects, as well as images and HTML objects.
A row object
Http://www.aboutyun.com/thread-7450-1-1.html
There is a very large table: Trlog The table is about 2T.Trlog:CREATE TABLE Trlog(PLATFORM string,user_id int,Click_time String,Click_url string)Row format delimitedFields terminated by ' t ';
Environment:
Kettle: Kettle-Spoon version stable release-4.3.0
MySQL: MySQL Server 5.5.
Database connection information:
Test the database connection.
Error connecting database [MySql-1]: org. pentaho. Di. Core. Exception. kettledatabaseexception:
Erroccured while trying to connect to the database
Predictionwhile loading class
Org. gjt. Mm. MySQL. Driver
Org. penta
Ls-a to display all objects, because.) The object of XX is hidden by default)
Execute again./spoon.sh
[Cognos@bitic data-integration]$./spoon.sh
/home/cognos/pdi-ce-4.2.0-stable/data-integrationINFO 11-11 14:56:34,164-spoon-logging goes to File:///tmp/spoon_66d28e63-4a9e-11e3-a301-7b09d1d32e5b.logINFO 11-11 14:56:35,641-logging to Org.slf4j.impl.JCLLoggerAdapter (Org.mortbay.log) via Org.mortbay.log.Slf4jLogINFO 11-11 14:56:35,646-class org.pentaho.
buildguy): An Unexpected ERROR occurs in Spoon: probable cause: close other Spoon windows before stopping spoon! 12:25:08-Spoon-Java heap space2015/01/05 12:25:08-insert/update. 0-ERROR (version 5.0.1-stable, build 1 from 2013-11-15_16-08-58 by buildguy): java. lang. outOfMemoryError: Java heap space2015/01/05 12:25:0
of data cleaning and data conversion is achieved by setting up the visual function node of data processing beforehand. For data reduction and integration, a variety of data processing function nodes are provided through the combination preprocessing subsystem, which can quickly and efficiently complete data cleaning and data conversion process in a visual way.
4. ETL Tool Introduction
ETL Tool function: must be extracted to the data can be flexible calculation, merging, split and other convers
through the source run Spoon
Kettle Source engineering itself may be in the linux64 bit machine debugging, SWT configuration is linux64 Library, all in the operation of the source code needs to be modified to Win32 SWT, steps as follows: Project à property Àjava build Pathàlibrariesàadd Jars
Then remove the linux64 SWT Library
Finally open Src-uiàorg.pentaho.di.ui.spoonàspoon.java, Run Asàjava application two. Source Analysis 2.1. Modify the Kettle
1. What is Kettle
Kettle is "kettle e.t.t.l. Envirnonment" initials only, which means it is designed to help you achieve your ETL needs: Extract, transform, load data; Kettle translated into Chinese name should be called Kettle, The origin of the name as MATT, the program's main programmer, said in a forum: I want to put all kinds of data in a pot and then flow out in a specified format.
Kettle is an excellent, open source ETL software, which is based on Java implementation, code hosted on the
Thirty methods of Eggplant
1. fried eggplant StripsIngredients: 300 grams of tender eggplant.Ingredients: 10 grams of pepper, 10 grams of carrot, half an egg, 75 grams of wet starch, 750 grams of salad oil.Seasoning: 5 grams of iodized salt, 3 grams of MSG, 5 grams of soy sauce, 20 grams of sugar, 10 grams of vinegar, 3 for each onion, ginger, and garlic3 grams of coriander.Production:(1) clean and peel the eggplant with a decimal part, cut 4 cm long and 1 cm square lines, and put the eggs and w
connect to the Hadoop distribution also has not been kettle support, you can fill in the corresponding information requirements Pentaho develop one.There are 1 more cases where the Hadoop distribution is already supported by Kettle and has built-in plugins.3 is configured.3.1 Stop application is if kettle in the run first stop him.3.2 Open the installation folder our side is kettle, so that's spoon. File p
also has not been kettle support, you can fill in the corresponding information requirements Pentaho develop one.There are 1 more cases where the Hadoop distribution is already supported by Kettle and has built-in plugins.3 is configured.3.1 Stop application is if kettle in the run first stop him.3.2 Open the installation folder our side is kettle, so that's spoon. File path:3.3 Edit Plugin.properties file
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.