The simple introduction of DataStage practice __BI

Source: Internet
Author: User
DataStage Composition: DataStage Designer (designer): The design interface used to create DataStage job (Job). Each job specifies the data source, the desired transformation, and the destination of the data. The job is compiled into executable, scheduled by director and run by the server.
DataStage Director (conductor): Used to verify, schedule, run, and monitor DataStage operations.
DataStage Manager (Manager): Used to view, edit the contents of repository.
DataStage Administrator (Supervisor): for creating DataStage users, creating, moving items.   DataStage Installation: Very simple, all the way next, of course, first you must have authorization to do,:)   DataStage Simple examples (The following examples are running through the server job):   Functionality: Implements importing data from a fixed-length text file into an Oracle database. Summary: Although the function is simple, but embodies the entire ETL process, namely: from the data extraction to the data conversion finally load data to the specified library process. Figure: Each part description: Sequential_file_0 (sequence file): Data source file, can be. Txt,.del, and so on any order file. The main need is to set the file Name property in Outputs->general, select the source file, and then you need to set the structure corresponding to the document, using Outputs->columns->load ... To load the structure you need.   Transformer (stage component for conversion): the main need to set its "conversion rules" (personal understanding), when the data from the SEQUENTIAL_FILE_0 read out, according to the corresponding "rules" and then loaded into the database, In fact, the process of cleaning the data, of course, and so on, and so on, where the example is relatively simple, so do not need to do any processing of data. Although this component is simple to use but not very efficient, it should be used sparingly in practical practice.   Oracle_oci_9 (Oracle components): DataStage is able to complete the unified processing of heterogeneous databases, the main reason I think this is it. It provides a lot of database stage, such as db2,informix,oracle,sybase and so on, even if you do not need to, you can also use ODBC to complete the link to the database, a word: strong. The settings for the Oracle_oci_9 component are mainly on the database Source name (the name of the DB instance), the User ID (the table space name), the Password (table space password), the table name, the table structure, and so on. The parts are set with the attached drawings: sequential_file_0:   Transformer:oracle_oci_9:   Experience Summary: You may encounter problems at run time, with director can see their reported errors and warning messages, good oh ...




from:http://opengreat.blog.51cto.com/264115/62102




Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.