IBM provides comprehensive and industry-leading data warehousing and business analysis solutions, including information integration tools InformationServer, real-time and incremental data replication tools InfoSphereCDC, and data warehousing solutions InfosphereWarehouse; business analysis tools CognosBI and a series of business analysis applications.
IBM provides comprehensive and industry-leading data warehousing and business analysis solutions, including Information Server, InfoSphere CDC, and Infosphere Warehouse; business analysis tool Cognos BI and a series of business analysis applications.
This article mainly introduces IBM data Warehouse and business analysis solutions, especially how to use Infosphere Datastage, Infosphere CDC, Infosphere Warehouse, and Cognos to quickly create data analysis applications, to help you quickly master the basic methods for developing and analyzing applications using Infosphere Datastage, Infosphere CDC, Infosphere Warehouse, and Cognos.
For data warehouses and business analysis applications, IBM provides comprehensive and industry-leading solutions. In terms of software, it provides integrated and end-to-end solutions, including Information Server, InfoSphere CDC, and Infosphere Warehouse; business analysis tool Cognos BI and a series of business analysis applications; Infosphere BigInsights and Infosphere Streams are provided for Big Data analysis; for information supervision, provides Quality Stage data Quality management tools, Infosphere Optim data lifecycle management solutions, Infosphere Guardium Data Security Solutions, and IBM Smart Analytics System solutions, based on the user's expected data warehouse size, it provides users with pre-configured, optimized, scalable overall software and hardware configuration solutions, including model, configuration, and storage configuration. Network configuration can provide users with reasonable hardware selection and one-stop solutions. IBM also provides the Netezza data warehouse all-in-one solution with excellent performance and simple configuration; in terms of data warehouse models, IBM provides data warehouse models for banking, telecommunications, insurance, and retail. It can provide templates for industrial applications and accelerate industrial application modeling.
Shows the typical data Warehouse solution of IBM. We use Infosphere Warehouse as the EDW system of the enterprise data Warehouse, Infosphere Warehouse as the relational data mart system, and Cognos as the multi-dimensional data mart system; use Infosphere CDC to copy data from the Business System to the ODS system in real time; Use Infosphere Datastage to load data in batches to a data warehouse or a data set; Use Infosphere CDC to Load Incremental and real-time data; use Cognos BI and Cognos applications for business analysis.
Figure 1. architecture of the IBM data warehouse solution
Below, we will use a simple "Sales Performance Analysis" example to introduce how to use Infosphere Datastage, Infosphere CDC, Infosphere Warehouse, and Cognos to quickly create data Analysis applications.
In this experiment, we created db2olap in DB2 9.7 As the OLTP data source, and used DB2 9.7 provided by Infosphere Warehouse to create olapdb as the ODS/Data Warehouse System, inforsphere Datastage 8.7 is used as the ETL Tool to extract the required data from the data source db2olap, convert the data, and load the data into the olapdb ODS/Data Warehouse. At the same time, we use the CDC Transaction Stage provided by Infosphere CDC 6.5.1 and Infosphere Datastage 8.7 to load real-time and incremental data. Finally, we use Cognos BI 10.1.1 to implement final report presentation, OLAP analysis, and dashboard applications.
Environment preparation
In this experiment, we use the Redhat Linux operating system, kernel 2.6.1, and installed the following software on it:
- IBM Information Server 8.7
- Inforsphere CDC 6.5.1
- DB2 9.7.4
- IBM Cognos BI Server 10.1.1
- Ibm http Server 7.0
The following software is installed on a windows 7 client:
OLTP Data Source
In this experiment, we created the db2olap database in DB2 9.7 As the OLTP data source, including the following table and table structure definition information:
Listing 1. Data Source Definition
create table locations_s--location dimension (city_id char(8) not null primary key, prov_id varchar(10), area_id varchar(10), country_id varchar(10) ); create table city_s (city_id char(8) not null primary key, city varchar(10), city_population int); create table prov_s (prov_id varchar(10), prov varchar(10)); create table area_s (area_id varchar(10), area varchar(10)); create table products_s --products dimension (product_id varchar(10) not null primary key, sub_class_id varchar(10), class_id varchar(10)); create table product_s (product_id varchar(10) not null primary key, product varchar(50)); create table subclass_s (sub_class_id varchar(10), sub_class varchar(50)); create table times_s ---time dimension (day_id int not null primary key, day varchar(10), month_id int, month varchar(10), year_id int, year varchar(10)); create table salesperf_s (city_id char(8) not null, product_id varchar(10) not null, day_id int not null, sales decimal(10,2), costs decimal(10,2), constraint fk_day foreign key(day_id) references times_s, constraint fk_location foreign key(city_id) references locations_s, constraint fk_product foreign key(product_id) references products_s );