Learning and Training


IBM DataStage

This training cycle gives hands-on support to specialists who integrate data with the IBM InfoSphere Information Server product range and prepares them for professional certification in this product range.
The training course includes the following modules:
  • Overview of IBM InfoSphere Information Server product range
  • Development of data processing processes in IBM DataStage
  • Advanced development of data processing processes in IBM DataStage
During the one-day “Overview of IBM InfoSphere Information Server product range” module, participants become familiar with the architecture, principal components, and capabilities of the IBM InfoSphere Information Server product range.

The two-day “Development of data processing processes in IBM DataStage” module gives participants hands-on experience in developing data processing processes with IBM DataStage and introduces the main components and their functional specifics.

The two-day “Advanced development of data processing processes in IBM DataStage” module covers debugging and ETL optimization, along with recommendations for developing scalable, high-performance data processing processes in IBM DataStage that use minimal resources.


Module 1: 1 day

Overview of IBM InfoSphere Information Server

  • Concept of ETL
  • Architecture of IBM InfoSphere Information Server
  • Key components of IBM InfoSphere Information Server
  • Client application overview


Module 2: 2 days

Development of data processing processes in IBM DataStage 

  • Creating data processing processes in IBM DataStage
  • Compilation and execution
  • Pipeline parallelism
  • Data types in IBM DataStage
  • Managing file data
  • Managing relational data
  • Data conversion
  • Data combination
  • Batch data processing
  • Control processes


Module 3: 2 days

Advanced development of data processing processes in IBM DataStage

  • Test data generation
  • Performance management
  • Debugging data processing processes
  • Advanced data transformation techniques
  • Modular approach to development
  • Slowly Changing Dimensions
  • Best development practices
  • Development of complex control processes
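
For readers unfamiliar with the Slowly Changing Dimensions topic listed above, the idea can be sketched outside DataStage. The Python snippet below is not DataStage code and is not taken from the course materials. It is a minimal illustration of Type 2 handling, in which a changed attribute closes the current row and adds a new version. The names `DimRow`, `apply_scd2`, and the customer and city fields are hypothetical.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class DimRow:
    customer_id: int                  # business key (hypothetical)
    city: str                         # tracked attribute (hypothetical)
    valid_from: date
    valid_to: Optional[date] = None   # None marks the current version


def apply_scd2(dim: List[DimRow], customer_id: int, city: str, load_date: date) -> None:
    """Apply one incoming record to the dimension using Type 2 logic."""
    current = next(
        (r for r in dim if r.customer_id == customer_id and r.valid_to is None),
        None,
    )
    if current is not None and current.city == city:
        return  # attribute unchanged: keep the current version as-is
    if current is not None:
        current.valid_to = load_date  # close the outdated version
    dim.append(DimRow(customer_id, city, load_date))  # open a new version


dim: List[DimRow] = []
apply_scd2(dim, 1, "Riga", date(2020, 1, 1))
apply_scd2(dim, 1, "Riga", date(2020, 6, 1))     # no change, nothing added
apply_scd2(dim, 1, "Vilnius", date(2021, 3, 1))  # change, new version added
```

After these three loads the dimension holds two rows for customer 1: the closed "Riga" version and the current "Vilnius" version, so history is preserved rather than overwritten.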
 

IBM QualityStage

The purpose of this training is to present the complete data quality improvement cycle using IBM QualityStage, to develop practical skills in working with this tool, and to prepare students for certification in this IBM product.
 
The training includes the following modules:
  • Administration of IBM InfoSphere Information Server
  • Development of data processing processes in IBM QualityStage
  • Development of data standardization rules in IBM QualityStage
The two-day “Administration of IBM InfoSphere Information Server” module covers installation, configuration, backup, monitoring, and the system lifecycle.

The two-day “Development of data processing processes in IBM QualityStage” module covers the full development cycle for data quality improvement, including the investigation, standardization, verification, matching, and survivorship phases.

The two-day “Development of data standardization rules in IBM QualityStage” module is dedicated to the practical development of custom data standardization rules using IBM QualityStage Rule Sets.


Module 1: 2 days

Administration of IBM InfoSphere Information Server

  • Installation of IBM InfoSphere Information Server
  • Security settings
  • Resetting components of IBM InfoSphere Information Server
  • Configuration of connections to databases
  • System performance monitoring
  • Backups
  • System lifecycle


Module 2: 2 days

Development of data processing processes in IBM QualityStage

  • Concept of automated data quality improvement
  • Creating data processing processes in IBM QualityStage
  • Data analysis
  • Using and modifying data standardization rules
  • Data reconciliation within one source
  • Data reconciliation between two sources
  • Generation of a reference dataset


Module 3: 2 days

Development of data standardization rules in IBM QualityStage

  • Rule Set structure overview
  • Primary data analysis
  • Creating reference tables in Rule Set
  • Pattern Action File structure
  • Pattern Action File syntax
  • Rule Set testing
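
The data quality cycle described for this training (standardization, matching, survivorship) can be pictured with a small sketch. The Python below is not QualityStage code and does not reflect its Rule Set or Pattern Action File syntax. It is a simplified illustration under stated assumptions: the abbreviation table, field names, and the functions `standardize`, `match`, and `survive` are hypothetical, and an exact key comparison stands in for whatever matching logic the tool applies.

```python
from collections import defaultdict
from typing import Dict, List

# Hypothetical lookup table used to normalize address words.
ABBREVIATIONS = {"st": "street", "st.": "street", "ave": "avenue", "ave.": "avenue"}


def standardize(record: Dict[str, str]) -> Dict[str, str]:
    """Lowercase values, collapse whitespace, and expand address abbreviations."""
    name = " ".join(record["name"].lower().split())
    words = [ABBREVIATIONS.get(w, w) for w in record["address"].lower().split()]
    return {"name": name, "address": " ".join(words), "phone": record.get("phone", "")}


def match(records: List[Dict[str, str]]) -> List[List[Dict[str, str]]]:
    """Group records that share the same standardized name and address."""
    groups = defaultdict(list)
    for r in records:
        groups[(r["name"], r["address"])].append(r)
    return list(groups.values())


def survive(group: List[Dict[str, str]]) -> Dict[str, str]:
    """Build one surviving record, keeping the longest value for each field."""
    best = dict(group[0])
    for r in group[1:]:
        for field, value in r.items():
            if len(value) > len(best[field]):
                best[field] = value
    return best


raw = [
    {"name": "John  Smith", "address": "12 Main St.", "phone": ""},
    {"name": "john smith", "address": "12 main street", "phone": "555-0100"},
    {"name": "Ann Lee", "address": "3 Oak Ave", "phone": "555-0199"},
]
golden = [survive(g) for g in match([standardize(r) for r in raw])]
```

Here the two "John Smith" records only become comparable after standardization, and the surviving record takes the phone number from the second one. The "longest value wins" rule is just one possible survivorship choice.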


This training course is designed for ETL developers, analysts, testers, and managers of ETL development teams who use IBM InfoSphere Information Server products.