Courses


Course : Data Warehousing Duration : 40 Hrs
Data Stage : Data Stage
Course Content : IBM WebSphere DataStage and QualityStage 8.1
Unit -1 : Data Warehouse Fundamentals

An introduction to Data Warehousing - purpose of Data Warehouse - Data Warehouse Architecture - Operational Data Store - OLTP Vs Warehouse Applications - Data Marts- Data marts Vs Data Warehouses - Data Warehouse Life cycle.

Unit -2 : Data Modelling

Introduction to Data Modeling - Entity Relationship model (E-R model) - Data Modeling for Data Warehouse, Narmalization process - Dimensions and fact tables - Star Schema and Snowflake Schemas.

Unit -3 : ETL Design Process

Introduction to Extraction, Transformation & Loading- Types of ETL Tools - Key tools in the market .

Unit - 4 : Introduction to Datastage Version 7.5x2 & 8.1

Datastage introduction - IBM information Server architecture - DataStage components - DataStage main functions - Client components.

Unit - 5 : Datastage Administrator

Datastage project Administration - Editing projects and Adding Projects - Deleting projects Cleansing up project files - Enviranmental Variables-Environement management - Auto purging - Rutime Column Propagation(RCP) - Add checkpoints for sequencer - NLS configuration - Generated OSH (Orchestra Engine) - System formats like data, timestamp - Project protect - Version details.

Unit - 6: Datastage Director

Introduction to Datastage Director - Validating Datastage Jobs - Executing Datastage jobs - Job execution status - Monitoring a job - Job log view - job scheduling - Creating Batches - Scheduling batches.

Unit - 7 : Datastage Designer

Introduction to Datastage Designer - Importance of Parallelism - Pipeline Parallelism - Partition Parallelism - Partitioning and collecting - Symmetric Multi Pro9cessing (SMP) Massively Parallel Processing (MPP) - Partition techniques - Datastage Repository Palette - Passive and Active stages - Job design overview - Designer work area - Annotations - Creating jobs - Importing flat file definitions - Managing the Metadata environment - Dataset management - Deletion of Dataset - Routines - Arguments.

Unit - 8: Working with Parallel Job Stages

Database Stages
Oracle - Teradata - ODBC - dynamic RDBMS

File Stages

Sequential file - Dataset - File set - Lookup file set.

Processing Stages

Copy - Filter - Funnel - Sort Remove duplicate - Aggregator - Modify - Compress - Expand - Decode - Encode - Switch - Pivot stage - Lookup - Join - Merge - difference between look up, join and merge - change capture - Change apply - Compare - Difference - Surrogate key generator - Transformer.

Debug Stages

Head - Tail - Peek - Column generator - Row generator -Write RangeMap Stage.

Real Time Stages

XML input - XML output

SAP Plug-in Stages

ABAP Stage,IDoc Extract Stage,IDoc Load Stage, BAPI stage

Local and Shared containers
Routines creation
Unit - 9: Advanced Stages in Parallel Jobs (Version 8.1)

Range Look process - Surrogate key generator stage - Slowly changing dimension stage - iway stage - FTP stage - Job performance analysis - Resource estimation - Slowly Changing Dimensions implementation - Performance tuning.

Unit - 10: Job Sequencers

Arrange job activities in Sequencer - Triggers in Sequencer - Restablity - Recoverability - Notification activity - Terminator activity - Wait for file activity - Start Look activity - Execute Command activity - Nested Condition activity - Exception handling activity - User Variable activity - End Loop activity - Adding Checkpoints

Unit - 11: Information Analyzer

IBM WebShpere Information Analyzer overview - Data Profiling process - Column analysis - Primary key analysis - Foreign key analysis - Cross-domain analysis - Baseline analysis - Aanalysis result publication - Deleting statistics reports - Baseline analysis reports - Cross-domain analysis summary statistics reports - Beseline analysis reports - Cross-domain analysis reports - Primary key reports - Foregin key analysis reports.

Unit - 12: WebShpere Quality Stage

About Data Quality - Datastage quality stages - Investigate stage - Standardize stage Match Frequency stage - Unduplicate Match stage - Reference Match Stage - Survive stage-MNS stage,WAVES stage - Migration of Datastage & Quality Stage Jobs from 7.5x2 to Version 8.1

Unit - 13: IBM Information Server Administration Guide

IBM WebSphere DataStage administration - Opening the IBM Information Server Web console - setting up a project ion the console - Customizing the project dashboard - Setting up security - Creating users in the console - Assigning security roles to users and groups - Managing licenses - Managing active sessions - Managing logs - Managing schedules - Backing up and restoring IBM Information Server.

Additional Features
  • Datastage Certification Guidence
  • Performance Tunning of Parallel Jobs
  • Datastage Installation process and setup
  • Full Length Class Room Notes Which Covers all the concepts
  • Well Versed Materials Which Covers Datawarehousing Basics, Datastage Concepts UnixCommands,Shall Scripts, Databases