Courses


Course : Data Warehousing Duration : 30 Hrs
Sub Course : MicroStrategy
Course Content : Ab Initio Introduction & Advanced Topics
Introduction to Ab Initio. Operating Systems & Hardware Platforms that it supports

Products of Ab Initio Software Corporation
  •  The Ab Initio Co>Operating System
  •  Graphical Development Environment (GDE)
  •  The Ab Initio Enterprise Meta>Environment (EME) 
Salient Features of Ab Initio

Understanding the High Level Architecture of Ab Initio

What is the Anatomy of Running a Job? What happens when we push the 'Run' Button?
  •  Deployment Process
  •  Host Process Creation
  •  Agent Process Creation
  •  Component Process Creation
  •  Successful Component Termination
  •  Agent Termination
  •  Host Termination
  •  Abnormal Component Termination 
How do we Connect from GDE to the Server (Run Settings)     
  •  Host Profile File (.aih)
  •  Connection, Co-Operating System, Shell & Host Directory 
What is Graph Programming? What are the basic parts of an Ab Initio Graph?

Broad Classification of Components
  •  Dataset Components
  •  Program / Computing Components 
Data Manipulation Language (DML)   
  •  Record Formats
  •  DML Expressions
  •  Transform Functions
  •  Key Specifiers 
How do we build a basic Ab Initio Graph? Using .dat(Data File) & .dml(Record Format File)

Dataset Components
  •  Input File
  •  Output File
  •  Intermediate File
  •  Lookup file 
Editors in the GDE               
  •  Record Format Editor
  •  Expression Editor
  •  Transform Editor
  •  Key Specifier Editor
  •  Package Editor
  •  Variables Editor
  •  Sequence Specifier Editor 
Transform Functions     
  •  Transform File(.xfr)
  •  Functions, Variables, Statements & Rules 
Transform Components - Multistage and Advanced Components
  •  Filter by Expression
  •  Dedup Sorted
  •  Reformat
  •  Aggregate
  •  Scan
  •  Rollup
  •  Join
  •  Match Sorted
  •  Normalize
  •  10.Denormalize Sorted 
Lookup Dataset's & DML Lookup Functions

Partition Components
  •  Broadcast
  •  Partition by Key
  •  Partition by Expression
  •  Partition by Percentage
  •  Partition by Range
  •  Partition by Round-Robin
  •  Partition with Load Balancing 
Multifiles and Multidirectories

De-partition Components
  •  Concatenate
  •  Gather
  •  Interleave
  •  Merge 
What is Parallelism - Types   
  •  Component ||'sm
  •  Pipeline ||'sm
  •  Data ||'sm
Repartitioning - When & Why

Layout Implications

Sort Components
  •  Sort
  •  Sort within Groups
  •  Sample
  •  Partition by Key & Sort
Working with Database's. Introduction Database Configuration File (.dbc)

Interactive Database Components (IDB)
  •  Input Table
  •  Output Table
  •  Run SQL
  •  Truncate Table
  •  Update Table
  •  Join with DB 
Miscellaneous Components   
  •  Redefine Format
  •  Gather Logs
  •  Replicate
  •  Run Program
  •  Thrash 
Deprecated Components    
  •  Find Splitters 
Compress Components    
  •  Compress
  •  Uncompress
  •  GZip
  •  GUnzip 
File Transfer Protocol (FTP) Components 
  •  FTP From
  •  FTP To 
Testing and Validation of Ab Initio Graphs using Validate Components 
  •  Check Order
  •  Compare Records
  •  Validate Records
  •  Generate Records
  •  Generate Random Bytes
  •  Compute Checksum
  •  Compare Checksum 
Phasing & Checkpointing

Reusability Features
  •  Sub graphs
  •  Graph Parameters 
The AbInitio Enterprise Meta>Environment(EME)   
  •  High level anatomy of the EME
  •  Relation to sandbox environment
  •  Checking in your sandbox
  •  Checking out a graph or a sandbox
  •  Locking in order to change an object
  •  Tagging
  •  Common projects
  •  Dependency and Impact analysis 
Performance Tuning AbInitio Graphs

What is Good Performance?
  •  Going Parallel 
  •  Serial Inputs 
  •  Phases 
  •  Record Formats and Components 
  •  Memory Usage 
  •  Various Tips