Courses


Course : Data Warehousing Duration : 25 Hrs
Sub Course : AB-INITIO
Course Content : Ab Initio Introduction & Advanced Topics
Introduction to Ab Initio. Operating Systems & Hardware Platforms that it supports

Products of Ab Initio Software Corporation
  • The Ab Initio Co>Operating System
  • Graphical Development Environment (GDE)
  • The Ab Initio Enterprise Meta>Environment (EME) 
Salient Features of Ab Initio 

Understanding the High Level Architecture of Ab Initio

What is the Anatomy of Running a Job? What happens when we push the 'Run' Button?   
  • Deployment Process
  • Host Process Creation
  • Agent Process Creation
  • Component Process Creation
  • Successful Component Termination
  • Agent Termination
  • Host Termination
  • Abnormal Component Termination 
How do we Connect from GDE to the Server (Run a Settings)     
  • Host Profile File (.aih)
  • Connection, Co-Operating System, Shell & Host Directory 
What is Graph Programming? What are the basic parts of an Ab Initio Graph?
  • Broad Classification of Components
  • Dataset Components
  • Program / Computing Components 
  • Data Manipulation Language (DML)
  • Record Formats
  • DML Expressions
  • Transform Functions
  • Key Specifiers 
How do we build a basic Ab Initio Graph? Using .dat(Data File) & .dml(Record Format File)

Dataset Components
  • Input File
  • Output File
  • Intermediate File
  • Lookup file 
Editors in the GDE
  • Record Format Editor
  • Expression Editor
  • Transform Editor
  • Key Specifier Editor
  • Package Editor
  • Variables Editor
  • Sequence Specifier Editor 
Transform Functions
  • Transform File(.xfr)
  • Functions, Variables, Statements & Rules 
Transform Components - Multistage and Advanced Components
  • Filter by Expression
  • Dedup Sorted
  • Reformat
  • Aggregate
  • Scan
  • Rollup
  • Join
  • Match Sorted
  • Normalize
  • 10.Denormalize Sorted 
Lookup Dataset's & DML Lookup Functions

Partition Components
  • Broadcast
  • Partition by Key
  • Partition by Expression
  • Partition by Percentage
  • Partition by Range
  • Partition by Round-Robin
  • Partition with Load Balancing 
Multifiles and Multidirectories

De-partition Components
  • Concatenate
  • Gather
  • Interleave
  • Merge 
What is Parallelism - Types    
  • Component ||'sm
  • Pipeline ||'sm
  • Data ||'sm
Repartitioning - When & Why

Layout Implications

Sort Components
  • Sort
  • Sort within Groups
  • Sample
  • Partition by Key & Sort
Working with Database's. Introduction Database Configuration File (.dbc)

Interactive Database Components (IDB)
  • Input Table
  • Output Table
  • Run SQL
  • Truncate Table
  • Update Table
  • Join with DB 
Miscellaneous Components   
  • Redefine Format
  • Gather Logs
  • Replicate
  • Run Program
  • Thrash 
Deprecated Components    
  • Find Splitters 
Compress Components    
  • Compress
  • Uncompress
  • GZip
  • GUnzip 
File Transfer Protocol (FTP) Components 
  • FTP From
  • FTP To 
Testing and Validation of Ab Initio Graphs using Validate Components
  • Check Order
  • Compare Records
  • Validate Records
  • Generate Records
  • Generate Random Bytes
  • Compute Checksum
  • Compare Checksum 
Phasing & Checkpointing  
  • Reusability Features  
  • Sub graphs
  • Graph Parameters 
The AbInitio Enterprise Meta>Environment(EME)

High level anatomy of the EME
  • Relation to sandbox environment
  • Checking in your sandbox
  • Checking out a graph or a sandbox
  • Locking in order to change an object
  • Tagging
  • Common projects
  • Dependency and Impact analysis
Performance Tuning AbInitio Graphs

What is Good Performance?
  • Going Parallel 
  • Serial Inputs 
  • Phases 
  • Record Formats and Components 
  • Memory Usage 
  • Various Tips