$60 Rebate MBNA Cash Card

$60 Rebate MBNA Cash Card
$60 Rebate for new MBNA SmartCash MasterCard®

Sunday, January 20, 2008

Lesson three creating a Pass-Through Mapping,Session and Workflow

In this lesson, create very simple mapping and execute by session and workflow.
Mapping is based on last lesson source and target definitions. Between source and target definition is source qualifier.

Source Qualifier
- When you add a relational or a flat file source definition to a mapping, you need to connect it to a Source Qualifier transformation. The Source Qualifier transformation represents the rows that the PowerCenter Server reads when it runs a sesson.
- Source Qualifier transformation can perform the following tasks:
  • Join data originating from the same source database
  • Filter rows when the PowerCenter Server reads source data
  • Specify sorted ports
  • Select only distinct values from the source
  • Create a custom query to issue a special SELECT statement for the PowerCenter Server to read source data.
Session and Workflow for this lesson are very simple. Looks like most logic are in Mapping. Session just like diagram to link all Mapping. Workflow is just runtime instance of session.

Tuesday, January 15, 2008

Lesson Two, Source , Target Definitions and Target Tables

Creating Source Definitions:
In Source Analyzer, to create a source definition you have to have a source--Database , Flat File or XML.... So you can Import from source and edit it, but seems you can't create by your own. For edit, you can add , delete and update columns, and add some meta data to each table definition.

Creating Target Definitions:
In Target Designer, to create a target definition, you can drag and drop from source definitions table list or create table by your self. Nothing special just like create a normal table. But remember select correct database type.

Creating Target Tables:
In target Designer, you can create table in database and informatica SQL statement from your target table definitions.
  1. Click Targets > Generate/Execute SQL.
  2. In the File Name field, enter the SQL DDL file name.
  3. Select the Create Table, Drop Table, Foreign Key and Primary Key options.
  4. Click the Generate and Execute button.
So from what I did and see , it's simple to create source and target definition. Just like create new tables in database. One thing I don't like, after important source definition from database. The layout view is not well arranged like other database logic view. you have to rearrange them by yourself. So many software can do this job automatically, I don't know why PowerCenter Designer can't do this.

Monday, January 14, 2008

Lesson One, Respository Users, Groups and Folder

This lesson is pretty simple just show you how to connect to repository. My environment was already setup by my company's administrator. So I do need worry about install and setup repository server and service.

If you don't have environment, you have to install server and do configuration them by yourself. I tale looked that part, just need very powerful machine. So I just ignore that.

For this lesson, includes connect to a informatica repository (domain, server, port , username and password) , create groups and create a folder. Setup permission on that created folder. All these did in Informatica PowerCenter Repository Manager. Nothing special, everybody can under stand it easily.

And in this lesson also includes creating source tables and data for this tutorial. You need have a database and execute appropriate SQL file in PowerCenter Designer. I used smpl_ms.sql for SQL Server. This SQL file includes table’s schema and data. Informatica PowerCenter connects database target or source by ODBC. I installed MS SQL Server Express in my local box. It's free and seems ok, doesn’t eat too much resources compare to oracle. When you setup ODBC, server is "locahost\sqlexpress".

After SQL execute you can see few new tables with some data in sql server mater database.

Sunday, January 13, 2008

Informatica PowerCenter Architecture

Before the 6 lessons. I must understand of PowerCenter Architecture. This is not copy and paste from help. I try to write down by my own under stand. It's not big deal for developer view. But may very help in job interview.

Domain and Service Architecture
- Domain:
  • Domain is collection of nodes and services.
  • Primary unit to centralize administration.
- Node:
  • Logical representation of a machine in a domain.
  • One node i each domain servers as a gateway for the domain.
  • All processes in PowerCenter run as services on a node.
- Services:
  • Two type of services: Core services and Application services.
  • Core services: support the domain and application services. E.g. Domain service, Log service.
  • Application services: represent PowerCenter server-based functionality. E.g. Repository service, Intergration service...
- Informatica Repository:
  • Contains a set of metadata tables withing repository database that informatica applications and tools access.
- Informatica Client:
  • Manages users, define sources and targets, builds mappings and mapplets with transformation logic, and create workflows to run the mapping logic. The Informatica Client has four client applications: Repository Manager, Designer, Workflow Manager and Workflow monitor.

What's my plan

1. First follow the tutorial from Powercenter Help .
This includes totally 6 lessons.

2. If I can get Informatica PowerCenter 8 Developer training, I'll follow training agenda
I'm trying to get a on site training combine level one and two together.

PowerCenter 8 – Level I Developer

Agenda:

· Data Integration Concepts

o Data Integration

o Mapping and Transformations

o Tasks and Workflows

o Metadata

· PowerCenter Components and User Interface

o PowerCenter Architecture

o PowerCenter Client Tools

o Lab - Using the Designer and Workflow Manager

· Source Qualifier

o Source Qualifier Transformation

o Velocity Methodology

o Lab Project Overview

o Lab A - Load Payment Staging Table

o Source Qualifier Joins

o Lab B - Load Product Staging Table

o Source Pipelines

o Lab C - Load Dealership and Promotions Staging Table

· Expression, Filter, File Lists and Workflow Scheduler

o Expression Editor

o Filter Transformation

o File Lists

o Workflow Scheduler

o Lab - Load the Customer Staging Table

· Joins, Features and Techniques I

o Joiner Transformation

o Shortcuts

o Lab A - Load Sales Transaction Staging Table

o Lab B - Features and Techniques I

· Lookups and Reusable Transformations

o Lookup Transformation

o Reusable Transformations

o Lab A - Load Employee Staging Table

o Lab B - Load Date Staging Table

· Debugger

o Debugging Mappings

o Lab - Using the Debugger

· Sequence Generator

o Sequence Generator Transformation

o Lab - Load Date Dimension Table

· Lookup Caching, More Features and Techniques

o Lookup Caching

o Lab A - Load Promotions Dimension Table

o Lab B - Features and Techniques II

· Sorter, Aggregator and Self-Join

o Sorter Transformation

o Aggregator Transformation

o Active and Passive Transformations

o Data Concatenation

o Self-Join

o Lab - Reload the Employee Staging Table

· Router, Update Strategy and Overrides

o Router Transformation

o Update Strategy Transformation

o Expression Default Values

o Source Qualifier Override

o Target Override

o Session Task Mapping Overrides

o Lab - Load Employee Dimension Table

· Dynamic Lookup and Error Logging

o Dynamic Lookup

o Error Logging

o Lab - Load Customer Dimension Table

· Unconnected Lookup, Parameters and Variables

o Unconnected Lookup Transformation

o System Variables

o Mapping Parameters and Variables

o Lab - Load Sales Fact Table

· Mapplets

o Mapplets

o Lab - Load Product Daily Aggregate Table

· Mapping Design

o Designing Mappings

o Workshop

· Workflow Variables and Tasks

o Link Conditions

o Workflow Variables

o Assignment Task

o Decision Task

o Email Task

o Lab - Load Product Weekly Aggregate Table

· More Tasks and Reusability

o Event Raise Task

o Event Wait Task

o Command Task

o Reusable Tasks

o Reusable Session Task

o Reusable Session Configuration

o PMCMD Utility

· Worklets and More Tasks

o Worklets

o Timer Task

o Control Task

o Lab - Load Inventory Fact Table

· Workflow Design

o Designing Workflows

o Workshop (Optional)

PowerCenter 8 – Level II Developer

Agenda:

· Architecture Overview and High Availability

o Architectural overview

o Domains, nodes, and services

o Administration Console

o Configuring services

o High Availability

· Mapping and Session Techniques

o Mapping parameters and variables and parameter files

o File lists

o Dynamic lookup cache

o Data driven aggregation

o Incremental aggregation

o Denormalization

· Workflow Techniques

o Using Tasks

o Workflow Control and Restart

o Workflow Alerts

o Dynamic Scheduling

o Pseudo-looping techniques

· Workflow Recovery

o Workflow recovery principles

o Task recovery strategy

o Workflow recovery options

o State of operation

o Resume recovery strategy

o Recovery using the command line

· Transaction Control

o Database Transactions

o Transaction Control transformation

o PowerCenter transaction control options

o Transformation scope

· Error Handling

o Error categories

o Error logging

o Error handling strategies

· Object Queries, Object Migration, and Comparing Objects

o Creating object queries

o Migration

o Comparing objects

o Repository Reporting

o Metadata Reports

o Repository reports

· Memory Allocation

o Optimizing session memory

o Optimizing transformation caches

o Auto-cache sizing

· Performance Tuning Methodology

o Session dynamics

o Measuring performance

o Testing for bottlenecks

o Optimization techniques

· Pipeline Partitioning

o Pipeline types

o Multi-partition sessions

o Partition points and types

o Using dynamic partitioning

Informatica Resources

I really can't find many useful website except Informatica own website.
But you have to be a partner or customer to access these site or have to buy documents.
The lucky thing is I found some Informatica Powercenter 7 documents are free in Chinese.

Anyway, all I get is just help in Powercenter 8.

Here is the list:

Informatica Customer Portal

As an Informatica customer, you can access the Informatica Customer Portal site at http://my.informatica.com. The site contains product information, user group information, newsletters, access to the Informatica customer support case management system (ATLAS), the Informatica Knowledge Base, and access to the Informatica user community.

Informatica Web Site

You can access the Informatica corporate web site at http://www.informatica.com. The site contains information about Informatica, its background, upcoming events, and sales offices. You will also find product and partner information. The services area of the site includes important information about technical support, training and education, and implementation services.

Informatica Developer Network

You can access the Informatica Developer Network at http://devnet.informatica.com. The Informatica Developer Network is a web-based forum for third-party software developers. The site contains information about how to create, market, and support customer-oriented add-on solutions based on interoperability interfaces for Informatica products.

Informatica Knowledge Base

As an Informatica customer, you can access the Informatica Knowledge Base at http://my.informatica.com. Use the Knowledge Base to search for documented solutions to known technical issues about Informatica products. You can also find answers to frequently asked questions, technical white papers, and technical tips.

Why I create this blog

I'm J2EE developer now, I'm interesting to become a Informatica Powercenter ETL developer now. When I google online I can't find many useful website or article. So I thing maybe create a blog to record my study trace and later may create a Informatica Powercenter website.

First I'll follow tutorial in Informatica Powerceneter help as start point, after that I don't know yet. Hope I can find something to continue or put it based on real project in my job.