Informatica
INFORMATICA Transformations Tutorial & Filter Transformation
What is Transformation? Transformations is in Informatica are the objects which creates, modifies or...
Informatica ETL tool consists of following services & components
In this tutorial- you will learn
The overall architecture of Informatica is Service Oriented Architecture (SOA).
For example, in the below screenshot, you can see under domain window the folder "Domain_Rajesh" is created under which we have created a node name "node01_rajesh" and services as "guru99 integration services".
Node is a logical representation of a machine inside the domain. Node is required to run services and processes for Informatica.
You can have multiple nodes in a domain. In a domain, you will also find a gateway node.
The gateway node is responsible for receiving requests from different client tools and routing those requests to different nodes and services.
There are two types of services in Domain
PowerCenter repository is a relational database like Oracle, Sybase, SQL server and it is managed by repository service. It consists of database tables that store metadata.
There are three Informatica Client tools available in Informatica Powercenter. They are Informatica
These clients can access to the repository using repository service only.
To manage a repository there exists an Informatica service called Repository Service. A single repository service handles exclusively only one repository. Also, a repository service can execute on multiple nodes to increase the performance.
The repository services use locks on the objects, so multiple users cannot modify the same object same time.
You can enable version control in the repository. With the version control feature, you can maintain different versions of the same object.
Objects created in the repository can have following three state
As mentioned earlier, domain is the basic administrative control in Informatica. It is the parent entity which consists of other services like integration service, repository service, and various nodes.
The domain configuration can be done using the Informatica admin console. The console can be launched using web browsers.
Once open in a web browser it prompts for administrator login. The password is set during the Informatica installation.
After login into the Informatica domain, the home page looks something like this.
In the left pane it shows the existing nodes, repository services, integration services under the domain.
On the main window, it shows the status of those services, whether those are up or down.
Click on the properties menu in the admin page to view the properties of the domain.
Key properties of the domain are
Resilience timeout – If any of the integration service or repository services goes down then resilience timeout is the no of seconds the application service tries to connect to those services.
Restart Period – It is the maximum number of seconds the domain spends to restart a service.
Dispatch Mode – It is the policy used by the load balancer to dispatch tasks to various nodes.
Database type – The type of database on which domain is configured.
Database host – Hostname of the machine on which domain is configured.
Database port & name – It is the database port and the database instance name for the domain.
These properties can be modified based upon requirement.
PowerCenter client tools are development tools which are installed on the client machines. Powercenter designer, workflow manager, a repository manager, and workflow monitor are the main client tools.
The mappings and objects that we create in these client tools are saved in the Informatica repository which resides on the Informatica server. So the client tools must have network connectivity to the server.
On the other hand, PowerCenter client connects to the sources and targets to import the metadata and source/target structure definitions. So it also must have connectivity to the source/target systems.
The repository service maintains the connections from Powercenter clients to the PowerCenter repository. It is a separate multi-threaded process, and it fetches, inserts and updates the metadata inside the repository. It is also responsible for maintaining consistency inside the repository metadata.
Integration service is the executing engine for the Informatica, in other words, this is the entity which executes the tasks that we create in Informatica. This is how it works
For example, it can combine data from an oracle table and a flat file source.
So, in summary, Informatica integration service is a process residing on the Informatica server waiting for tasks to be assigned for the execution. When we execute a workflow, the integration service receives a notification to execute the workflow. Then the integration service reads the workflow to know the details like which tasks it has to execute like mappings & at what timings. Then the service reads the task details from the repository and proceeds with the execution.
Informatica being an ETL and Data integration tool, you would be always handling and transforming some form of data. The input to our mappings in Informatica is called source system. We import source definitions from the source and then connect to it to fetch the source data in our mappings. There can be different types of sources and can be located at multiple locations. Based upon your requirement the target system can be a relational or flat file system. Flat file targets are generated on the Informatica server machine, which can be transferred later on using ftp.
Relational– these types of sources are database system tables. These database systems are generally owned by other applications which create and maintain this data. It can be a Customer Relationship Management Database, Human Resource Database, etc. for using such sources in Informatica we either get a replica of these datasets, or we get select privileges on these systems.
Flat Files - Flat files are most common data sources after relational databases in Informatica. A flat file can be a comma separated file, a tab delimited file or fixed width file. Informatica supports any of the code pages like ascii or Unicode. To use the flat file in Informatica, its definitions must be imported similar to as we do for relational tables.
What is Transformation? Transformations is in Informatica are the objects which creates, modifies or...
After installing Informatica server and client, Informatica server needs to be configured. The...
Download PDF 1. What do you mean by Enterprise Data Warehousing? When the organization data is...
What is Normalizer Transformation? Normalizer is an active transformation, used to convert a...
What is Transaction Control Transformation? Transaction Control is an active and connected...
What is Router Transformation? Router transformation is an active and connected transformation which is...