Check out the video associated with this blog! In version 9.1 of DataStage, you can read Excel sheet data (.XLS and .XLSX) directly into DataStage using a new stage called the Unstructured Data stage. Let’s take a closer look at how to do this. First, we’ll create a simple parallel job which connects three stages: an Unstructured Data stage, a Transformer stage, and a Data Set stage. This job will read the data from the Excel… read more →
Check out the video associated with this blog! It is now possible to load multiple database tables with a single Connector stage. Let’s take a look at how to do it. First, we will create a simple parallel job that has two output links from the Transformer stage going to the target DB2 Connector stage. In this example, we have an input file that contains raw data which will be used to update two tables:… read more →
What is Stream Computing? Nowadays, we hear a lot of buzz around stream computing, but what is it? According to Wikipedia: “Stream processing is a computer programming paradigm, related to SIMD (single instruction, multiple data), that allows some applications to more easily exploit a limited form of parallel processing. In computing, the term stream is used in a number of ways, in all cases referring to a sequence of data elements… read more →
Performance is a key factor in the success of any data warehousing project. Optimization and performance should be considered from the inception of the design and development process. Ideally, a DataStage job should process large volumes of data within a short period of time. Maximum throughput requires a well-performing infrastructure; without it, tuning DataStage jobs will not make much of a difference. Determining The… read more →
InfoSphere Information Server v8.7 is IBM’s newest release of the data integration platform. Released in October 2011, it offers new tools that can be extremely valuable to a DataStage developer. Why Upgrade to Information Server 8.7? There are many reasons to stay current with one of the newer releases of DataStage. The most significant reason for upgrading to Information Server 8.7 is its new features. New options… read more →
The following video presents an overview of DataStage Parallel Processing Architecture.