6 system requirements for infosphere cdc for netezza

94
InfoSphere CDC for Netezza databases /InfoSphere Data Replication for Netezza (Version 10.2) 3 About InfoSphere CDC and InfoSphere CDC Management Console 6 System requirements for InfoSphere CDC for Netezza Technology 9 Hardware and software requirements 10 Supported Netezza JDBC drivers 11 Running in a virtualization environment 12 Disk space requirements 13 RAM requirements 14 Port requirements 15 Before you install InfoSphere CDC for Netezza Technology 16 Required database, user accounts, and privileges 17 Assessing disk space and memory requirements 18 Understanding the importance of an appropriately configured disk subsystem 19 Understanding the InfoSphere CDC memory footprint 20 Considerations for the PureData System for Analytics appliance and InfoSphere CDC 21 Preparing your environment for an installation of InfoSphere CDC for Netezza Technology 22 Calculating database connections required by InfoSphere CDC for Netezza Technology 23 Creating queues in JMS providers 24 Installing or upgrading InfoSphere CDC for Netezza Technology 25 Installing InfoSphere CDC for Netezza Technology 26 To install InfoSphere CDC for Netezza Technology (Linux) 27 To override the locale for the installation (Linux) 28 Installing InfoSphere CDC for Netezza Technology using a silent installation 29 To perform a silent installation of InfoSphere CDC for Netezza Technology (Linux) 30 Upgrading InfoSphere CDC for Netezza Technology 31 To upgrade InfoSphere CDC for Netezza Technology (Linux) 33 Configuring InfoSphere CDC for Netezza Technology (Linux) 35 To add a new instance of InfoSphere CDC for Netezza Technology (Linux) 36 To edit an instance of InfoSphere CDC for Netezza Technology (Linux) 39 To delete an instance of InfoSphere CDC for Netezza Technology (Linux) 40 After you install and configure InfoSphere CDC for Netezza Technology 41 To start InfoSphere CDC for Netezza Technology (Linux) 42 To stop InfoSphere CDC for Netezza Technology (Linux) 43 Privileges required by users 44 Maintaining active TCP connections in a network environment 45 To maintain active TCP connections 46 InfoSphere CDC for Netezza Technology metadata tables 47 Data types supported by InfoSphere CDC for Netezza Technology 48 System parameters for InfoSphere CDC for Netezza Technology 49 Commands for InfoSphere CDC for Netezza Technology 50 Using the InfoSphere CDC for Netezza Technology commands 51 Setting the TSINSTANCE environment variable 52 Database transaction log commands 53 dmshowbookmark - Display Bookmark Information 54 Managing tables for replication commands 56 dmreassigntable - Update Target Table Definition 57 Monitoring replication commands 58

Upload: others

Post on 08-Apr-2022

5 views

Category:

Documents


0 download

TRANSCRIPT

InfoSphere CDC for Netezza databases /InfoSphere Data Replication for Netezza (Version 10.2) 3About InfoSphere CDC and InfoSphere CDC Management Console 6System requirements for InfoSphere CDC for Netezza Technology 9Hardware and software requirements 10Supported Netezza JDBC drivers 11Running in a virtualization environment 12Disk space requirements 13RAM requirements 14Port requirements 15Before you install InfoSphere CDC for Netezza Technology 16Required database, user accounts, and privileges 17Assessing disk space and memory requirements 18Understanding the importance of an appropriately configured disk subsystem 19Understanding the InfoSphere CDC memory footprint 20Considerations for the PureData System for Analytics appliance and InfoSphere CDC 21Preparing your environment for an installation of InfoSphere CDC for Netezza Technology 22Calculating database connections required by InfoSphere CDC for Netezza Technology 23Creating queues in JMS providers 24Installing or upgrading InfoSphere CDC for Netezza Technology 25Installing InfoSphere CDC for Netezza Technology 26To install InfoSphere CDC for Netezza Technology (Linux) 27To override the locale for the installation (Linux) 28Installing InfoSphere CDC for Netezza Technology using a silent installation 29To perform a silent installation of InfoSphere CDC for Netezza Technology (Linux) 30Upgrading InfoSphere CDC for Netezza Technology 31To upgrade InfoSphere CDC for Netezza Technology (Linux) 33Configuring InfoSphere CDC for Netezza Technology (Linux) 35To add a new instance of InfoSphere CDC for Netezza Technology (Linux) 36To edit an instance of InfoSphere CDC for Netezza Technology (Linux) 39To delete an instance of InfoSphere CDC for Netezza Technology (Linux) 40After you install and configure InfoSphere CDC for Netezza Technology 41To start InfoSphere CDC for Netezza Technology (Linux) 42To stop InfoSphere CDC for Netezza Technology (Linux) 43Privileges required by users 44Maintaining active TCP connections in a network environment 45To maintain active TCP connections 46InfoSphere CDC for Netezza Technology metadata tables 47Data types supported by InfoSphere CDC for Netezza Technology 48System parameters for InfoSphere CDC for Netezza Technology 49Commands for InfoSphere CDC for Netezza Technology 50Using the InfoSphere CDC for Netezza Technology commands 51Setting the TSINSTANCE environment variable 52Database transaction log commands 53dmshowbookmark - Display Bookmark Information 54Managing tables for replication commands 56dmreassigntable - Update Target Table Definition 57Monitoring replication commands 58

dmclearevents - Clear Events 59dmshowevents - Show Events 60Exporting and importing configuration commands 62dmexportconfiguration - Export InfoSphere CDC Configuration 63dmimportconfiguration - Import InfoSphere CDC Configuration 64Other commands 65dmbackupmd Back up metadata 66dmconfigurets - Configure InfoSphere CDC 67dmset - Set InfoSphere CDC System Parameter 68dmshowversion - Show InfoSphere CDC Version 69dmshutdown - Shut Down InfoSphere CDC 70dmsupportinfo - Collect IBM Support Information 72dmts64 - Start InfoSphere CDC 74dmmdcommander 75dmmdconsole 76User exits for InfoSphere CDC for Netezza Technology 77Sample Java class user exits for InfoSphere CDC 78To compile the sample Java class user exits (UNIX and Linux) 79InfoSphere CDC API reference - Javadocs 80Uninstalling InfoSphere CDC for Netezza Technology 81To uninstall InfoSphere CDC for Netezza Technology (Linux) 82Troubleshooting 83Using the IBM Support Assistant (ISA DC) 84To use ISA DC to collect data for a product problem (command line) 85To use ISA DC to collect data for a product problem (GUI) 88To use ISA DC to collect data for a question or an enhancement request (command line) 90To use ISA DC to collect data for a question or an enhancement request (GUI) 92Locating log files 93Troubleshooting and contacting IBM Support 94

-

-

-

-

IBM InfoSphere Change Data Capture, Version 10.2 About InfoSphere CDC and InfoSphere CDCManagement Console IBM®InfoSphere® Change Data Capture (InfoSphere CDC) is a replication solutionthat captures database changes as they happen and delivers them to targetdatabases, message queues, or an ETL solution such as InfoSphere DataStage®based on table mappings configured in the InfoSphere CDCManagement ConsoleGUI application. InfoSphere CDC provides low impact capture and fast delivery of data changes forkey information management initiatives including dynamic data warehousing, masterdata management, application consolidations or migrations, operational BI, andenabling SOA projects. InfoSphere CDC also helps reduce processing overheadsand network traffic by only sending the data that has changed. Replication can becarried out continuously or periodically. When data is transferred from a sourceserver, it can be remapped or transformed in the target environment. The following diagram illustrates the key components of InfoSphere CDC.

The key components of the InfoSphere CDC architecture are described below:

Access Server—Controls all of the non-command line access to the replicationenvironment. When you log in to Management Console, you are connecting toAccess Server. Access Server can be closed on the client workstation withoutaffecting active data replication activities between source and target servers.Admin API—Operates as an optional Java™-based programming interface thatyou can use to script operational configurations or interactions.Apply agent—Acts as the agent on the target that processes changes as sent bythe source.Command line interface—Allows you to administer datastores and useraccounts, as well as to perform administration scripting, independent ofManagement Console.

3

-

-

-

-

-

-

-

-

-

-

-

-

-

Communication Layer (TCP/IP)—Acts as the dedicated network connectionbetween the Source and the Target.Source and Target Datastore—Represents the data files and InfoSphere CDCinstances required for data replication. Each datastore represents a database towhich you want to connect and acts as a container for your tables. Tables madeavailable for replication are contained in a datastore. Management Console—Allows you to configure, monitor and manage replicationon various servers, specify replication parameters, and initiate refresh andmirroring operations from a client workstation. Management Console also allowsyou to monitor replication operations, latency, event messages, and other statisticssupported by the source or target datastore. The monitor in Management Consoleis intended for time-critical working environments that require continuous analysisof data movement. After you have set up replication, Management Console can beclosed on the client workstation without affecting active data replication activitiesbetween source and target servers.Metadata—Represents the information about the relevant tables, mappings,subscriptions, notifications, events, and other particulars of a data replicationinstance that you set up.Mirror—Performs the replication of changes to the target table or accumulation ofsource table changes used to replicate changes to the target table at a later time. Ifyou have implemented bidirectional replication in your environment, mirroring canoccur to and from both the source and target tables.Refresh—Performs the initial synchronization of the tables from the sourcedatabase to the target. This is read by the Refresh reader.Replication Engine—Serves to send and receive data. The process that sendsreplicated data is the Source Capture Engine and the process that receivesreplicated data is the Target Engine. An InfoSphere CDC instance can operate asa source capture engine and a target engine simultaneously.Single Scrape—Acts as a source-only log reader and a log parser component. Itchecks and analyzes the source database logs for all of the subscriptions on theselected datastore. Not all InfoSphere CDC engines use Single Scrape. ForInfoSphere CDC for DB2® for i, there is a Scraper job (that acts as a log reader)and a Mirror job that performs the function of mirroring (see Mirror above). Source transformation engine—Processes row filtering, critical columns, columnfiltering, encoding conversions, and other data to propagate to the target datastoreengine.Source database logs—Maintained by the source database for its own recoverypurposes. The InfoSphere CDC log reader inspects these in the mirroring process,but filters out the tables that are not in scope for replication. Target transformation engine—Processes data and value translations, encodingconversions, user exits, conflict detections, and other data on the target datastoreengine.

There are two types of target-only destinations for replication that are notdatabases:

JMS Messages—Acts as a JMS message destination (queue or topic) for row-level operations that are created as XML documents.InfoSphere DataStage—Processes changes delivered from InfoSphere CDC thatcan be used by InfoSphere DataStage jobs.

4

-

-

-

-

IBM InfoSphere Change Data Capture, Version 10.2 About InfoSphere CDC and InfoSphere CDCManagement Console IBM®InfoSphere® Change Data Capture (InfoSphere CDC) is a replication solutionthat captures database changes as they happen and delivers them to targetdatabases, message queues, or an ETL solution such as InfoSphere DataStage®based on table mappings configured in the InfoSphere CDCManagement ConsoleGUI application. InfoSphere CDC provides low impact capture and fast delivery of data changes forkey information management initiatives including dynamic data warehousing, masterdata management, application consolidations or migrations, operational BI, andenabling SOA projects. InfoSphere CDC also helps reduce processing overheadsand network traffic by only sending the data that has changed. Replication can becarried out continuously or periodically. When data is transferred from a sourceserver, it can be remapped or transformed in the target environment. The following diagram illustrates the key components of InfoSphere CDC.

The key components of the InfoSphere CDC architecture are described below:

Access Server—Controls all of the non-command line access to the replicationenvironment. When you log in to Management Console, you are connecting toAccess Server. Access Server can be closed on the client workstation withoutaffecting active data replication activities between source and target servers.Admin API—Operates as an optional Java™-based programming interface thatyou can use to script operational configurations or interactions.Apply agent—Acts as the agent on the target that processes changes as sent bythe source.Command line interface—Allows you to administer datastores and useraccounts, as well as to perform administration scripting, independent ofManagement Console.

6

-

-

-

-

-

-

-

-

-

-

-

-

-

Communication Layer (TCP/IP)—Acts as the dedicated network connectionbetween the Source and the Target.Source and Target Datastore—Represents the data files and InfoSphere CDCinstances required for data replication. Each datastore represents a database towhich you want to connect and acts as a container for your tables. Tables madeavailable for replication are contained in a datastore. Management Console—Allows you to configure, monitor and manage replicationon various servers, specify replication parameters, and initiate refresh andmirroring operations from a client workstation. Management Console also allowsyou to monitor replication operations, latency, event messages, and other statisticssupported by the source or target datastore. The monitor in Management Consoleis intended for time-critical working environments that require continuous analysisof data movement. After you have set up replication, Management Console can beclosed on the client workstation without affecting active data replication activitiesbetween source and target servers.Metadata—Represents the information about the relevant tables, mappings,subscriptions, notifications, events, and other particulars of a data replicationinstance that you set up.Mirror—Performs the replication of changes to the target table or accumulation ofsource table changes used to replicate changes to the target table at a later time. Ifyou have implemented bidirectional replication in your environment, mirroring canoccur to and from both the source and target tables.Refresh—Performs the initial synchronization of the tables from the sourcedatabase to the target. This is read by the Refresh reader.Replication Engine—Serves to send and receive data. The process that sendsreplicated data is the Source Capture Engine and the process that receivesreplicated data is the Target Engine. An InfoSphere CDC instance can operate asa source capture engine and a target engine simultaneously.Single Scrape—Acts as a source-only log reader and a log parser component. Itchecks and analyzes the source database logs for all of the subscriptions on theselected datastore. Not all InfoSphere CDC engines use Single Scrape. ForInfoSphere CDC for DB2® for i, there is a Scraper job (that acts as a log reader)and a Mirror job that performs the function of mirroring (see Mirror above). Source transformation engine—Processes row filtering, critical columns, columnfiltering, encoding conversions, and other data to propagate to the target datastoreengine.Source database logs—Maintained by the source database for its own recoverypurposes. The InfoSphere CDC log reader inspects these in the mirroring process,but filters out the tables that are not in scope for replication. Target transformation engine—Processes data and value translations, encodingconversions, user exits, conflict detections, and other data on the target datastoreengine.

There are two types of target-only destinations for replication that are notdatabases:

JMS Messages—Acts as a JMS message destination (queue or topic) for row-level operations that are created as XML documents.InfoSphere DataStage—Processes changes delivered from InfoSphere CDC thatcan be used by InfoSphere DataStage jobs.

7

IBM InfoSphere Change Data Capture, Version 10.2 Hardware and software requirements Click the following links to view hardware and software requirements forInfoSphere® CDC, Management Console, and Access Server: Linux, UNIX, Windows and System i® replication engines: https://ibm.biz/BdxyzE Mainframe replication engine: https://ibm.biz/Bdxyd5

10

IBM InfoSphere Change Data Capture, Version 10.2 Supported Netezza JDBC drivers

Netezza® JDBC driverNetezza JDBC driver, version 6.0.3 or above.

11

IBM InfoSphere Change Data Capture, Version 10.2 Running in a virtualization environment The InfoSphere® CDC products adhere to the Virtualization Policy for IBM®Software and can be run in any virtualization environment for only the supportedoperating systems and versions listed specifically within IBMInfoSphere DataReplication System Requirements. For more information on the policy, see http://www-01.ibm.com/software/support/virtualization_policy.html

12

IBM InfoSphere Change Data Capture, Version 10.2 Disk space requirements

Disk spaceInfoSphere® CDC target system:5 GB—For installation files, dataqueues, and log files.Global disk quota—Disk space is requiredon your target system for this quota which is used to store LOBdata received from your InfoSphere CDC source system. Theamount of disk space required is determined by your replicationenvironment and the amount of LOB data you are replicating. Toimprove performance, InfoSphere CDC will only persist LOB datato disk if RAM is not available on your target system. Use themirror_global_disk_quota_gb system parameter to configure theamount of disk space used by this quota.

13

IBM InfoSphere Change Data Capture, Version 10.2 RAM requirements

Although InfoSphere CDC memory requirements will fluctuate, you must work withyour system administrator to ensure the allocated memory for each instance of theproduct is available at all times. This may involve deployment planning since otherapplications with memory requirements may be installed on the same server withInfoSphere CDC. Using values other than the defaults or allocating more RAM thanis physically available on your server should only be undertaken after consideringthe impacts on product performance. Related concepts: Configuring InfoSphere CDC for Netezza Technology (Linux)

RAMEach instance of InfoSphere® CDC requires memory for theJava™ Virtual Machine (JVM). The following default values formemory are assigned:8192 MB of RAM—Default value for each 64-bit instance ofInfoSphere CDC. Use the InfoSphere CDC configuration tool toconfigure the memory for each instance of InfoSphere CDC.

14

IBM InfoSphere Change Data Capture, Version 10.2 Port requirements InfoSphere® CDC requires that you allocate a port for communication with clientworkstations running Management Console and other servers. The port must beaccessible through a firewall, although you do not require access to the Internet.

Related concepts: Maintaining active TCP connections in a network environment

Protocol Default port PurposeTCP 11301 Accepts connections

from:ManagementConsoleOtherinstallations ofInfoSphere CDC as asource ofreplicationCommandline utilities

15

-----

-

-

-

IBM InfoSphere Change Data Capture, Version 10.2 Before you install InfoSphere CDC for Netezza®Technology This section contains information on the tasks that you must complete beforeinstalling InfoSphere® CDC. This section assumes that you have met all of thehardware, software, database, and port requirements. You must complete all of thetasks below before installing InfoSphere CDC. In this section, you will learn:

Required database, user accounts, and privileges Assessing disk space and memory requirements Understanding the importance of an appropriately configured disk subsystem Understanding the InfoSphere CDC memory footprint Considerations for the PureData System for Analytics appliance and InfoSphereCDC Preparing your environment for an installation of InfoSphere CDC for NetezzaTechnology Calculating database connections required by InfoSphere CDC for NetezzaTechnology Creating queues in JMS providers

16

IBM InfoSphere Change Data Capture, Version 10.2 Required database, user accounts, and privileges Configuring a connection to the Netezza database When you configure InfoSphere® CDC, you are prompted for the hostname of theNetezza® database you want InfoSphere CDC to connect to and replicate data.Before installing InfoSphere CDC, ensure that this database exists and that youhave created and set up a database user that has access to it. Setting up a Linux user account When you are installing InfoSphere CDC on a Linux machine, you must set up anew, or decide on an existing Linux account that you will use to install, configure, orupgrade InfoSphere CDC. You can install InfoSphere CDC in the directory of yourchoice, however, it must be owned by the Linux account. Requirements for the Netezza database user account If the ROWSETLIMIT value for your Netezza user account is set too low, replicationperformance will be affected. In order for InfoSphere CDC to perform, the minimumacceptable value for ROWSETLIMIT is 1,000,000. To achieve optimumperformance, you should set this value to zero (which makes the value unlimited). Configuring a Netezza database When you configure InfoSphere CDC, you are prompted for the name of the Netezza database to which you want InfoSphere CDC to replicate data. Beforeinstalling InfoSphere CDC, ensure that this Netezza database exists and that youhave created and set up a database user that has access to it.

17

IBM InfoSphere Change Data Capture, Version 10.2 Assessing disk space and memory requirements InfoSphere® CDC requires disk space and memory when it processes change datafrom your source database. In order to process change data efficiently and replicatethese changes to your target system, it is very important that InfoSphere CDC hasadequate disk space and memory for each of the components described in thissection. Memory requirements for the JVM (Java Virtual Machine) As a Java-based product, InfoSphere CDC requires you to allocate the maximumamount of memory (RAM) to be used by the Java™ Virtual Machine (JVM). Thisprevents InfoSphere CDC from using all of the available memory on the systemwhere it is installed. The Maximum Memory Allowed value is set on a per-instance basis for eachinstance you create for your target database. In most cases the default values areappropriate for 64-bit instances. However, if your database is processing anextremely heavy workload, you may have to adjust the default values. The RAMallocated must be physically available on your system. Disk space requirements for the global disk quota Disk space is required on your target system for this quota which is used to store in-scope change data that has not been committed in your database. The amount ofdisk space required is determined by your replication environment and the workloadof your source database. You can configure the amount disk space that is allocated to this quota with themirror_global_disk_quota_gb system parameter. The default setting of this systemparameter is such that InfoSphere CDC will only stop replicating after this disk quotaexhausts all available disk space on your system. If you would prefer InfoSphereCDC to stop replicating after it uses a specific amount of disk space, you can specifythe value with this system parameter in Management Console.

18

IBM InfoSphere Change Data Capture, Version 10.2 Understanding the importance of an appropriatelyconfigured disk subsystem There are many types of disk subsystems in use to meet either business orperformance needs. Not all of these disk subsystems are suitable for use bydatabases or InfoSphere® Data Replication out of the box. Some may need to betuned to ensure that appropriate input/output semantics are in place for reliablecontinuous operation. Symptoms of an unreliable disk subsystem Without appropriate disk subsystem configuration, both the database itself orInfoSphere Data Replication may exhibit any of a wide variety of input/output relatederrors, usually random in nature. Any one of them can stop replication. If thedatabase transaction logs themselves become corrupted due to this kind ofmisconfiguration, then the database itself may become unrecoverable, putting theentire business at risk. Having an appropriately configured disk subsystem istherefore essential to the operation of both database and InfoSphere DataReplication. What makes a disk subsystem unreliable? Typically, disk mounting options that interfere with or modify the read visibility ofwrite operations are the ones which will cause data to be read inaccurately, therebycausing applications such as databases and InfoSphere Data Replication to reporterrors and fail. The expectations of these semantics between the database andInfoSphere Data Replication must be compatible with those provided by the optionsused to mount the disk subsystem in order to avoid corruption issues. Somedatabases exhibit specific behaviors only with certain disk subsystem types, soproper care and attention is needed to properly configure the disk subsystem. Special notes regarding specific configurations Direct I/O on Linux—Due to the nature of the implementation of direct I/O (directio)on Linux, applications that read from files being written using direct I/O must employexactly the same direct I/O options as the writing application. If this is not done, thereading application may not ever see the data written by the writing application andthe reading application can therefore exhibit a stall. Linux versions of InfoSphereCDC prior to version 6.5.1 Interim Fix 17 for Oracle, version 6.5.2 Interim Fix 20 forOracle, and InfoSphere Data Replication versions prior to 10.2 for Oracle andSybase can exhibit this behaviour under certain conditions. The best resolution is toupgrade to the latest Interim Fix level for InfoSphere CDC or to version 10.2 or laterfor InfoSphere Data Replication.

19

IBM InfoSphere Change Data Capture, Version 10.2 Understanding the InfoSphere CDC memoryfootprint Current® versions of InfoSphere® CDC on Linux, UNIX, and Windows platforms arewritten in the Java™ programming language. The memory specified in theInfoSphere CDC configuration tool refers to the amount of memory that the JavaVirtual Machine (JVM) will allocate to InfoSphere CDC to run. This memory is strictlyenforced by the JVM itself and the JVM will ensure that it is not exceeded. The JVM itself also consumes some memory. The amount of this other memoryvaries considerably by Java version, bit length, and operating system. A simple Javaprogram consumes 13212 KB of overhead when run in a 32-bit Java 1.5 JVM onAIX®, but 173509 KB of overhead when run in a 32-bit Java 1.5 JVM on Linux. Inother words, the overhead on Linux is 13 times larger than the overhead on AIX,when controlling for the other variables. The amount of memory overhead consumed by the JVM itself can also change overtime. This is especially true for Linux and UNIX systems. For those systems, oncethe operating system allocates memory to a process, it is not reclaimed until theprocess ends. Thus, the total amount of memory for any given process never goesdown. Given these factors, you should expect that more memory is used by InfoSphereCDC than is allocated in the configuration tool. InfoSphere CDC has no control overthis memory usage and cannot track or otherwise manage it.

20

-

-

-

IBM InfoSphere Change Data Capture, Version 10.2 Considerations for the PureData System forAnalytics appliance and InfoSphere CDC There are several factors you should consider when working with InfoSphere® CDCfor Netezza® Technology.

In order to maintain the operational efficiency of the PureData System for Analyticsappliance, InfoSphere CDC replicates rows in periodic bulk load operations. As anadministrator of InfoSphere CDC, you can expect a certain level of latency (inminutes) during replication because these operations are staged in memory beforethey are applied to the database. Latency may be higher when compared toInfoSphere CDC performance on other supported databases, however thethroughput of rows per hour may be similar or greater. If you want to control thelevel of latency in your environment, you can modify theacceptable_latency_in_minutes system parameter in Management Console. InfoSphere CDC for Netezza Technology attempts to apply data such that SQLquery performance against the target database is minimally affected. Replication oflarge volumes of changed data may affect query performance because of bulk loadoperations being performed on the database. Query performance may also beaffected by the level of acceptable latency you specify with theacceptable_latency_in_minutes system parameter.While the PureData System for Analytics appliance permits and maintains primarykey, default, foreign key, unique, and references, the appliance does not supportconstraint checks and referential integrity. Therefore, InfoSphere CDC will not beable to detect situations where the data has been changed either manually or byan application. As an administrator of InfoSphere CDC for Netezza Technology,you must ensure that there are no duplicate rows in the table and ensurereferential integrity in the data being replicated.

21

-

-

-

IBM InfoSphere Change Data Capture, Version 10.2 Preparing your environment for an installation ofInfoSphere CDC for Netezza Technology You need to allocate a Linux Red Hat version 5.3 or later machine on which you caninstall InfoSphere® CDC. This Linux machine should be close to the PureDataSystem for Analytics appliance in order for InfoSphere CDC to obtain a strongnetwork connection. Before you install, consider the following:

You can only install one instance of InfoSphere CDC for Netezza® Technology ona single Netezza database.InfoSphere CDC for Netezza Technology must not be installed on the PureDataSystem for Analytics appliance.InfoSphere CDC for Netezza Technology only supports 64 bit Linux operatingsystems.

The following diagram illustrates a typical topology for InfoSphere CDC for NetezzaTechnology product.

22

-

--

---

IBM InfoSphere Change Data Capture, Version 10.2 Calculating database connections required byInfoSphere CDC for Netezza Technology As an administrator, you may find it necessary to calculate how many databaseconnections are needed before installing InfoSphere® CDC on either a source or atarget database. Calculating the upper bound (both permanent and temporary)database connections will help you plan your environment so that it canaccommodate InfoSphere CDC. If you are installing InfoSphere CDC Event Server, InfoSphere CDC for InfoSphereDataStage®, InfoSphere CDC for Teradata, or InfoSphere CDC for Netezza®Technology, then you only need to calculate database connections for the targetdatabase as these products only replicate to target-only destinations. Calculating connections required by InfoSphere CDC on atarget database (4+G)*I + 3*SWhere:

G = number of Management Console GUI and CHCCLP scripting applications thatare connected to your instances of InfoSphere CDC.I = number of InfoSphere CDC instances.S = number of subscriptions in all of your InfoSphere CDC instances.

Example: How to calculate required connections for a targetdatabase You want to set up InfoSphere CDC in the target environment as follows:

1 installed Management Console GUI application.2InfoSphere CDC instances.3 subscriptions.

The number of connections required on the target database will be: (4 + 1)*2 + 3*3 = 19 You should plan for a maximum of 19 database connectionsbefore installing InfoSphere CDC on the target database.

23

IBM InfoSphere Change Data Capture, Version 10.2 Creating queues in JMS providers If you choose to use a JMS provider as the communications protocol forInfoSphere® CDC, you will need to define the queues to be used by InfoSphereCDC before you attempt to configure an instance. The queues will need to be named in the format CDC_<port>, where <port> is thefive digit TCP listening port number of the instance. You can left pad the numberwith zeroes if necessary to ensure five digits (example, CDC_00123). Each InfoSphere CDC instance will require its own queue. Instances cannot share aqueue. When you create the queue, you must ensure that they are defined to holdmessages of the type BytesMessage.

24

---

IBM InfoSphere Change Data Capture, Version 10.2 Installing or upgrading InfoSphere CDC forNetezza® Technology Before attempting to install or upgrade InfoSphere® CDC, consult the database,operating system and hardware requirements for the specific version of the softwarethat you want to install, to ensure that it is compatible with your system. If you are upgrading to a later version or installing a fix pack, an installation ofInfoSphere CDC must already be present in order to successfully complete theprocess. In this section, you will learn:

Installing InfoSphere CDC for Netezza Technology Installing InfoSphere CDC for Netezza Technology using a silent installation Upgrading InfoSphere CDC for Netezza Technology You can upgrade InfoSphere CDC by installing a later version of the software overtop of an existing installation.

Related concepts: Before you install InfoSphere CDC for Netezza Technology

25

--

IBM InfoSphere Change Data Capture, Version 10.2 Installing InfoSphere CDC for Netezza Technology You can install InfoSphere® CDC on a Linux server. The Linux machine must be version 5.3 or later and you must install InfoSphereCDC for Netezza® Technology on a 64 bit operating system. See also:

To install InfoSphere CDC for Netezza Technology (Linux) To override the locale for the installation (Linux)

26

1.2.

3.4.

5.

6.7.

8.9.

10.

IBM InfoSphere Change Data Capture, Version 10.2 To install InfoSphere CDC for Netezza® Technology(Linux)

Log on to the account you set up for InfoSphere® CDC.Copy the InfoSphere CDC installation file for your Linux platform from theInfoSphere CDC DVD or the download file.Make the installation binary file executable.Run the installation program by typing the following command:./<installation_binary_name>.bin If you already have InfoSphere CDC installed, the installation program willprompt you to upgrade. Press Enter on the Introduction screen to display the license agreement. Followthe instructions on the screen to navigate through the license agreement.To accept the license agreement, type 1.Enter the absolute path to your installation directory or press Enter to accept thedefault.Note: The directory that you specify must be owned by the account youare using for the installation. If the installation program cannot create thedirectory, you are prompted to specify a different directory. Review the installation summary. Press Enter to start the installation.After completing the installation, InfoSphere CDC gives you the option oflaunching the configuration tool for InfoSphere CDC.Type 1 to launch the configuration tool.

Note: If you have X-Windows installed, the installation program will launch theconfiguration tool in a graphical environment. Related concepts: Configuring InfoSphere CDC for Netezza Technology (Linux)

27

1.2.

----

-

IBM InfoSphere Change Data Capture, Version 10.2 To override the locale for the installation (Linux) Use the following procedure to override the locale for the installer. English,Japanese and Simplified Chinese are supported.

Navigate to the directory that contains the InfoSphere® CDC installation file.Start the installer with the following flags to override the locale of the installation:

English—<installation_file_name>.bin -l enJapanese—<installation_file_name>.bin -l jaKorean—<installation_file_name>.bin -l koSimplified Chinese—<installation_file_name>.bin -l zh_CN

where: <installation_file_name> is the name of the installation file.

After the installation is complete, you have the option of launching the InfoSphereCDC configuration tool. The configuration tool will use the locale settings for yoursystem.

28

-

IBM InfoSphere Change Data Capture, Version 10.2 Installing InfoSphere CDC for Netezza® Technologyusing a silent installation A silent installation allows you to automatically install InfoSphere® CDC byspecifying a command with various parameters. You can use this type of installationmethod for large-scale deployments of InfoSphere CDC by embedding the silentinstallation command in a script. See also:

To perform a silent installation of InfoSphere CDC for Netezza Technology (Linux)

29

1.2.

3.4.

-5.

-

IBM InfoSphere Change Data Capture, Version 10.2 To perform a silent installation of InfoSphere CDCfor Netezza® Technology (Linux)

Log on to the account you set up for InfoSphere® CDC.Copy the InfoSphere CDC installation binary from the InfoSphere CDC CD-ROMor download it from the InfoSphere CDC web site.Make the installation binary executable.Install InfoSphere CDC and generate a response file with the following command:<installation_binary_name> -r <response-file> where: <response-file> is the full path to the installation response file.

On another system, perform the silent installation by running the followingcommand:<installation_binary_name> -i silent -f <response-file> where: <response-file> is the full path to the installation response file.

30

-

-

-

-

-

--

-

IBM InfoSphere Change Data Capture, Version 10.2 Upgrading InfoSphere CDC for Netezza Technology You can upgrade InfoSphere® CDC by installing a later version of the software overtop of an existing installation. Interim fixes cannot be used to upgrade InfoSphere CDC to later versions. You mustfirst install the general availability (GA) release of the software for the later versionand accept the software license agreement, before applying any interim fixes. Afterthe interim fix has been installed, you can start the InfoSphere CDC instances andcomplete the upgrade. Before attempting to upgrade the software, you should be aware of the followingprerequisites:

All subscriptions in all InfoSphere CDC for Netezza® Technology instancesassociated with the installation to be upgraded must be stopped.All InfoSphere CDC for Netezza Technology instances associated with theinstallation must be stopped.When logging in, you must use the same account that was used during the originalinstallation of InfoSphere CDC for Netezza Technology.It is a best practice to backup the installation directory of the current InfoSphereCDC for Netezza Technology installation.It is a best practice to backup the InfoSphere CDC metadata tables (TS_AUTH,TS_BOOKMARK, TS_CONFAUD, and TS_DDLAUD) that are stored in theNetezza database instance that you are replicating to and from. In the event of afailure during the upgrade, having a backup of the metadata will allow you to revertto the point in time before the upgrade. In addition to the InfoSphere CDCmetadata tables stored in your database, InfoSphere CDC maintains some othermetadata in an internal database. It is a best practice to backup the InfoSphereCDC internal metadata at the same time as the InfoSphere CDC metadata tablesin your database are backed up. The dmbackup command can be used to backupthe internal InfoSphere CDC metadata tables.Do not upgrade InfoSphere CDC as a root user.The installation directory requires file system permissions of 700 to install theproduct, create and configure instances, or upgrade the product.

When upgrading an InfoSphere CDC replication engine, you must also upgradeManagement Console and Access Server to the same version or later to access thefull range of functionality that was introduced in the later version of the engine.Management Console and Access Server are backward-compatible and will supportthe functionality available in earlier versions of the engines. CAUTION: You cannot export and import subscriptions across different versions of InfoSphereCDC. Do not attempt to import a subscription file from a previous version into anupgraded version. Once the upgrade is complete, you should create a new exportedsubscriptions xml file. See also:

To upgrade InfoSphere CDC for Netezza Technology (Linux)

31

32

1.2.3.

4.

5.6.

7.8.

9.

10.11.

12.13.

14.15.

IBM InfoSphere Change Data Capture, Version 10.2 To upgrade InfoSphere CDC for Netezza®Technology (Linux)

Ensure that all subscriptions in all InfoSphere® CDC instances are stopped.Ensure that all InfoSphere CDC instances are stopped.Ensure that you have a backup of the TS_AUTH, TS_BOOKMARK,TS_CONFAUD, and TS_DDLAUD metadata tables that are stored in thedatabase instance that you are replicating to and from. In the event of a failureduring the upgrade, having a backup of the metadata will allow you to revert tothe point in time before the upgrade. In addition to the InfoSphere CDCmetadata tables stored in your database, InfoSphere CDC maintains some othermetadata in an internal database. It is a best practice to backup the InfoSphereCDC internal metadata at the same time as the InfoSphere CDC metadatatables in your database are backed up. The dmbackup command can be used tobackup the internal InfoSphere CDC metadata tables.Ensure that you have backed up your InfoSphere CDC installation directory.Important: The backup of the installation directory and the metadata tablesshould be from the same timeframe, so that they contain an identical snapshotof data. Log on to the account you set up for InfoSphere CDC.Copy the InfoSphere CDC installation file for the version to which you want toupgrade. This file is available on the InfoSphere CDC DVD or you can downloadthe desired version from the IBM® web site. Ensure that you have copied theinstallation file for the applicable operating system.Make the installation binary file executable.Run the installation program by typing the following command:./<installation_binary_name>.bin If you already have InfoSphere CDC installed, the installation program willprompt you to upgrade. Press Enter on the Introduction screen to display the license agreement. Followthe instructions on the screen to navigate through the license agreement.To accept the license agreement, type 1.Enter the absolute path to your installation directory or press Enter to accept thedefault.Note: The directory that you specify must be owned by the account youare using for the installation. If the installation program cannot create thedirectory, you are prompted to specify a different directory. Confirm the absolute path. If it is correct, type Y and press Enter. Type 1 to confirm that you want to upgrade the existing installation and pressEnter.Review the pre-upgrade summary. Press Enter to start the upgrade.After upgrading the software, you must start all the configured instances in orderto complete the upgrade process. Depending on the number of tables andsubscriptions configured, as well as the complexity of the mappings, the upgradeprocess can take anywhere from several minutes to hours. Once the upgradeprocess is complete, InfoSphere CDC will be ready for replication and willrestart.

33

34

---

IBM InfoSphere Change Data Capture, Version 10.2 Configuring InfoSphere CDC for Netezza®Technology (Linux) After installing InfoSphere® CDC, the installation program launches a configurationtool. The configuration tool allows you to configure one or more InfoSphere CDCinstances for your environment. You must configure InfoSphere CDC before you canstart replication. In this section, you will learn:

To add a new instance of InfoSphere CDC for Netezza Technology (Linux) To edit an instance of InfoSphere CDC for Netezza Technology (Linux) To delete an instance of InfoSphere CDC for Netezza Technology (Linux)

35

1.

2.

3.4.5.

6.7.

8.9.

A.

B.C.D.E.F.G.

H.I.J.

IBM InfoSphere Change Data Capture, Version 10.2 To add a new instance of InfoSphere CDC forNetezza Technology (Linux)

If you are configuring the first instance of InfoSphere® CDC after installation,you can proceed to Step 3 of this procedure.At the command prompt, launch the configuration tool by issuing the followingcommand in the specified directory:/<InfoSphere CDC Installation Directory>/bin/dmconfigurets At the welcome message, press Enter to continue.Enter the name of the instance you want to add and press Enter.Enter the port number which InfoSphere CDC uses for communication with clientworkstations running Management Console and other servers. InfoSphere CDCdisplays a default port of 11301. Press Enter.This port number cannot be usedby other applications installed on the same server. You will use this port numberwhen specifying access parameters for your datastore in the Access Managerperspective in Management Console. Press Enter to bypass auto-discovery. This feature is disabled by default.Enter the amount of physically available RAM that you want to allocate for thisinstance of InfoSphere CDC and press Enter. By default, the configuration toolallocates 8192 MB of RAM for each 64-bit instance.Note: Using values otherthan the defaults or allocating more RAM than is physically available on yourserver should only be undertaken after considering the impacts on productperformance. Specify the JDBC driver for the Netezza® database.If you want to use TCP/IP as the exclusive method of communication betweendatastores, enter n and press Enter. If you want to have the option to use eithera JMS provider or TCP/IP as the communications protocol, perform the followingsteps:A JMS provider should be used when characteristics of your networkprevent the existence of a long term, stable TCP/IP connection.

Ensure that a queue has been created by your system administrator and isnamed correctly. Each InfoSphere CDC instance that is to use a JMSmessage provider must have a queue named in the format CDC_<port>,where <port> is the five digit TCP listening port number of the instance. Youcan left pad the number with zeroes if necessary to ensure five digits(example, CDC_00123).Enter y and press Enter.Enter 2 to add a JMS provider. Enter the fully qualified path to your JMS provider .jar file and press Enter.Enter 4 and press Enter to complete the configuration of the JMS providers.Enter 1 to add a JMS connection.Enter a JMS remote connection factory name and press Enter. For example,jms/ConnectionFactory. A connection factory encapsulates a set ofconnection configuration parameters that has been defined by anadministrator. InfoSphere CDC uses this to create a connection with yourJMS provider.Enter the user name and press Enter.Enter the password to authenticate to the JMS server and press Enter.Enter the password a second time to confirm and press Enter.

36

K.L.M.N.O.P.Q.

R.

10.11.

12.

13.14.

15.

16.17.

A.

B.-

--

C.18.

Enter the JNDI initial context and press Enter.Enter the URL that is relative to the JNDI Initial Context and press Enter.Enter the user name for the JNDI Principal and press Enter.Enter the JNDI credentials password and press Enter.Enter the password a second time to confirm and press Enter.Press Enter again to return to the Engine Communication Connection menu.Press 5 if you want to verify the connection and then press Enter to return tothe Engine Communication Connection menu.If the JMS Provider is notconfigured correctly, InfoSphere CDC will use TCP/IP as the communicationprotocol between datastores. Enter 7 to complete the configuration of the engine communicationconnection.

Enter the hostname of the Netezza database and press Enter.Enter the name of the database that contains all of the tables for replication andpress Enter.Enter the port number of the database and press Enter. InfoSphere CDCdisplays a default port of 5480.Enter the user name for the database and press Enter.Enter the password for the specified database and press Enter.The configurationtool will now search the database for schemas. Enter the number that corresponds to the database schema used by InfoSphereCDC for metadata tables and press Enter. You can specify any schema exceptthose in use by other installed instances of InfoSphere CDC for the givendatabase.Note:InfoSphere CDC metadata tables contain important configurationinformation and should be backed up as part of your database backup strategy. Enter the absolute path for the refresh loader and press Enter.If InfoSphere CDC detects an unsupported encoding, an error message will bedisplayed and you will be asked to choose an alternate encoding.

Enter y to proceed. If you enter n and press Enter to cancel, the instance willnot be created. Enter a value to choose how the alternate encodings will be displayed:

1—Displays the available alternate encodings that are the closest match tothe database.2—Displays the available alternate encodings in order of byte length.3—Displays all available alternate encodings.

Enter the number for the encoding to be used and press Enter.The configuration tool creates the InfoSphere CDC instance and prompts you tostart the instance. Enter y to start the instance.The configuration tool will promptyou if your configuration is about to overwrite the metadata for an existinginstance.

Related concepts: Creating queues in JMS providers Related tasks: To start InfoSphere CDC for Netezza Technology (Linux)

37

1.

2.

3.

4.5.6.

7.

IBM InfoSphere Change Data Capture, Version 10.2 To edit an instance of InfoSphere CDC for Netezza®Technology (Linux)

Stop InfoSphere® CDC by using the dmshutdown command. You cannot edit aninstance that is running.At the command prompt, launch the configuration tool by issuing the followingcommand from the <InfoSphere CDC Installation Directory>/bin directory:./dmconfigurets Enter 1 and press Enter to list the installed instances of InfoSphere CDC. Recordthe name of the instance you want to modify.Press Enter to return to the main menu.Enter 3 and press Enter to modify an instance of InfoSphere CDC. Enter the number of the instance that you want to modify and press Enter.Theconfiguration tool allows you to edit a number of values that you specified whenadding an instance. After making your changes, enter 5 and press Enter to apply your changes andreturn to the main menu. Enter 6 and press Enter to discard your changes.

39

1.2.

3.

4.5.

IBM InfoSphere Change Data Capture, Version 10.2 To delete an instance of InfoSphere CDC forNetezza® Technology (Linux)

Stop InfoSphere® CDC by using the dmshutdown command.At the command prompt, launch the configuration tool by issuing the followingcommand from the <InfoSphere CDC installation directory>/bin directory:./dmconfigurets Enter 1 and press Enter to list the installed instances of InfoSphere CDC. Recordthe name of the instance you want to delete.Enter 4 and press Enter to delete an instance of InfoSphere CDC.Enter the instance name that you want to delete and press Enter.

40

---

IBM InfoSphere Change Data Capture, Version 10.2 After you install and configure InfoSphere CDC forNetezza® Technology Once you have installed and configured InfoSphere® CDC, you can start usingInfoSphere CDC. In this section, you will learn:

To start InfoSphere CDC for Netezza Technology (Linux) To stop InfoSphere CDC for Netezza Technology (Linux) Privileges required by users

41

-

IBM InfoSphere Change Data Capture, Version 10.2 To start InfoSphere® CDC for Netezza® Technology(Linux) Issue the following start command:

dmts64 - I <instance_name>

42

1.

2.

IBM InfoSphere Change Data Capture, Version 10.2 To stop InfoSphere CDC for Netezza® Technology(Linux)

End replication on all subscriptions in Management Console. For moreinformation on how to end replication on subscriptions, see your ManagementConsole documentation.Depending on how you want to stop InfoSphere® CDC, issue one of the followingstop commands in the bin directory in your InfoSphere CDC installation directory:

Option Descriptiondmshutdown [-I<instance_name>]

Use this command to gracefullyshut down InfoSphere CDC.

dmterminate [-L <locale>] Use this command to terminateall processes for all instancesrunning on a Linux server. Use this command when youcannot completely shut downInfoSphere CDC using thedmshutdown command.

43

------------

IBM InfoSphere Change Data Capture, Version 10.2 Privileges required by users Users must be granted the following Netezza® database privileges in order toreplicate with InfoSphere® CDC: The following privileges are required:

GRANT TRUNCATE ON TABLE GRANT LIST ON <databaseName>GRANT LIST, SELECT ON TABLEGRANT LIST, SELECT ON VIEWGRANT INSERT, UPDATE, DELETE, ALTER, DROP ON TABLE GRANT CREATE TABLEGRANT SELECT ON _T_DATABASEGRANT SELECT ON _T_ENVIRONGRANT SELECT ON _T_OBJECTGRANT SELECT ON _T_VT_PG_TIME_OFFSETGRANT SELECT ON _V_RELATION_KEYDATA_XDB GRANT SELECT ON _VT_TBLDICT

44

-

IBM InfoSphere Change Data Capture, Version 10.2 Maintaining active TCP connections in a networkenvironment If your deployment of InfoSphere® CDC is in a network environment that uses afirewall, VPN gateway, or local system tools to detect idle TCP connections, it maybe necessary to configure the product to prevent these connections from beingclosed during periods of application inactivity between the source and target. By default, InfoSphere CDC sends a message over TCP connections every 20seconds to ensure these connections remain active during periods of inactivity. Ifyour network policies close TCP connections for idle periods of less than 20seconds, you must change the configuration of each instance of InfoSphere CDC toensure the TCP connections remain open. See also:

To maintain active TCP connections

45

1.

2.3.

4.5.

IBM InfoSphere Change Data Capture, Version 10.2 To maintain active TCP connections

For each instance of InfoSphere® CDC, navigate to the following directory:UNIXor Linux: <CDC_installation_directory>/instance/<instance_name>/conf Open the comms.ini file in a text editor.Change the KEEP_ALIVE_TIMEOUT parameter to a value that is lower than thetime used to detect idle connections in your network. For example, if yournetwork disables idle TCP connections after 15 seconds, you can change theKEEP_ALIVE_TIMEOUT parameter to a value of 10 seconds:KEEP_ALIVE_TIMEOUT=10 Save the comms.ini file.For the changes to take effect, use the configuration tool to restart all instances ofInfoSphere CDC.

InfoSphere CDC will now send messages over the TCP connection every 10seconds.

46

--

IBM InfoSphere Change Data Capture, Version 10.2 InfoSphere CDC for Netezza® Technology metadatatables InfoSphere® CDC maintains a set of metadata tables that represent data about yourcurrent replication configuration. These tables are created in the database that youspecify in the configuration tool and should be part of the backup strategy for yourdatabase. InfoSphere CDC will not replicate these tables. Do not modify thecontents of these tables unless requested to do so by your IBM® representative. The names of the metadata tables created by InfoSphere CDC are as follows:

TS_AUTHTS_BKMARK

Related concepts: Configuring InfoSphere CDC for Netezza Technology (Linux)

47

IBM InfoSphere Change Data Capture, Version 10.2 Data types supported by InfoSphere CDC forNetezza Technology For information about data types supported by InfoSphere® CDC for Netezza®Technology, see Supported data types.

48

IBM InfoSphere Change Data Capture, Version 10.2 System parameters for InfoSphere CDC for NetezzaTechnology For information about system parameters for InfoSphere® CDC for Netezza®Technology, see System parameters for InfoSphere CDC for Netezza.

49

-------

IBM InfoSphere Change Data Capture, Version 10.2 Commands for InfoSphere CDC for Netezza®Technology This section discusses the commands available with InfoSphere® CDC. Using thesecommands you can control replication, manage your tables for replication, monitorreplication, and perform various other tasks. In this section, you will learn:

Using the InfoSphere CDC for Netezza Technology commands Setting the TSINSTANCE environment variable Database transaction log commands Managing tables for replication commands Monitoring replication commands Exporting and importing configuration commands Other commands

50

--

-

-

--

-

--

IBM InfoSphere Change Data Capture, Version 10.2 Using the InfoSphere CDC for Netezza® Technologycommands You can issue InfoSphere® CDC commands at a command line prompt or as part ofa batch file or shell script. Commands are located in the bin directory of yourInfoSphere CDC installation directory. You must run the commands from thisdirectory. Note: Use the -? flag to list the available parameters for a command and a shortdescription of each parameter. For example, dmstartmirror -?. Command formats For each command, the following items of information are provided:

Syntax—Identifies the name of the command and lists the command parameters.Parameters—Describes each parameter in the command and identifies the valuesthat can be specified.Result—Indicates the values that are returned by the command if it is successful.These values can be useful for scripting. This section also specifies the informationthat is displayed on the screen, if any, as a result of executing the command.Examples—Provides one or more examples of invoking the command.

Parameter formats Note the following conventions in the definition of the command parameters:

Angle brackets ( < > ) indicate a mandatory parameter.Square brackets ( [ ] ) indicate an optional parameter. If you omit the parameter,InfoSphere CDC uses a default value.A vertical bar ( | ) separating one or more parameters indicate that only one of theparameters in the list can be used. When one or more vertical bars appear in a listof parameters that is enclosed by square brackets [ ], the choices are limited to theparameters in the list, but you have the option to not specify any of the parameters.Ellipsis ( ... ) means that a parameter or option can be repeated more than once.You can issue the commands in Linux.

51

-

IBM InfoSphere Change Data Capture, Version 10.2 Setting the TSINSTANCE environment variable Before using InfoSphere® CDC commands, you can set the TSINSTANCEenvironment variable to the name of your InfoSphere CDC instance. After you set the TSINSTANCE environment variable, you no longer have to specifythe instance name when issuing commands. The following command is for kshell. You can run similar commands in other shells: export TSINSTANCE=<instance_name> where:

<instance_name> is the name of your InfoSphere CDC instance.

52

-

IBM InfoSphere Change Data Capture, Version 10.2 Database transaction log commands This section contains commands that help you manage your database transactionlog or bookmarks. See also:

dmshowbookmark - Display Bookmark Information

53

--

--

--

--

IBM InfoSphere Change Data Capture, Version 10.2 dmshowbookmark - Display Bookmark Information CAUTION: Improper use of this command in conjunction with the dmsetbookmark commandcan result in data loss or data duplication. You should only execute thedmsetbookmark command when directed by IBM Technical Support. Use this command on your InfoSphere® CDC target system to obtain the replicationposition (bookmark) in the stream of change data for a subscription. After generatingthe replication position information with this command, you can use thedmsetbookmark command on the source system to set the replication position for asubscription. Syntax dmshowbookmark [-I <INSTANCE_NAME>] -s <SOURCE_ID>

[-f <bookmark_file_name>] [-x <bookmark_file_name>] [-v] [-L <locale>] Parameters

-I <INSTANCE_NAME>The name of the InfoSphere CDC instance. You can set the TSINSTANCEenvironment variable to the name of your InfoSphere CDC instance. After this iscomplete, you no longer have to specify the instance when issuing commands.

-s <SOURCE_ID>Specifies the source ID of the subscription for which you want to obtain thereplication position (bookmark).Source IDs are automatically generated basedon truncating the subscription name to 8 characters during subscriptioncreation. Source IDs must be unique.

-f <bookmark_file_name>Specifies the name of the binary file that will be generated by this command.The generated file contains information about the replication position(bookmark) for the specified subscription. You can specify an absolute path for the location where you want to create thefile. If you do not specify an absolute path, the file is created in the InfoSphereCDC installation directory. Use the -f parameter in the dmsetbookmark command to read the binary filegenerated by this parameter. Note: Use the -x parameter if you are issuing this command from the target of aDB2® for LUW DPF source environment.

-x <bookmark_file_name>Specifies the name of the XML file that will be generated by this command. Thegenerated file contains information about the replication position (bookmark) forthe specified subscription. Use this parameter if you are replicating from a DB2for LUW DPF source environment. The XML file contains replication positions(bookmarks) for all partitions.You can specify an absolute path for the locationwhere you want to create the file. If you do not specify an absolute path, the fileis created in the InfoSphere CDC installation directory. Use the -f parameter in the dmsetbookmark command to read the XML filegenerated by this parameter.

54

--

--

-vDisplays verbose information about the replication position (bookmark),including a hexadecimal-encoded string. The amount of information displayeddepends on the type and version of the source engine. The hexadecimal-encoded string is always displayed. This parameter displays a subset of whatthe dmdecodebookmark command displays. If not specified, only ahexadecimal-encoded string is displayed.Note: Use the -x parameter if you areissuing this command from the target of a DB2 LUW DPF source environment.

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the command was successful and a non-zerovalue if the command fails. Examples dmshowbookmark -I MYINSTANCE -s MASTER -f bookmarkInfoSphere CDCobtains the replication position (bookmark) information for the specified instance andthe MASTER source ID. Replication position (bookmark) information is contained inthe bookmark binary file which will be placed in the InfoSphere CDC installationdirectory since no absolute path has been specified. dmshowbookmark -I MYINSTANCE -s FINANCE -x mybookmarksInfoSphere CDCobtains the replication position (bookmark) information for the specified instance andthe FINANCE source ID. Replication position (bookmark) information is contained inthe mybookmarks XML file which will be placed in the InfoSphere CDC installationdirectory since no absolute path has been specified.

55

-

IBM InfoSphere Change Data Capture, Version 10.2 Managing tables for replication commands This section contains commands that help you manage the tables that you want toreplicate with InfoSphere® CDC. See also:

dmreassigntable - Update Target Table Definition

56

--

--

--

--

IBM InfoSphere Change Data Capture, Version 10.2 dmreassigntable - Update Target Table Definition Use this command to update the definition of a target table in InfoSphere® CDCmetadata after you change the definition of the target table in your database. Syntax Parameters

-I <INSTANCE_NAME>Specifies the name of the InfoSphere CDC instance. Alternatively, you canspecify the TSINSTANCE environment variable in place of this value.

-s <SUBSCRIPTION_NAME>Specifies the subscription that contains the source table that is mapped to thetarget table which was updated in your database. To specify multiplesubscriptions, list the subscriptions separated by a space.

-ASpecifies that InfoSphere CDC updates definitions for all target tables in thesubscription.

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the operation was successful. If it fails, thiscommand returns a non-zero value. Example dmreassigntable -I NEWINSTANCE -s FINANCE -AInfoSphere CDC updatesdefinitions for all target tables in the Finance subscription.

57

--

IBM InfoSphere Change Data Capture, Version 10.2 Monitoring replication commands This section contains commands that help you monitor replication in InfoSphere®CDC. See also:

dmclearevents - Clear Events dmshowevents - Show Events

58

--

--

--

--

--

--

--

IBM InfoSphere Change Data Capture, Version 10.2 dmclearevents - Clear Events Use this command to delete events from the Event Log view in ManagementConsole. Syntax dmclearevents [-I <INSTANCE_NAME>] [-S|-T-|-B] <-A|-s <SUBSCRIPTION_NAME ...>

[-L <locale>] Parameters

-I <INSTANCE_NAME>Specifies the name of the InfoSphere® CDC instance. Alternatively, you canspecify the TSINSTANCE environment variable in place of this value.

-SSpecifies that InfoSphere CDC clears events from the source.

-TSpecifies that InfoSphere CDC clears events from the target.

-BSpecifies that InfoSphere CDC clears events from both the source and target. Ifnone of the S, T, and B options are specified, InfoSphere CDC assumes B bydefault.

-ASpecifies that InfoSphere CDC clears events for all subscriptions.

-s <SUBSCRIPTION_NAME>Specifies that InfoSphere CDC clears events for the indicated subscription. Tospecify multiple subscriptions, list the subscriptions separated by a space.

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the command was successful and a non-zerovalue if the command fails. Examples dmclearevents -I MYINSTANCE -S -AInfoSphere CDC clears events from thesource for all subscriptions for the specified instance. dmclearevents -I MYINSTANCE -B -s FINANCE MARKETINGInfoSphere CDCclears events from both the source and target for the Finance and Marketingsubscriptions for the specified instance.

59

--

--

--

--

--

--

--

--

IBM InfoSphere Change Data Capture, Version 10.2 dmshowevents - Show Events Use this command to display InfoSphere® CDC events to standard output. You canuse this command as an alternative to showing InfoSphere CDC events in theEvent Log view in Management Console. The output of this command shows events in chronological order with the mostrecent event shown first in the list. Syntax dmshowevents [-I <INSTANCE_NAME>] <-a|-s <SUBSCRIPTION_NAME> ...

|-t <SOURCE_ID> ...|-s <SUBSCRIPTION_NAME> ... -t <SOURCE_ID> ...> [-h] [-c max_msg]

[-L <locale>] or dmshowevents -I <INSTANCE_NAME> <-a|-s <SUBSCRIPTION_NAME>|-t

<SOURCE_ID>> ...> [-h] [-c <max_msg>] [-L <locale>] Parameters

-I <INSTANCE_NAME>Specifies the name of the InfoSphere CDC instance. Alternatively, you canspecify the TSINSTANCE environment variable in place of this value.

-aSpecifies that InfoSphere CDC shows events for all subscriptions.

-s <SUBSCRIPTION_NAME>Specifies the name of the subscription for which InfoSphere CDC displayssource events. To specify multiple subscriptions, list the subscriptionsseparated by a space.

-t <SOURCE_ID>Specifies the source ID of the subscription for which InfoSphere CDC displaystarget events. List the source IDs if you specify more than one.Source IDs areautomatically generated based on truncating the subscription name to 8characters during subscription creation. Source IDs must be unique.

-hSpecifies that InfoSphere CDC displays a header before the list of events. Thisoption helps you identify each item of information that is displayed for eachevent.

-c <max_msg>Specifies the maximum number of events that InfoSphere CDC displays. If youomit this parameter or you specify a value greater than the total number ofevents, InfoSphere CDC displays all events for the specified subscriptions andsource IDs.

Minimum Setting—0. No events are shown.Maximum Setting—2147483647

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the operation was successful. If it fails, this

60

command returns a non-zero value. Examples dmshowevents -I NEWINSTANCE -s FINANCEInfoSphere CDC displays all eventsfor the Finance subscription for the specified instance. dmshowevents -I MYINSTANCE –a –hInfoSphere CDC displays all events for allsubscriptions. A header is displayed before the list of events for the specifiedinstance. dmshowevents -I NEWINSTANCE –s FINANCE MARKETING –t ATLANTA –h –c20InfoSphere CDC displays the most recent 20 events for the Finance andMarketing subscriptions and for the Atlanta source ID. A header is displayed beforethe list of events for the specified instance. Sample output TIME|AGENTTYPE|SUBSCRIPTION|EVENTID|SEVERITY|EVENTPROGRAM|EVENTTEXT

2006-04-21 17:23:08.817|T|ATLANTA|95|Information|class com.datamirror.ts.target.

publication.c|IBM InfoSphere Change Data Capture Communications ending.

2006-04-21 17:23:08.614|T|ATLANTA|1538|Information|class com.datamirror.ts.target.

publication.c|---IBM InfoSphere Change Data Capture for ATLANTA terminating

normally.

2006-04-21 17:23:08.333|T|ATLANTA|1537|Information|class com.datamirror.ts.target.

publication.c|Describe conversation with ATLANTA completed successfully.

2006-04-21 17:23:07.911|T|ATLANTA|1536|Information|class com.datamirror.ts.target.

publication.c|Describe conversation started by ATLANTA.

2006-04-21 17:23:07.333|T|ATLANTA|1531|Information|class com.datamirror.ts.target.

publication.c|Communication with ATLANTA successfully started on Data channel.

2006-04-21 17:23:06.973|T|ATLANTA|1534|Information|class com.datamirror.ts.engine.a

|Code page conversation from the source database's code page 1252 to the target

database's code page Cp1252 for ATLANTA will be performed by the Remote system Fields in each record are separated by vertical bars ( | ). These fields are identifiedin the first line of the output. In the AGENTTYPE field, S indicates source and Tindicates target.

61

--

IBM InfoSphere Change Data Capture, Version 10.2 Exporting and importing configuration commands This section contains commands that allow you to export and/or import yourInfoSphere® CDC global configuration. See also:

dmexportconfiguration - Export InfoSphere CDC Configuration dmimportconfiguration - Import InfoSphere CDC Configuration

62

--

--

IBM InfoSphere Change Data Capture, Version 10.2 dmexportconfiguration - Export InfoSphere CDCConfiguration Use this command to export the configuration details of an installed instance ofInfoSphere® CDC. Configuration details are sent to an XML configuration file. Youcan use the dmimportconfiguration command to import the XML file that you createwith this command into another instance of InfoSphere CDC. Note: This command does not export subscription-specific settings that areconfigured in Management Console. Subscription-specific settings can be exportedto an XML file in Management Console. Note: This command is interactive and will prompt you for your password. Youcannot script this command. Syntax dmexportconfiguration <absolute_path_to_configuration_file> [-L <locale>] Parameters

<absolute_path_to_configuration_file>The absolute path to the XML configuration file that you want to export.

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the command was successful and a non-zerovalue if the command fails. Examples dmexportconfiguration c:\configuration.xmlInfoSphere CDC exports the XML file tothe specified absolute path. Related reference: dmimportconfiguration - Import InfoSphere CDC Configuration

63

--

--

IBM InfoSphere Change Data Capture, Version 10.2 dmimportconfiguration - Import InfoSphere CDCConfiguration Use this command to import the InfoSphere® CDC configuration settings from anXML file which you created with the dmexportconfiguration command. Syntax dmimportconfiguration <absolute_path_to_configuration_file> [-L <locale>] Parameters

<absolute_path_to_configuration_file>The absolute path to the XML configuration file that you are importing.

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the command was successful and a non-zerovalue if the command fails. Examples dmimportconfiguration c:\configuration.xml InfoSphere CDC imports the XML configuration file from the specified absolute path. Related reference: dmexportconfiguration - Export InfoSphere CDC Configuration

64

---------

IBM InfoSphere Change Data Capture, Version 10.2 Other commands This section contains miscellaneous commands that allow you to determine theversion of InfoSphere® CDC, verify communications, stop InfoSphere CDC, setsystem parameters, and back up your metadata. See also:

dmbackupmd Back up metadata dmconfigurets - Configure InfoSphere CDC dmset - Set InfoSphere CDC System Parameter dmshowversion - Show InfoSphere CDC Version dmshutdown - Shut Down InfoSphere CDC dmsupportinfo - Collect IBM Support Information dmts64 - Start InfoSphere CDC dmmdcommander dmmdconsole

65

--

--

IBM InfoSphere Change Data Capture, Version 10.2 dmbackupmd Back up metadata Use this command to create a backup of the InfoSphere® CDC metadata databasewhich contains information about your current replication configuration. You shouldalways back up your metadata when there are changes to your subscriptionconfiguration and table status. You can only back up your metadata whileInfoSphere CDC is running. The backup of the metadata database is created in <Installation_directory>\instance\<instance_name>\conf\backup. The files in the backup directory should bestored on separate media for possible recovery. Syntax dmbackupmd [-I <INSTANCE_NAME>] [-L <locale>] Parameters

-I <INSTANCE_NAME>Specifies the name of the InfoSphere CDC instance. Alternatively, you canspecify the TSINSTANCE environment variable in place of this value.

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the command was successful and a non-zerovalue if the command fails.

66

--

IBM InfoSphere Change Data Capture, Version 10.2 dmconfigurets - Configure InfoSphere CDC Use this command to launch the InfoSphere® CDC configuration tool. You can usethis tool to create instances and configure your installation of InfoSphere CDC. If the DISPLAY environment variable has been set, the configuration tool willattempt to launch the graphical user interface (GUI) version of the configuration toolwhen this command is issued. If you do not have the graphical libraries installed toview the GUI, you will need to ensure that the DISPLAY environment variable hasbeen cleared in order to launch the command line version. Syntax dmconfigurets [-L <locale>] Parameters

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the command was successful and a non-zerovalue if the command fails.

67

--

--

--

--

IBM InfoSphere Change Data Capture, Version 10.2 dmset - Set InfoSphere CDC System Parameter Use this command to view or change InfoSphere® CDC system parameters. Youcan also change system parameters in Management Console. Note: You can set any system parameter using this command. However, it will onlydisplay system parameters that are set to non-default values. Syntax dmset [-I <INSTANCE_NAME>] [<parameter_name>[=[<parameter_value>]]] [-L <locale>] Parameters

-I <INSTANCE_NAME>Specifies the name of the InfoSphere CDC instance. Alternatively, you canspecify the TSINSTANCE environment variable in place of this value.

<parameter_name>Specifies the name of the InfoSphere CDC system parameter.

<parameter_value>Specifies the value that you want to assign to the system parameter.

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the command was successful and a non-zerovalue if the command fails. Examples dmset -I MYINSTANCEDisplays all of the system parameters that are set to non-default values. dmset -I MYINSTANCE events_max_retain=20000Sets the events_max_retainsystem parameter to 20000. dmset -I MYINSTANCE events_max_retainDisplays the current value of thespecified parameter. dmset -I MYINSTANCE stop_replication=Deletes the stop_replication systemparameter.

68

--

IBM InfoSphere Change Data Capture, Version 10.2 dmshowversion - Show InfoSphere CDC Version Use this command to display the InfoSphere® CDC version and build number. Runthis command before you contact your IBM® representative. Syntax dmshowversion [-L <locale>] Parameters

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the operation was successful. If it fails, thiscommand returns a non-zero value.

69

--

--

--

--

IBM InfoSphere Change Data Capture, Version 10.2 dmshutdown - Shut Down InfoSphere CDC Use this command to stop an instance of InfoSphere® CDC. This command is oftenused prior to taking a server or database offline for maintenance purposes orupgrading InfoSphere CDC. Note: As a best practice before you run this command and to ensure that itcompletes successfully, use the dmendreplication command to end replication on allsubscriptions that use the instance specified in this command. To end replication on subscriptions that use the specified instance, you can use the–a parameter which will generate an error when forcefully ending replication onsubscriptions that use the instance as a target. If this command does not end InfoSphere CDC processes and stop the specifiedinstance, use the dmterminate command on the UNIX and Linux platform to force acomplete shut down. Syntax dmshutdown [-I <INSTANCE_NAME>] [-c|-i|-a] [-L <locale>] Parameters

-I <INSTANCE_NAME>Specifies the name of the InfoSphere CDC instance. Alternatively, you canspecify the TSINSTANCE environment variable in place of this value.

-cSpecifies that InfoSphere CDC stops the specified instance with the Normaloption. InfoSphere CDC will use this option by default if you do not specify -i or–a. The instance will only stop if no subscriptions are currently running.Normalis the most appropriate option for most business requirements and is thepreferred method for stopping an instance in most situations.

-iSpecifies that InfoSphere CDC stops the specified instance with the Immediateoption. The instance will only stop if no subscriptions are currently running.Starting the target instance after using this option can be slower than -c.

-aSpecifies that InfoSphere CDC stops the specified instance and endsreplication on all subscriptions that use the instance with the Abort option.Subscriptions that use the specified instance will end replication with an error.This option stops all in progress work and then ends replication rapidly. Startingreplication on subscriptions after using this option can be much slower thanusing -c. A refresh in progress will be interrupted and the target will stopprocessing any data that has not been committed before replication ends. Attention: Use this option if your business reasons require a rapid end toreplication and you are willing to tolerate a much slower start when you resumereplication on the specified subscriptions. A sudden business requirement for an unplanned shutdown of your sourcesystem may require this option for ending replication. Note: As a best practice, use the dmendreplication command to end replicationon all subscriptions that use the instance specified in this command.

70

--

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isthe locale of the machine where InfoSphere CDC is installed.

Result This command returns a value of 0 if the command was successful and a non-zerovalue if the command fails. Examples dmshutdown -I MYINSTANCE -cInfoSphere CDC stops the specified instance withthe Normal option. The instance will only stop if no subscriptions are currentlyrunning. dmshutdown -I MYINSTANCE –aInfoSphere CDC stops the specified instance andends replication with the Abort option for all subscriptions that use the instance.Subscriptions that use the specified instance will end replication with an error.

71

--

--

--

IBM InfoSphere Change Data Capture, Version 10.2 dmsupportinfo - Collect IBM Support Information Note: You should only run this command when the Management ConsoleSupportAssistant cannot connect to your InfoSphere® CDC datastore because it is notrunning or it will not run. Use this command (when requested by IBM® Technical Support) to collectInfoSphere CDC environment information in a generated .zip file that is used todiagnose and troubleshoot your support issue. Once the command has completed collecting information and generating the .zipfile, the output will display the full path and name of the .zip file. If you run thiscommand multiple times, the generated .zip files are numbered randomly. Note thatyou are responsible for deleting the generated .zip files when they are no longerrequired. Syntax dmsupportinfo [-I <INSTANCE_NAME>] [-t <"yyyy-MM-dd hh:mm:ss to yyyy-MM-dd hh:mm:ss">] [-L <locale>] Parameters

-I <INSTANCE_NAME>Specifies the name of the InfoSphere CDC instance. Alternatively, you canspecify the TSINSTANCE environment variable in place of this value.If you donot specify an instance (possibly because you could not create an instance),this command will only collect non-instance specific information.

-t <"yyyy-MM-dd hh:mm:ss to yyyy-MM-dd hh:mm:ss">Specifies the date and time range (relative to the time zone of the operatingsystem where you issue this command) used by InfoSphere CDC to retrieveenvironment information.Note: As a best practice, specify a date and time rangethat only captures the time period when you experienced problems. This allowsfor easier problem diagnosis and reduces the size of the files retrieved.

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the command was successful and a non-zerovalue if the command fails. Example dmsupportinfo -I PRODUCTION -t "2009-12-03 08:00:00 to 2009-12-03 12:00:00"Retrieves support information for the Production instance from 8:00 AM to 12:00 PMon December 3, 2009. This is the time range when you experienced support issueswith this instance of InfoSphere CDC. Related concepts: Troubleshooting and contacting IBM Support

72

73

--

--

IBM InfoSphere Change Data Capture, Version 10.2 dmts64 - Start InfoSphere CDC Use this command to start a 64-bit instance of InfoSphere® CDC. Syntax dmts64 [-I <INSTANCE_NAME>] [-L <locale>] Parameters

-I <INSTANCE_NAME>Specifies the InfoSphere CDC instance for which you want to start.

-L <locale>The name of the locale used for the InfoSphere CDC instance. The default isyour machine's locale.

Result This command returns a value of 0 if the command was successful and a non-zerovalue if the command fails. Examples dmts64 -I MYINSTANCEInfoSphere CDC starts for the specified instance.

74

IBM InfoSphere Change Data Capture, Version 10.2 dmmdcommander This command is for internal use only.

75

IBM InfoSphere Change Data Capture, Version 10.2 dmmdconsole This command is for internal use only.

76

-

--

IBM InfoSphere Change Data Capture, Version 10.2 User exits for InfoSphere CDC for NetezzaTechnology A user exit lets you define a set of actions that InfoSphere® CDC can run before orafter a database event occurs on a specified table. User exits allow you tocustomize your environment to meet your business requirements. After compilingthe user exit, you can specify the user exit in Management Console. Note: When developing user exits with InfoSphere CDC for Netezza® Technology,there is no capability provided to retrieve a connection to the target Netezzadatabase. InfoSphere CDC provides the following type of user exit:

Java class—This type of user exit utilizes the InfoSphere CDC API. For moreinformation, see the API reference Javadocs.

Sample Java™ class user exits are also provided with InfoSphere CDC. You canextend or modify these samples to suit your environment. In this section, you will learn:

Sample Java class user exits for InfoSphere CDC InfoSphere CDC API reference – Javadocs

77

-

-

-

-

-

-

-

-

IBM InfoSphere Change Data Capture, Version 10.2 Sample Java class user exits for InfoSphere CDC InfoSphere® CDC provides sample user exits that you can extend or modify to suityour environment. The samples are found in samples.jar, which is located in thesamples directory in your InfoSphere CDC installation directory. The Java™ filecontains the following samples:

DEUserExitSample.java—Used in expressions using the %USERFUNC columnfunction. It calculates the sum of the user-supplied parameters (in the expression)and returns the sum incremented by 1. This sample is located incom.datamirror.ts.derivedexpressionmanager.UserExitSample.java—Subscribes to replication events to retrieve the details ofthe events which took place. This sample is located incom.datamirror.ts.target.publication.userexit.sample.UserExitSample1.java—Records new rows inserted into a table on the target andstores them in a text file. The user specifies the name of the text file as aparameter. This sample is located incom.datamirror.ts.target.publication.userexit.sample.

Note the following: To run the sample user exits without modifying them, you must specify the fullyqualified path to the compiled user exit in Management Console. For example,com.datamirror.ts.target.publication.userexit.sample.UserExitSample.Compiled sample user exits are located in the ts.jar file which is found in the libdirectory in your InfoSphere CDC installation directory. Note that the compiled userexits in the ts.jar file have a *.class extension.If you want to modify the sample user exits, you must compile the user exit afteryou make changes to the source code.The user exit class must also be in the InfoSphere CDC runtime classpath.

For more information on how to specify Java class user exits in ManagementConsole, see your Management Console documentation. See also:

To compile the sample Java class user exits (UNIX and Linux)

78

1.2.

3.4.

5.

6.7.

IBM InfoSphere Change Data Capture, Version 10.2 To compile the sample Java class user exits (UNIXand Linux)

Stop InfoSphere® CDC.Unzip the samples.jar file into the lib directory in your InfoSphere CDC installationdirectory. Make sure you maintain the directory structure when unzipping the jarfile.After unzipping the jar file, you will have a directory structure like the following:<installation_directory>/lib/com/datamirror/ts/target

/publication/userexit/sample Make your changes to the sample user exit.Compile the modified user exit. For example, if you want to compileUserExitSample.java, open a command window, navigate to the lib directory andissue the following command:javac -classpath ts.jar:.com/datamirror/ts/target/publication/userexit/sample

/UserExitSample.java If this command runs successfully, there will be no output on your screen. Note: Your system must have the Java™ JDK to run this command. After running the command successfully, navigate to the following directory andconfirm that you have created a UserExitSample.class file:<installation_directory>/lib/com/datamirror/ts/target

/publication/userexit/sample Start InfoSphere CDC.The final step to configure the user exit is to specify the fully qualified path toUserExitSample in Management Console. For example:com.datamirror.ts.target.publication.userexit.sample.UserExitSample Note: Do not specify the .class extension.

For more information on how to specify Java class user exits in ManagementConsole, see your Management Console documentation. Note: If you plan to use the sample user exits in production environments, you willhave to test the samples before they are deployed. IBM® does not assumeresponsibility for adverse results caused by modified or customized user exitclasses.

79

--

IBM InfoSphere Change Data Capture, Version 10.2 InfoSphere CDC API reference – Javadocs The API reference is available in Javadoc format in your InfoSphere® CDCinstallation directory. To view the API reference, navigate to the api directory belowand click the index.html file to open the Javadoc documentation in your browser:

Windows—<InfoSphere CDC installation directory>\docs\apiUNIX—<InfoSphere CDC installation directory>/docs/api

80

-

IBM InfoSphere Change Data Capture, Version 10.2 Uninstalling InfoSphere CDC for Netezza®Technology This section provides step-by-step instructions on how to uninstall InfoSphere®CDC. In this section, you will learn:

To uninstall InfoSphere CDC for Netezza Technology (Linux)

81

1.2.

3.

4.5.6.

7.

IBM InfoSphere Change Data Capture, Version 10.2 To uninstall InfoSphere CDC for Netezza®Technology (Linux)

Stop InfoSphere® CDC by using the dmshutdown command.At the command prompt, launch the configuration tool by issuing the followingcommand from the <InfoSphere CDC installation directory>/bin directory:./dmconfigurets Enter 1 and press Enter to list the installed instances of InfoSphere CDC. Recordthe names of all these instances. Uninstalling InfoSphere CDC is simply deletingthe InfoSphere CDC instances.Enter 4 and press Enter to delete the first instance of InfoSphere CDC.Enter the instance name that you want to delete and press Enter.Repeat the above steps to delete all the InfoSphere CDC instances you recordedpreviously.Delete the InfoSphere CDC installation directory.

82

---

--

-

IBM InfoSphere Change Data Capture, Version 10.2 Troubleshooting If you encounter issues while running InfoSphere® CDC, you have a number ofoptions for tracking and troubleshooting issues to help with problem resolution. There are three methods that you can use in InfoSphere CDC for tracking andtroubleshooting issues:

Data Collection with the IBM® Support Assistant (ISA DC)Management Console Support AssistantThe dmsupportinfo command, which is executed on the replication engine

If you are trying to troubleshoot issues with InfoSphere CDC version 10.2 or later onLinux, UNIX and Windows operating systems, you should use the ISA DC toolunless otherwise instructed by IBM Technical Support. In this section, you will learn:

Using the IBM Support Assistant (ISA DC) Locating log files In addition to the Management Console event log, InfoSphere CDC producesother logs to help troubleshoot installation and replication errors.Troubleshooting and contacting IBM Support

83

-

-

--

---

-

IBM InfoSphere Change Data Capture, Version 10.2 Using the IBM Support Assistant (ISA DC) You can use the IBM® Support Assistant Data Collection tool (ISA DC) to collectInfoSphere® CDC data to provide to IBM Technical Support to assist you introubleshooting issues with InfoSphere CDC, to request a product enhancement orto ask a question about InfoSphere CDC. ISA DC can be used with InfoSphere CDC replication engines that are version 10.2or later, except InfoSphere CDC for z/OS®. The ISA DC tool is included in the InfoSphere CDC installation process, and doesnot require configuration. The executable files are located in the isa folder in theInfoSphere CDC directory. Simply run the isadc.bat, isadc.sh or index.html file, asappropriate, to launch the tool. Prerequisites and considerations for using ISA DC Prerequisites: The following prerequisite must be satisfied on the machine on which ISA DC will berun, in order to successfully use the tool:

IBM JRE/JDK version 1.6 or laterConsiderations: The following issues should be taken into consideration before you attempt to useISA DC:

ISA DC cannot be run remotely. It must be run on the machine where the instanceis configured.ISA DC cannot be used to collect data from InfoSphere CDC for z/OS.If InfoSphere CDC is installed but you have not configured an instance or areunable to configure an instance, ISA DC can still be used to collect minimal data toassist IBM Technical Support in resolving the issue.

See also:

To use ISA DC to collect data for a product problem (command line) To use ISA DC to collect data for a product problem (GUI) To use ISA DC to collect data for a question or an enhancement request(command line) To use ISA DC to collect data for a question or an enhancement request (GUI)

84

1.

2.

3.

4.5.

6.7.8.

9.

10.

11.

12.

13.

A.

IBM InfoSphere Change Data Capture, Version 10.2 To use ISA DC to collect data for a product problem(command line)

Launch the IBM® Support Assistant.Run the isadc.bat or isadc.sh file, located inthe isa\isadc folder in the root directory of the InfoSphere® CDC instance. Enter 1 to accept the license agreement and press Enter.After the licenseagreement has been accepted, it will not be shown again. Provide a file name and press Enter.The name provided will be given to the .zipfile containing the data collection results. If you do not want to assign a name to the data collection results, press Enterand a default name will be used. Enter 1 to confirm your chosen file name and press Enter to continue.Enter 1 to run the InfoSphere Change Data CaptureSupport Assistant DataCollector and press Enter.The Welcome page is displayed. Read the Welcome page information and enter 1 to proceed. Press Enter.Enter 1 to collect data for a product problem and press Enter.Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3 andpress Enter. Select the name of the InfoSphere CDC instance for which data will be collected.If you have multiple instances of InfoSphere CDC configured, you will be askedto select which instance for which you want to collect. Enter the correspondingnumber for the instance name and press Enter. If you have a single InfoSphere CDC instance configured, it will be selectedautomatically and this step will be skipped. Even if you do not have an instance configured, ISA DC will still collect whatdata is available. If no instance is configured, you can skip to Step 14. Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3 andpress Enter. If your selected instance is not running, you will be alerted by ISA DC. As onlyminimal data is available if the instance is stopped, it is preferable that theinstance be running during data collection.Try to start your instance. When theinstance is running, enter 1 and press Enter. If you cannot start your instance and want to continue the data collectionprocess, enter 2 and press Enter. Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3 andpress Enter. If the instance is running, you will be asked for information regarding when theproblem occurred.

Enter the date and time when you think the problem began and press Enter.This information must be entered in the following format: yyyy-mm-ddhh:mm:ss

85

B.

C.

D.

14.

-

-

-

-

15.

16.

17.A.

B.

C.

D.

Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3and press Enter. Determine the period of time for which the data will be collected and pressEnter.The amount specified will be applied as a before value and an aftervalue to the date and time specified previously. For example, if you select 1Day as the time period, data will be collect for 24 hours before the specifieddate and time and for the 24 hours after the specified date and time. Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3and press Enter.

Select the method to transfer the data collection archive file and press Enter.Choose one of the following options:

Send using secure transfer to IBM Support (HTTPS)—Sends the .zip file toIBM Support using a secure protocol.Send using FTP to IBM Support (unencrypted)—Sends the .zip file to IBMSupport using an unencrypted protocol.Send using FTP to another location (unencrypted)—Sends the .zip file to arecipient of your choice, using an unencrypted protocol.End the collection without sending—Ends the data collection and createsthe .zip file, but does not transfer it.

Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3 andpress Enter. If you chose to end the collection without sending the output, ISA DC will notifyyour when the .zip file has been successfully created. Enter 1 and press Enter toexit the application.If you chose to transfer the file using HTTPS, follow these steps:

If you want to receive a confirmation email when the upload was successful,enter an email address and press Enter. If you do not want to receiveconfirmation, press Enter to continue. Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3and press Enter. Enter the PMR number that was given to you by IBM Technical Support andpress Enter. Ensure that the PMR number follows the required namingconvention of PMRNumber.BranchNumber.CountryCode. If an unknownPMR number is entered, you will be asked to correct the PMR number andre-send the data. Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3and press Enter.

86

18.

A.

B.

19.A.B.C.D.E.

20.

If you chose to transfer the file to IBM Technical Support using unencryptedFTP, follow these steps:

Enter the PMR number that was given to you by IBM Technical Support andpress Enter. Ensure that the PMR number follows the required namingconvention of PMRNumber.BranchNumber.CountryCode. If an unknown PMRnumber is entered, you will be asked to correct the PMR number and re-sendthe data. Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3and press Enter.

If you chose to transfer the file using unencrypted FTP, follow these steps:Enter the FTP host name and press Enter.Enter the user name and press Enter.Enter the password for the user name and press Enter.Enter the path for the directory on the FTP server and press Enter.Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3and press Enter.

When you receive notice that the operation has completed successfully, enter 1and press Enter to exit the application.

87

1.

2.

3.4.5.6.7.

8.

9.

10.

-----

11.

12.

A.B.

IBM InfoSphere Change Data Capture, Version 10.2 To use ISA DC to collect data for a product problem(GUI)

Launch the IBM® Support Assistant.Run the index.html file, located in theisa\isadc folder in the root directory of the InfoSphere® CDC instance. Read the license agreement and click OK to accept it.After the licenseagreement has been accepted, it will not be shown again. Click Start.The Welcome screen opens. Click OK.Select A product problem from the drop down box.Click OK.Select the name of an InfoSphere CDC instance from the drop down list andclick OK.If you have multiple instances of InfoSphere CDC configured, you willbe asked to select which instance for which you want to collect. If you have a single InfoSphere CDC instance configured, it will be selectedautomatically and this step will be skipped. If your selected instance is not running, you will be alerted by ISA DC. As onlyminimal data is available if the instance is stopped, it is preferable that theinstance be running during data collection.Try to start your instance. When theinstance is running, select Yes, I have started the instance from the drop downbox and click OK. If you cannot start your instance and want to continue the data collectionprocess, select No, continue with minimal data collection from the drop downbox and click OK. If the instance is running, you will be asked for information regarding when theproblem occurred. Enter the date and time when you think the problem beganand click OK.This information must be entered in the following format: yyyy-mm-dd hh:mm:ss. Determine the period of time for which the data will be collected and click OK.Choose one of the following values:

6 hours12 hours1 Day2 Days7 Days

The amount specified will be applied as a before value and an after value to thedate and time specified previously. For example, if you select 1 Day as the timeperiod, data will be collect for 24 hours before the specified date and time andfor the 24 hours after the specified date and time. If you chose to end the collection without sending the output, select Do nottransfer data to IBM. ISA DC will notify you when the .zip file has beensuccessfully created.If you want to transfer the data to IBM using a secure transfer (HTTPS), selectthe Transfer to IBM option.

Choose the HTTPS transfer type option.Enter the PMR number that was given to you by IBM Technical Support.Ensure that the PMR number follows the required naming convention ofPMRNumber.BranchNumber.CountryCode. If an unknown PMR number is

88

C.D.

13.

A.B.

C.14.

A.

B.C.D.E.

15.

entered, you will be asked to correct the PMR number and re-send the data. Enter your email address.Click Transfer.

If you want to transfer the data to IBM using unencrypted FTP, select theTransfer to IBM option.

Choose the FTP transfer type option.Enter the PMR number that was given to you by IBM Technical Support.Ensure that the PMR number follows the required naming convention ofPMRNumber.BranchNumber.CountryCode. If an unknown PMR number isentered, you will be asked to correct the PMR number and re-send the data. Click Transfer.

If you choose to send the data to a location other than IBM using unencryptedFTP, click Transfer to another server via FTP

Enter the email address or IP address of the recipient in the Hotmail/IPAddress field.Enter the user name.Enter the password.Enter the path for the directory on the FTP server.Click Transfer.

When you receive notice that the operation has completed successfully, clickBrowse directory if you want to see the file you created or click Start NewCollection to collect more data.To exit the application, close your browser tab orwindow.

89

1.

2.

3.

4.5.

6.7.

8.

9.

-

-

-

-

10.

11.

12.A.

B.

C.

IBM InfoSphere Change Data Capture, Version 10.2 To use ISA DC to collect data for a question or anenhancement request (command line)

Launch the IBM® Support Assistant.Run the isadc.bat or isadc.sh file, located inthe isa\isadc folder in the root directory of the InfoSphere® CDC instance. Enter 1 to accept the license agreement and press Enter.After the licenseagreement has been accepted, it will not be shown again. Provide a file name and press Enter.The name provided will be given to the .zipfile containing the data collection results. If you do not want to assign a name to the data collection results, press Enterand a default name will be used. Enter 1 to confirm your chosen file name and press Enter to continue.Enter 1 to run the InfoSphere Change Data CaptureSupport Assistant DataCollector and press Enter.The Welcome page is displayed. Read the Welcome page information and enter 1 to proceed. Press Enter.Enter 2 to collect data for a question or an enhancement request and pressEnter.Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3 andpress Enter. Select the method to transfer the data collection archive file and press Enter.Choose one of the following options:

Send using secure transfer to IBM Support (HTTPS)—Sends the .zip file toIBM Support using a secure protocol.Send using FTP to IBM Support (unencrypted)—Sends the .zip file to IBMSupport using an unencrypted protocol.Send using FTP to another location (unencrypted)—Sends the .zip file to arecipient of your choice, using an unencrypted protocol.End the collection without sending—Ends the data collection and createsthe .zip file, but does not transfer it.

Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3 andpress Enter. If you chose to end the collection without sending the output, ISA DC will notifyyour when the .zip file has been successfully created. Enter 1 and press Enter toexit the application.If you chose to transfer the file using HTTPS, follow these steps:

If you want to receive a confirmation email when the upload was successful,enter an email address and press Enter. If you do not want to receiveconfirmation, press Enter to continue. Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3and press Enter. Enter the PMR number that was given to you by IBM Technical Support andpress Enter. Ensure that the PMR number follows the required naming

90

D.

13.

A.

B.

14.A.B.C.D.E.

15.

convention of PMRNumber.BranchNumber.CountryCode. If an unknownPMR number is entered, you will be asked to correct the PMR number andre-send the data. Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3and press Enter.

If you chose to transfer the file to IBM Technical Support using unencryptedFTP, follow these steps:

Enter the PMR number that was given to you by IBM Technical Support andpress Enter. Ensure that the PMR number follows the required namingconvention of PMRNumber.BranchNumber.CountryCode. If an unknown PMRnumber is entered, you will be asked to correct the PMR number and re-sendthe data. Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3and press Enter.

If you chose to transfer the file using unencrypted FTP, follow these steps:Enter the FTP host name and press Enter.Enter the user name and press Enter.Enter the password for the user name and press Enter.Enter the path for the directory on the FTP server and press Enter.Enter 1 to process your input and continue collecting data. Press Enter.If youwant to cancel the collection, enter 2 and press Enter. If you want to go back and change your input for the previous step, enter 3and press Enter.

When you receive notice that the operation has completed successfully, enter 1and press Enter to exit the application.

91

1.

2.

3.4.5.6.7.

8.

A.B.

C.D.

9.

A.B.

C.10.

A.

B.C.D.E.

11.

IBM InfoSphere Change Data Capture, Version 10.2 To use ISA DC to collect data for a question or anenhancement request (GUI)

Launch the IBM® Support Assistant.Run the index.html file, located in theisa\isadc folder in the root directory of the InfoSphere® CDC instance. Read the license agreement and click OK to accept it.After the licenseagreement has been accepted, it will not be shown again. Click Start.The Welcome screen opens. Click OK.Select A question or enhancement request from the drop down box.Click OK.If you chose to end the collection without sending the output, select Do nottransfer data to IBM. ISA DC will notify you when the .zip file has beensuccessfully created.If you want to transfer the data to IBM using a secure transfer (HTTPS), selectthe Transfer to IBM option.

Choose the HTTPS transfer type option.Enter the PMR number that was given to you by IBM Technical Support.Ensure that the PMR number follows the required naming convention ofPMRNumber.BranchNumber.CountryCode. If an unknown PMR number isentered, you will be asked to correct the PMR number and re-send the data. Enter your email address.Click Transfer.

If you want to transfer the data to IBM using unencrypted FTP, select theTransfer to IBM option.

Choose the FTP transfer type option.Enter the PMR number that was given to you by IBM Technical Support.Ensure that the PMR number follows the required naming convention ofPMRNumber.BranchNumber.CountryCode. If an unknown PMR number isentered, you will be asked to correct the PMR number and re-send the data. Click Transfer.

If you choose to send the data to a location other than IBM using unencryptedFTP, click Transfer to another server via FTP

Enter the email address or IP address of the recipient in the Hotmail/IPAddress field.Enter the user name.Enter the password.Enter the path for the directory on the FTP server.Click Transfer.

When you receive notice that the operation has completed successfully, clickBrowse directory if you want to see the file you created or click Start NewCollection to collect more data.To exit the application, close your browser tab orwindow.

92

-

-

-

-

IBM InfoSphere Change Data Capture, Version 10.2 Locating log files In addition to the Management Console event log, InfoSphere® CDC producesother logs to help troubleshoot installation and replication errors. Review the log files in the <CDC_installation directory>\Uninstall\Logs directoryif you encounter any errors during the installation of InfoSphere CDC. If you encounter replication errors or replication stops, review any of these tracelogs:

<CDC_installation_directory>/log—This directory contains information for anInfoSphere CDC problem. Refer to this directory if the problem is related toconfiguring an InfoSphere CDC instance. However, it is always useful to refer thisdirectory as well as the <CDC_installation_directory>/instance/<instance_name>/log directory to troubleshoot any problem.<CDC_installation_directory>/instance/<instance_name>/log—This directorystores trace files for a specific InfoSphere CDC instance. It is also useful to refer tothe <CDC_installation_directory>/instance/<instance_name>/log directory totroubleshoot your problem. When tracing has been enabled, the trace files will beenabled under <CDC_installation_directory>/instance/<instance_name>/log/on.<CDC_installation_directory>/instance/<instance_name>/tmp—This directorytemporarily stores data such as incomplete large transactions and large LOB datavalues.<CDC_installation_directory>/instance/<instance_name>/stagingstore—Thisdirectory stores sincle scrape staging store data that does not fit in memory. Whenan InfoSphere CDC instance is stopped normally, the contents of this staging storeare written to files that are stored in this directory.

93

-

-

IBM InfoSphere Change Data Capture, Version 10.2 Troubleshooting and contacting IBM Support The following support page contains the latest troubleshooting information anddetails on how to open a service request with IBM® Support:

http://www.ibm.com/software/data/infosphere/support/change-data-capture/ For contact information in your region:

http://www.ibm.com/planetwide/ Related reference: dmsupportinfo - Collect IBM Support Information

94