DataStage Interview Questions and Answers V1.5

This post is very useful for DataStage beginners.

# In which two situations would you use the Web Services Client stage?
You need the Web service to act as either a data source or a data target during an operation.
You do not need both input and output links in a single web service operation.

# You need to invoke a multi-instance-enabled job from the command line. What is the correct syntax to start a multi-instance job?
dsjob -run -mode NORMAL <project> <job>.<instance>

# A client must support multiple languages in selected text columns when reading from a DB2 database. Which two actions will allow selected columns to support such data?
Choose Unicode setting in the extended column attribute.
Choose NVar/NVarchar as data types.

# Which two system variables/techniques must be used in a parallel Transformer derivation to generate a unique sequence of integers across partitions?
@PARTITIONNUM
@NUMPARTITIONS
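
A common way to combine the two variables is the derivation (@INROWNUM - 1) * @NUMPARTITIONS + @PARTITIONNUM + 1. The block below is a minimal Python simulation of that arithmetic, not Transformer syntax. It assumes @INROWNUM (the per-partition row counter, starting at 1) is also available, which the answer above does not list; the function name unique_id is made up for illustration.

```python
def unique_id(partition_num: int, num_partitions: int, in_row_num: int) -> int:
    """Mimic (@INROWNUM - 1) * @NUMPARTITIONS + @PARTITIONNUM + 1.

    partition_num is zero-based; in_row_num is one-based within its partition.
    """
    return (in_row_num - 1) * num_partitions + partition_num + 1


# Simulate 3 partitions, each numbering its own 4 rows independently.
num_partitions = 3
ids = [
    unique_id(p, num_partitions, row)
    for p in range(num_partitions)
    for row in range(1, 5)
]
print(sorted(ids))
```

Each partition steps through its own residue class modulo the partition count, so no two partitions can produce the same value. If partitions receive different row counts, the values stay unique but the overall sequence will have gaps.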

# You are experiencing performance issues for a given job. You are assigned the task of understanding what is happening at run time for the given job. What are the first two steps you should take to understand the job performance issues?
Run job with $APT_TRACE_RUN set to true.
Review the objectives of the job.

# Your customer asks you to identify which stages in a job are consuming the largest amount of CPU time. Which product feature would help identify these stages?
$APT_PM_PLAYER_TIMING

# Unix command to stop the DataStage engine?
bin/uv -admin -stop

# Unix command to start the DataStage engine?
bin/uv -admin -start

# Unix command to check DataStage jobs running on the server?
ps -ef | grep phantom

# Unix command to check DataStage sessions running at the back end?
netstat -na | grep dsr
netstat -a | grep dsr
netstat -a | grep dsrpc

# How do you unlock a DataStage job?
Cleanup Resources in Director
Clear Status File in Director
DS.Tools in Administrator
DS.Tools in Unix

# Command to check the DataStage job status?
dsjob -status

# What are the parts of a configuration file?
Node
ServerName
Pools
FastName
ResourceDisk

# Where are DataStage temporary dataset files stored while a parallel job is running?
ResourceScratchDisk
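
To show how the parts named in the two answers above fit together, here is a small Python sketch that renders a two-node parallel configuration file. The node names, host name, and directory paths are invented for illustration, and the helper functions are hypothetical; only the keywords (node, fastname, pools, resource disk, resource scratchdisk) come from the configuration file format itself.

```python
def render_node(name: str, fastname: str, disk: str, scratch: str) -> str:
    """Render one node entry of a parallel configuration file."""
    return (
        f'  node "{name}"\n'
        "  {\n"
        f'    fastname "{fastname}"\n'
        '    pools ""\n'
        f'    resource disk "{disk}" {{pools ""}}\n'
        f'    resource scratchdisk "{scratch}" {{pools ""}}\n'
        "  }\n"
    )


def render_config(nodes: list) -> str:
    """Wrap the node entries in the outer braces of the file."""
    return "{\n" + "".join(render_node(**n) for n in nodes) + "}\n"


# Hypothetical two-node layout on a single host.
config_text = render_config([
    {"name": "node1", "fastname": "etlhost",
     "disk": "/data/node1", "scratch": "/scratch/node1"},
    {"name": "node2", "fastname": "etlhost",
     "disk": "/data/node2", "scratch": "/scratch/node2"},
])
print(config_text)
```

The resource disk entries are where persistent data set files live, while the resource scratchdisk entries hold temporary files such as sort spill and buffer overflow.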

# Which three statements describe a DataStage installation in a clustered environment?
The conductor node will create the job score and consolidate messages to the DataStage log.
For clustered implementations, appropriate node pools are useful to reduce data shipping.
Compiled job and all shared components must be available across all servers.

# Which three defaults are set in DataStage Administrator?
Project level defaults for environment variables.
Project level defaults for compile options.
Project level default for Runtime Column Propagation.

# Which two environment variables should be set to "True" to allow you to see operator process statistics at run-time in the job log?
$APT_PM_PLAYER_MEMORY
$APT_PM_PLAYER_TIMING

# Which three statements are true about National Language Support (NLS)?
NLS must be selected during installation to use it.
Within an NLS enabled DataStage environment, maps are used to convert external data into UTF-16.
Reading or writing 7-bit ASCII data from a database does not require NLS support.

# Upon which two conditions does the number of data files created by a File Set depend?
The number of processing nodes in the default node pool.
The number of disks in the export or default disk pool connected to each processing node in the default node pool.

# Which command line switch can be used to return the most recent start time for a given job?
dsjob -jobinfo

# You are working on a project that contains a large number of jobs contained in many folders. You would like to review the jobs created by the former developer of the project. How can you find these jobs?
Use the Advanced Find feature contained in the Designer interface.

# Which two statements are true for named node pools?
Using appropriately named node pools can allow separation of buffering from sorting disks.
Using appropriately named node pools constraints will limit stages to be executed only on the nodes defined in the node pools.

# Which three methods can be used to import metadata from a Web Services Description Language (WSDL) document?
Web Service Function Definitions
XML Table Definitions
Web Services WSDL Definitions

# What are two tasks that can create DataStage projects?
Install the DataStage engine.
Add new projects from DataStage Administrator.

# Upon which two conditions does the number of data files created by a Data Set depend?
The number of processing nodes in the default node pool.
The number of disks in the export or default disk pool connected to each processing node in the default node pool.

# Which requirement must be met to read from a database in parallel using the ODBC Enterprise stage?
Specify the partition col property.

# Which statements are true for $APT_DISABLE_COMBINATION?
Disabling generates more processes requiring more system resources and memory.
Globally disables operator combining.

# Which technique would you use to abort a job from a Transformer stage?
Create a dummy output link with a constraint that tests for the condition to abort on, and set the "Abort After Rows" property to 1.

# The dsrpcd daemon is the means by which processes that represent DataStage jobs are started. The environment that DataStage processes inherit when they are started is the same environment as for dsrpcd. ODBC drivers and some plug-ins require that certain directories are included in the shared library environment variable setting for dsrpcd.

# You set environment variable $APT_ORACLE_LOAD_OPTIONS='OPTIONS(DIRECT=TRUE, PARALLEL=TRUE)' for loading index organized tables. Which statement is accurate regarding the resulting effect of this environment variable setting?
Oracle load will fail when executed.

# A job design consists of an input Sequential File stage, a Modify stage, followed by a Filter stage and an output Sequential File stage. The job is run on an SMP machine with a configuration file defined with three nodes. No environment variables were set for the job. How many osh processes will this job create?
9
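
One plausible way to arrive at 9, offered as an accounting sketch rather than an official formula: one conductor process, one section leader per node, and one player per operator per node on which it runs. The sketch assumes that default operator combination folds Modify and Filter into a single parallel operator, and that each Sequential File stage reads or writes a single file and therefore runs as one sequential player. The function name and its parameters are made up for illustration.

```python
def osh_process_count(nodes: int, parallel_operators: int,
                      sequential_operators: int) -> int:
    """Count osh processes as conductor + section leaders + players."""
    conductor = 1
    section_leaders = nodes  # one per node in the configuration file
    # A parallel operator gets one player per node; a sequential one gets one.
    players = parallel_operators * nodes + sequential_operators
    return conductor + section_leaders + players


# Three nodes, Modify+Filter combined into one parallel operator (assumed),
# plus the sequential read and the sequential write.
print(osh_process_count(nodes=3, parallel_operators=1, sequential_operators=2))
```

Under the same assumptions, setting $APT_DISABLE_COMBINATION would keep Modify and Filter as separate operators and raise the count.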

# Using a DB2 for z/OS source database, a 200 million row source table with 30 million distinct values must be aggregated to calculate the average value of two column attributes. What would provide optimal performance while satisfying the business requirements?
Select all source rows using a DB2 API stage. Aggregate using a Sort Aggregator.

# In your DB2 database you have column names that use characters # and $. Which two steps should be completed to allow DataStage to convert these characters into internal format and back as necessary?
Set environment variable $DS_ENABLE_RESERVED_CHAR_CONVERT to true.
Avoid using the strings __035__ and __036__ in your IBM DB2 column names.
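
The second answer follows from how the conversion works: 035 and 036 are the decimal ASCII codes of # and $, and the internal form wraps each code in double underscores. The Python sketch below imitates that round trip to show why a column name that already contains one of those strings is a problem. It is an illustration of the idea only, not DataStage's actual implementation, and the function names are invented.

```python
# Reserved characters and the internal strings they are assumed to map to.
RESERVED = {"#": "__035__", "$": "__036__"}


def to_internal(column_name: str) -> str:
    """Replace # and $ with their internal placeholders."""
    for char, placeholder in RESERVED.items():
        column_name = column_name.replace(char, placeholder)
    return column_name


def to_external(column_name: str) -> str:
    """Convert internal placeholders back to # and $."""
    for char, placeholder in RESERVED.items():
        column_name = column_name.replace(placeholder, char)
    return column_name


print(to_internal("COST$"))          # COST__036__
print(to_external("COST__036__"))    # COST$
# A column literally named ITEM__035__NO is changed on the way back out.
print(to_external(to_internal("ITEM__035__NO")))  # ITEM#NO
```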

# When invoking a job from a third-party scheduler, it is often desirable to invoke a job and wait for its completion in order to return the job's completion status. Which three commands would invoke a job named "BuildWarehouse" in project DevProject and wait for the job's completion?
dsjob -run -jobstatus DevProject BuildWarehouse
dsjob -run -userstatus DevProject BuildWarehouse
dsjob -run -wait DevProject BuildWarehouse
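
A scheduler wrapper built on these commands might look like the Python sketch below. It builds the dsjob argument list and interprets the exit code returned with -jobstatus. The mapping of exit codes (1 finished OK, 2 finished with warnings, 3 aborted) is an assumption that should be checked against the dsjob documentation for your release, and the helper names are hypothetical.

```python
import subprocess

# Assumed meaning of dsjob exit codes when -jobstatus is used; verify locally.
JOB_STATUS = {1: "finished OK", 2: "finished with warnings", 3: "aborted"}

WAIT_OPTIONS = ("-jobstatus", "-userstatus", "-wait")


def build_run_command(project: str, job: str,
                      wait_option: str = "-jobstatus") -> list:
    """Build the dsjob argument list for a run that waits for completion."""
    if wait_option not in WAIT_OPTIONS:
        raise ValueError(f"unsupported wait option: {wait_option}")
    return ["dsjob", "-run", wait_option, project, job]


def describe_exit_code(code: int) -> str:
    """Translate a dsjob exit code into a readable status."""
    return JOB_STATUS.get(code, f"other status ({code})")


def run_and_wait(project: str, job: str) -> str:
    """Run the job, block until dsjob returns, and describe the result."""
    result = subprocess.run(build_run_command(project, job))
    return describe_exit_code(result.returncode)


print(" ".join(build_run_command("DevProject", "BuildWarehouse")))
```

Calling run_and_wait("DevProject", "BuildWarehouse") requires dsjob on the PATH of the engine host. Note that under this assumed mapping a successful run returns a non-zero exit code, so a scheduler that treats any non-zero code as failure needs a translation step like this one.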

# What is the default Array Size in the OCI stage?
32767

# What is the default cache size of DataStage?
256MB

# Which tasks are part of managing active sessions in DataStage?
Viewing all active sessions
Setting session limits
Opening user details
Disconnecting a session

# What can you do from the Administrator client?
Set up user permissions for projects
Purge job log file
Set Environment variable default value
Add, delete, and move InfoSphere® DataStage® projects

# In which two situations would you not use the Web Services Client stage?
You want to deploy a service.
You need to create a WSDL.

# Which two actions can improve sort performance in a DataStage job?
Specify only the key columns which are necessary.
Minimize the number of sorts used within a job flow.
Adjusting the "Restrict Memory Usage" option in the Sort stage.

# You have created a parallel job in which there are several stages that you want to be able to re-use in other jobs. You decided to create a parallel shared container from these stages. Identify two things that are False about this shared container.
It can be used in sequencer jobs.
It can be used in Transformer stage derivations.