DataStage Certification 000-421 Sample Questions

1) Which two features of Data Sets make them suitable for job restart points? (Choose two.)

A. They are persistent.

B. They are indexed to improve access.

C. They are compressed to minimize storage space.

D. They use the same data types as the parallel framework.

Answer: A,D

2) Which three actions can improve sort performance in a DataStage job? (Choose three.)

A. Specify only the key columns which are necessary.

B. Minimize the number of sorts used within a job flow.

C. Adjust the “Restrict Memory Usage” option in the Sort stage.

D. Run the job sequentially so that only one sort process is invoked.

E. Use the stable-sort option to avoid the random ordering of non-key data.

Answer: A,B,C

3) The number of File Set data files created depends upon what three items? (Choose three.)

A. Schema definition of the file.

B. File system limitations.

C. Number of physical processing nodes.

D. Number of processing nodes in the default node pool.

E. Number of disks in the export or default disk pool connected to each processing node in the default node pool

Answer: B,D,E

4) You are about to begin major changes to jobs in a project. You want to conveniently identify job changes on an ad hoc basis. What two tasks will allow you to identify changes to your jobs?

(Choose two.)

A. Import the original job from a .dsx export.

B. Select the job,then right click Compare within.

C. Select the job,then right click Cross Project Compare.

D. Before making a change to a job make a copy of the job in a different category folder.

Answer: B,D

5) What would help debug the run-time environment?

A. $APT_VERBOSE

B. $OSH_PRINT_SCHEMAS

C. Check the message filters for the job.

D. View detailed runtime stats through Job monitor in Director Client.

Answer: B

6) Which two property areas must be configured when using the ODBC connector stage as a target in your job design? (Choose two.)

A. Define columns for the output link.

B. Specify the remote server property.

C. Specify properties for the input link.

D. Define the connection properties to an ODBC data source.

Answer: C,D

7) In which two situations is it appropriate to use a Sparse Lookup? (Choose two.)

A. When accessing DB2 data using the DB2 API stage.

B. When the output of the Lookup stage needs to be hashed partitioned.

C. When reference data is significantly larger than the streaming data (100:1).

D. When invoking a stored procedure within a database per row in the streaming link.

Answer: C,D

8) Which two partitioning methods require keys? (Choose two.)

A. Hash

B. Entire

C. Modulus

D. Round Robin

Answer: A,C

9) How must the input data set be organized for input into the join stage? (Choose two.)

A. Unsorted

B. Key partitioned

C. Hash partitioned

D. Entire partitioned

E. Sorted in ascending order

Answer: B,E

10) The Change Apply stage produces a change Data Set with a new column representing the code for the type of change. What are three change values identified by these code values? (Choose three.)

A. Copy

B. Edit

C. Final

D. Deleted

E. Remove Duplicates

Answer: A,B,D

11) You are setting up project defaults. Which three items can be set in DataStage Administrator? (Choose three.)

A. suite roles

B. default for compile options

C. defaults for environment variables

D. default for Runtime Column Propagation

E. default prompting options, such asAutosave job before compile

Answer: B,C,D

12) What are two advantages of using Runtime Column Propagation (RCP)? (Choose two.)

A. RCP is not a feature of DataStage.

B. Only columns used in the data flow need to be defined.

C. Only columns that are defined asVarChar need RCP enabled.

D. Sequential files don’t require schema files when using RCP.

E. Columns not specifically used in the flow are propagated as if they were.

Answer: B,E

13) You are using the Change Capture stage in your job design to identify changes made to the input link (before image) to obtain the results in the output link. This job will be using a multi-node configuration file when executed. By default, the Change Capture stage produces an output change link with which two characteristics? (Choose two.)

A. added column namedChange_Code

B. added column namedDifference_Value

C. Table Definition based on the after input link

D. Table Definition based on the keys of the input link

Answer: A,C

“Pass Any Exam. Any Time.” – 100% Guaranteed 28

14) A DataStage job uses an Inner Join to combine data from two source parallel datasets that were written to disk in sort order based on the join key columns. Which two methods could be used to dramatically improve performance of this job? (Choose two.)

A. Disable job monitoring.

B. Set the environment variable $APT_SORT_INSERTION_CHECK_ONLY.

C. Unset the Preserve Partitioning flag on the output of each parallel dataset.

D. Explicitly specify hash partitioning and sorting on each input to the Join stage.

E. Add a parallel sort stage before each Join input, specifying the “Don’t Sort, Previously Grouped”

sort key mode for each key.

Answer: B,E

15) You have been asked to assess the impact of replacing the DB2 Enterprise Stage with the DB2 Connector. You must identify all of the jobs and shared containers that use the DB2 Enterprise Stage. What two Advanced Find criteria would you set to identify the jobs and containers? (Choose two.)

A. Type

B. Where used

C. Dependent upon

D. Include Read only objects

Answer: A,B

Comments

comments

Leave a Reply