APT_CONFIG_FILE is an environment variable that points to a configuration file (.apt file) which defines:
This file controls parallelism in Data Stage PX (Parallel Extender).A datastage job cannot run without .apt configuration file.
Example APT Config File (2-node)
This means the job will run on 2 nodes, in parallel.
Why is APT_CONFIG_FILE Important?
✔ Controls job parallelism
More nodes = higher performance.
✔ Controls resource usage
Disk paths, node pools, etc.
✔ Controls load distribution
E.g., partitioning, sorting, aggregations all depend on node definitions.
Where is APT_CONFIG_FILE set?
Usually set in:
-
Project → Properties → Environment Variables → APT_CONFIG_FILE
-
Or in DS job → Job Properties
-
Or inside UNIX shell before execution:
Q: What happens if APT_CONFIG_FILE has 4 nodes but your job has only 1 input row?
Job will still run in 4-way parallel, but only 1 node will get the row; others will be idle.
Q: How do you check which config file your job is using?
In Director → Right-click job → View Log → It shows the APT config file name in the startup message.
No comments:
Post a Comment