Environment Variables
Configuration via environment variables.
module env_vars
Environment variable configuration for SkyRL.
All environment variables used by SkyRL should be defined here for discoverability.
Attributes:
| Name | Type | Description |
|---|---|---|
| SKYRL_RAY_PG_TIMEOUT_IN_S | int | Timeout, in seconds, for allocating the placement group for different actors in SkyRL. |
| SKYRL_WORKER_NCCL_TIMEOUT_IN_S | int | Timeout, in seconds, for initializing the NCCL process group for the worker; defaults to 10 minutes. |
| SKYRL_VLLM_DP_PORT_OFFSET | int | Offset for the data parallel port of the vLLM server. |
| SKYRL_WAIT_UNTIL_INFERENCE_SERVER_HEALTHY_TIMEOUT_S | int | Timeout, in seconds, for waiting until the inference server is healthy. |
| SKYRL_LD_LIBRARY_PATH_EXPORT | bool | Whether to export the LD_LIBRARY_PATH environment variable from the driver to the workers with Ray's runtime env. |
| SKYRL_PYTHONPATH_EXPORT | bool | Whether to export the PYTHONPATH environment variable from the driver to the workers with Ray's runtime env. |
| SKYRL_DUMP_INFRA_LOG_TO_STDOUT | bool | When enabled, infrastructure logs (vLLM, Ray, workers) are shown on stdout instead of being redirected to the log file. |
attr SKYRL_RAY_PG_TIMEOUT_IN_S
SKYRL_RAY_PG_TIMEOUT_IN_S = int(os.environ.get('SKYRL_RAY_PG_TIMEOUT_IN_S', 180))
Timeout, in seconds, for allocating the placement group for different actors in SkyRL. Defaults to 180.
attr SKYRL_WORKER_NCCL_TIMEOUT_IN_S
SKYRL_WORKER_NCCL_TIMEOUT_IN_S = int(os.environ.get('SKYRL_WORKER_NCCL_TIMEOUT_IN_S', 600))
Timeout, in seconds, for initializing the NCCL process group for the worker. Defaults to 600 (10 minutes).
attr SKYRL_VLLM_DP_PORT_OFFSET
SKYRL_VLLM_DP_PORT_OFFSET = int(os.environ.get('SKYRL_VLLM_DP_PORT_OFFSET', 500))
Offset for the data parallel port of the vLLM server. Defaults to 500.
attr SKYRL_WAIT_UNTIL_INFERENCE_SERVER_HEALTHY_TIMEOUT_S
SKYRL_WAIT_UNTIL_INFERENCE_SERVER_HEALTHY_TIMEOUT_S = int(os.environ.get('SKYRL_WAIT_UNTIL_INFERENCE_SERVER_HEALTHY_TIMEOUT_S', 600))
Timeout, in seconds, for waiting until the inference server is healthy. Defaults to 600.
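The integer-valued settings above all follow the same `int(os.environ.get(NAME, default))` pattern. The sketch below illustrates what that pattern implies for callers; `read_int_env` is a hypothetical helper written for this example and is not part of SkyRL. Since the attributes appear to be plain module-level assignments, it is reasonable to assume the values are read once when the module is imported, so the variables should be set before SkyRL starts.

```python
import os


def read_int_env(name: str, default: int) -> int:
    """Hypothetical helper mirroring int(os.environ.get(name, default))."""
    return int(os.environ.get(name, default))


NAME = "SKYRL_RAY_PG_TIMEOUT_IN_S"

# Unset variable: the documented default is used.
os.environ.pop(NAME, None)
default_timeout = read_int_env(NAME, 180)

# Environment values are always strings; int() converts them.
os.environ[NAME] = "300"
custom_timeout = read_int_env(NAME, 180)

# A value with a unit suffix is not a valid integer and raises ValueError.
os.environ[NAME] = "5m"
try:
    read_int_env(NAME, 180)
    invalid_rejected = False
except ValueError:
    invalid_rejected = True

os.environ.pop(NAME, None)  # leave the environment clean
print(default_timeout, custom_timeout, invalid_rejected)
```

In practice this means timeouts must be given as bare integers (for example `300`, not `5m` or `300s`).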
attr SKYRL_LD_LIBRARY_PATH_EXPORT
SKYRL_LD_LIBRARY_PATH_EXPORT = str(os.environ.get('SKYRL_LD_LIBRARY_PATH_EXPORT', 'False')).lower() in ('true', '1', 'yes')
Whether to export the LD_LIBRARY_PATH environment variable from the driver to the workers with Ray's runtime env. Defaults to False.
For example, if you are using RDMA, you may need to customize LD_LIBRARY_PATH to include the RDMA libraries (for example, EFA on AWS).
attr SKYRL_PYTHONPATH_EXPORT
SKYRL_PYTHONPATH_EXPORT = str(os.environ.get('SKYRL_PYTHONPATH_EXPORT', 'False')).lower() in ('true', '1', 'yes')
Whether to export the PYTHONPATH environment variable from the driver to the workers with Ray's runtime env. Defaults to False.
See https://github.com/ray-project/ray/issues/56697 for details on why this is needed.
attr SKYRL_DUMP_INFRA_LOG_TO_STDOUT
SKYRL_DUMP_INFRA_LOG_TO_STDOUT = str(os.environ.get('SKYRL_DUMP_INFRA_LOG_TO_STDOUT', 'False')).lower() in ('true', '1', 'yes')
When enabled, infrastructure logs (vLLM, Ray, workers) are shown on stdout instead of being redirected to the log file. Useful for debugging startup issues.
Default: False (infrastructure logs go to file only, stdout shows training progress).
Set SKYRL_DUMP_INFRA_LOG_TO_STDOUT=1 to show all logs on stdout.
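The three boolean flags on this page share one parsing expression: the raw value is lowercased and compared against `('true', '1', 'yes')`. The sketch below shows which inputs that expression accepts; `read_bool_env` is a hypothetical helper that copies the expression for illustration and is not a SkyRL function.

```python
import os

# The accepted spellings, taken from the assignments shown on this page.
TRUTHY = ("true", "1", "yes")


def read_bool_env(name: str, default: str = "False") -> bool:
    """Hypothetical helper mirroring the flag-parsing expression above."""
    return str(os.environ.get(name, default)).lower() in TRUTHY


NAME = "SKYRL_DUMP_INFRA_LOG_TO_STDOUT"

# Matching is case-insensitive, but only the three listed spellings count.
results = {}
for raw in ("1", "TRUE", "Yes", "on", "0", ""):
    os.environ[NAME] = raw
    results[raw] = read_bool_env(NAME)

# Unset variable: falls back to 'False', so the flag is disabled.
os.environ.pop(NAME, None)
results["<unset>"] = read_bool_env(NAME)

for raw, enabled in results.items():
    print(f"{raw!r:10} -> {enabled}")
```

Note that values such as `on` or `enabled` are treated as False, and the value is not stripped of whitespace, so `" 1"` would also be False under this expression.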