SkyRL
API ReferenceSkyRL

Environment Variables

Configuration via environment variables.

module env_vars

Environment variable configuration for SkyRL.

All environment variables used by SkyRL should be defined here for discoverability.

Attributes:

NameTypeDescription
SKYRL_RAY_PG_TIMEOUT_IN_STimeout for allocating the placement group for different actors in SkyRL.
SKYRL_WORKER_NCCL_TIMEOUT_IN_STimeout for initializing the NCCL process group for the worker, defaults to 10 minutes.
SKYRL_VLLM_DP_PORT_OFFSETOffset for the data parallel port of the vLLM server.
SKYRL_WAIT_UNTIL_INFERENCE_SERVER_HEALTHY_TIMEOUT_STimeout for waiting until the inference server is healthy.
SKYRL_LD_LIBRARY_PATH_EXPORTWhether to export LD_LIBRARY_PATH environment variable from the driver to the workers with Ray's runtime env.
SKYRL_PYTHONPATH_EXPORTWhether to export PYTHONPATH environment variable from the driver to the workers with Ray's runtime env.
SKYRL_DUMP_INFRA_LOG_TO_STDOUTWhen enabled, infrastructure logs (vLLM, Ray, workers) are shown on stdout

attr SKYRL_RAY_PG_TIMEOUT_IN_S

SKYRL_RAY_PG_TIMEOUT_IN_S = int(os.environ.get('SKYRL_RAY_PG_TIMEOUT_IN_S', 180))

Timeout for allocating the placement group for different actors in SkyRL.

attr SKYRL_WORKER_NCCL_TIMEOUT_IN_S

SKYRL_WORKER_NCCL_TIMEOUT_IN_S = int(os.environ.get('SKYRL_WORKER_NCCL_TIMEOUT_IN_S', 600))

Timeout for initializing the NCCL process group for the worker, defaults to 10 minutes.

attr SKYRL_VLLM_DP_PORT_OFFSET

SKYRL_VLLM_DP_PORT_OFFSET = int(os.environ.get('SKYRL_VLLM_DP_PORT_OFFSET', 500))

Offset for the data parallel port of the vLLM server.

attr SKYRL_WAIT_UNTIL_INFERENCE_SERVER_HEALTHY_TIMEOUT_S

SKYRL_WAIT_UNTIL_INFERENCE_SERVER_HEALTHY_TIMEOUT_S = int(os.environ.get('SKYRL_WAIT_UNTIL_INFERENCE_SERVER_HEALTHY_TIMEOUT_S', 600))

Timeout for waiting until the inference server is healthy.

attr SKYRL_LD_LIBRARY_PATH_EXPORT

SKYRL_LD_LIBRARY_PATH_EXPORT = str(os.environ.get('SKYRL_LD_LIBRARY_PATH_EXPORT', 'False')).lower() in ('true', '1', 'yes')

Whether to export LD_LIBRARY_PATH environment variable from the driver to the workers with Ray's runtime env.

For example, if you are using RDMA, you may need to customize the LD_LIBRARY_PATH to include the RDMA libraries (Ex: EFA on AWS).

attr SKYRL_PYTHONPATH_EXPORT

SKYRL_PYTHONPATH_EXPORT = str(os.environ.get('SKYRL_PYTHONPATH_EXPORT', 'False')).lower() in ('true', '1', 'yes')

Whether to export PYTHONPATH environment variable from the driver to the workers with Ray's runtime env.

See https://github.com/ray-project/ray/issues/56697 for details on why this is needed.

attr SKYRL_DUMP_INFRA_LOG_TO_STDOUT

SKYRL_DUMP_INFRA_LOG_TO_STDOUT = str(os.environ.get('SKYRL_DUMP_INFRA_LOG_TO_STDOUT', 'False')).lower() in ('true', '1', 'yes')

When enabled, infrastructure logs (vLLM, Ray, workers) are shown on stdout instead of being redirected to the log file. Useful for debugging startup issues.

Default: False (infrastructure logs go to file only, stdout shows training progress). Set SKYRL_DUMP_INFRA_LOG_TO_STDOUT=1 to show all logs on stdout.

On this page