FP8 Quantization

Cosmos-RL supports FP8 quantization for policy and rollout. Currently, only rowwise quantization is supported.

Enable FP8 Quantization

To enable FP8 quantization, you need to set the following configuration:

[train.fp8]
enable_fp8 = true
fp8_recipe = "dynamic_scaling"
quant_recipe = "rowwise"

[rollout]
quantization = "fp8"

For policy:

fp8_recipe could only be set to dynamic_scaling now.
quant_recipe could be set to rowwise or tensorwise. rowwise is recommended for better accuracy.

For rollout:

quantization should be set to fp8. Then the rollout will dynamically quantize weights from policy to fp8 during weight synchronization in rowwise manner.