Skip to content

Vision AI Gallery

Authors: Aiden ChangAkul Santhosh

Organization: NVIDIA

We provide a dedicated Brev instance to help you follow along with these examples. The default configuration uses 8× H100 GPUs, but you can switch to 1× H100 to reduce costs (with slower inference performance).

Brev Instance

Overview

This page showcases results generated with Cosmos Transfer 2.5 for Vision AI applications. The examples demonstrate sim-to-real transfer across a variety of urban and roadway scenarios, illustrating how source videos can be transformed to reflect different times of day, lighting conditions, weather, environmental effects, and scene elements.

To understand what each control modality does, please refer to our control modality concepts page. This page will be focused on showing some different results that we can make.

Use Case: Vision based applications can leverage these techniques to train, test, and validate perception systems under diverse and challenging conditions without additional data collection.

Input Video

We showcase the different input control modalities used for this highway scene.

We now show example results generated using these control modalities.

Examples

Parameters - Fog
guidance: 3, edge: 0.5, depth: 1.0
Input Prompt
A video of a highway with a dense, heavy fog hangs low over the highway, dramatically reducing visibility and softening the outlines of the surrounding hills and leafless trees. A white sedan travels away from the camera in the right lane, its taillights glowing dimly through the fog. The scene conveys slow-moving traffic under conditions with near-whiteout visibility.
Parameters - Morning Sunlight
guidance: 3, edge: 1.0, depth: 0.9
Input Prompt
A video of a winding four-lane divided highway cutting through a rural landscape of rolling hills under clear morning sunlight. The low sun casts long, soft shadows across the gently curving roadway and illuminates dry brown grass and leafless trees along the roadside with a warm, golden glow. A white sedan travels away from the camera in the right lane. The sky is pale blue with thin, high clouds, and the scene captures the calm flow of light traffic in crisp, early-day conditions.
Parameters - Night
guidance: 3, edge: 0.5, depth: 1.0
Input Prompt
A video of a winding four-lane divided highway cutting through a rural landscape of rolling hills at night. The scene is illuminated primarily by vehicle headlights and sparse roadside lighting, with reflective lane markings and road signs glowing against the dark asphalt. The surrounding hills and leafless trees fade into deep shadows beyond the roadway. A white sedan travels away from the camera in the right lane, its red taillights tracing the gentle S-curve. The sky is black and clouded, and the scene conveys light traffic moving steadily through a quiet, nighttime rural environment.
Parameters - Rain
guidance: 3, edge: 0.9, depth: 1.0
Input Prompt
A video of a winding four-lane divided highway cutting through a rural landscape of rolling hills, now soaked by a severe rainstorm. The roadway is partially flooded, with standing water pooling across multiple lanes and flowing toward the shoulders, where drainage ditches have overflowed. Dark, rain-slick asphalt reflects headlights and the gray sky above. A white sedan travels away from the camera in the right lane, sending up wide sprays of water. Sheets of rain reduce visibility, and low clouds hang heavy over the scene, conveying hazardous driving conditions during a flood event.
Parameters - Snow
guidance: 3, edge: 0.9, depth: 1.0
Input Prompt
A video of a winding four-lane divided highway cutting through a rural landscape of rolling hills blanketed in snow, with patches of icy pavement and snowbanks lining the shoulders. Leafless trees are dusted with fresh snow. A white sedan travels away from the camera in the right lane. The scene captures the flow of light traffic under a cold, gray, overcast winter sky.
Parameters - Wooden Road
guidance: 7, edge: 0.6, seg: 0.4
Input Prompt
A video of a winding four-lane divided highway cutting through a rural landscape of rolling hills, dry brown grass, and leafless trees. The roadway is constructed from long, weathered wooden planks laid lengthwise, with visible seams, grain patterns, and slight warping between boards. The wooden surface follows the gentle curves of the highway and shows subtle wear from traffic. A white sedan travels away from the camera in the right lane. The scene captures the flow of light traffic on a gray, overcast day.
Parameters - Debris
guidance: 7, edge: 0.5, seg: 0.8, depth: 0.4
Input Prompt
A video of a winding four-lane divided highway cutting through a rural landscape of rolling hills, dry brown grass, and leafless trees under a gray, overcast sky. A large brown bear stands in the middle of the roadway near the center divide, facing slightly toward the oncoming lanes.
Parameters - Small Car
guidance: 3, edge: 0.5, seg: 0.4, seg_mask: True, depth: 1.0
Input Prompt
A video of a winding four-lane divided highway cutting through a rural landscape of rolling hills, dry brown grass, and leafless trees. A blue Smart Fortwo microcar travels away from the camera in the right lane, appearing notably small against the wide roadway. The scene captures the flow of light traffic on a gray, overcast day.
Parameters - Van
guidance: 7, edge: 0.5, seg: 0.8, seg_mask: True, depth: 0.5
Input Prompt
A video of a winding four-lane divided highway cutting through a rural landscape of rolling hills, dry brown grass, and leafless trees. A black Ford Transit cargo van travels away from the camera in the right lane, its tall, boxy profile clearly visible against the wide roadway. The scene captures the flow of light traffic on a gray, overcast day.
Parameters - People Generation
guidance: 3, edge: 0.5, seg: 0.7, depth: 0.5
Input Prompt
A video of a winding four-lane divided highway cutting through a rural landscape of rolling hills, dry brown grass, and leafless trees. A white sedan travels away from the camera in the right lane. Both sides of the road are lined with wide sidewalks densely populated with pedestrians—dozens of clearly visible people walking in clusters and alone. Individuals wear jackets, hats, and backpacks, some talking to each other, others looking at their phones or walking dogs. The constant movement of people along the sidewalks is a dominant visual element, contrasting with the light vehicle traffic on the road. The scene unfolds under a gray, overcast sky, emphasizing a cool, busy daytime atmosphere.

Edge & Depth Control only for Environmental Variations

This example demonstrates how to transform videos into scenes with different environmental conditions and surface materials using edge and depth control. Edge control preserves the original scene structure and motion, while depth control maintains the spatial relationships between objects. All the prompts are the same as the above examples.

Fog Changes

This scene shows different fog augmentations generated by varying the control modalities.

Lighting Changes

This scene shows different lighting conditions generated by varying the control modalities.

Night Augmentations

This scene shows different night conditions generated by varying the control modalities.

Rain Augmentations

This scene shows different rainy conditions generated by varying the control modalities.

Snow Augmentations

This scene shows different snowy conditions generated by varying the control modalities.

Other Video Examples

Here are some results from similar other videos.

Video Example 1

Video Example 2

Video Example 3