Arpit Jadon*, Haoran Wang, Phillip Thomas, Michael Stanley, S. Nathaniel Cibik, Rachel Laurat, Omar Maher, Lukas Hoyer, Ozan Unal*, Dengxin Dai
As perception models continue to develop, the need for large-scale datasets increases. However, data annotation remains far too expensive to effectively scale and meet the demand. Synthetic datasets provide a solution to boost model performance with substantially reduced costs. However, current synthetic datasets remain limited in their scope, realism, and are designed for specific tasks and applications. In this work, we present RealDriveSim, a realistic multi-modal synthetic dataset for autonomous driving that not only supports popular 2D computer vision applications but also their LiDAR counterparts, providing fine-grained annotations for up to 64 classes. We extensively evaluate our dataset for a wide range of applications and domains, demonstrating state-of-the-art results compared to existing synthetic benchmarks. The dataset is publicly available at https://realdrivesim.github.io/
Dataset | Camera | LiDAR | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Adverse-W | 2D Seg. | Sem-Cls. | 2D Det. | 3D Det. | Depth | Optical-Flow | MOT | 3D Det. | 3D Seg. | Sem-Cls. | Scene-Flow | MOT | SLAM | |
SYNTHIA | ✓ | ✓ | 22 | ✓ | ✓ | ✓ | ✗ | ✗ | - | - | - | - | - | - |
GTA-V | ✓ | ✓ | 19 | ✗ | ✗ | ✗ | ✗ | ✗ | - | - | - | - | - | - |
VIPER | ✓ | ✓ | 32 | ✓ | ✓ | ✗ | ✓ | ✓ | - | - | - | - | - | - |
Synscapes | ✗ | ✓ | 19 | ✓ | ✓ | ✓ | ✗ | ✗ | - | - | - | - | - | - |
SHIFT | ✓ | ✓ | 23 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✗ | - | ✗ | ✓ | ✗ |
PreSIL | ✗ | ✓ | 12 | ✓ | ✓ | ✓ | ✗ | ✗ | ✓ | ✗ | 12 | ✗ | ✗ | ✗ |
SynLIDAR | - | - | - | - | - | - | - | - | ✗ | ✓ | 32 | ✗ | ✗ | ✗ |
RealDriveSim | ✓ | ✓ | 61 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | 64 | ✓ | ✓ | ✓ |
Modality | Download (File Size) |
---|---|
2D Modalities | |
Images (RGB) | 367.42 GB |
2D Object Detection | 1.78 GB |
2D Semantic Segmentation | 7.75 GB |
2D Instance Segmentation | 2.54 GB |
Depth Maps | 605.66 GB |
2D Motion Vectors / Optical Flow | 91.54 GB |
3D Modalities | |
3D Point Clouds | 430.33 GB |
3D Object Detection | 62 GB |
3D Semantic Segmentation | 3.53 GB |
3D Instance Segmentation | 1.29 GB |
3D Motion Vectors / Scene Flow | 96.95 GB |
Other | |
Calibration Data | 6 MB |
Modality | Download (File Size) |
---|---|
2D Modalities | |
Images (RGB) | 73.52 GB |
2D Object Detection | 367.51 MB |
2D Semantic Segmentation | 1.55 GB |
2D Instance Segmentation | 522.26 MB |
Depth Maps | 121.13 GB |
2D Motion Vectors / Optical Flow | 19.18 GB |
3D Modalities | |
3D Point Clouds | 86.09 GB |
3D Object Detection | 12.40 GB |
3D Semantic Segmentation | 725.52 MB |
3D Instance Segmentation | 268.09 MB |
3D Motion Vectors / Scene Flow | 19.60 GB |
Other | |
Calibration Data | 6 MB |
Modality | Download (File Size) |
---|---|
2D Modalities | |
Images (RGB) | 11.54 GB |
2D Object Detection | 74.15 MB |
2D Semantic Segmentation | 323.89 MB |
2D Instance Segmentation | 107.68 MB |
Depth Maps | 24.38 GB |
2D Motion Vectors / Optical Flow | 3.75 GB |
3D Modalities | |
3D Point Clouds | 6.16 GB |
3D Object Detection | 2.35 GB |
3D Semantic Segmentation | 75.96 MB |
3D Instance Segmentation | 28.14 MB |
3D Motion Vectors / Scene Flow | 1.46 GB |
Other | |
Calibration Data | 248 KB |
Modality | Download (File Size) |
---|---|
2D Modalities | |
Images (RGB) | 4.07 GB |
2D Object Detection | 25.62 MB |
2D Semantic Segmentation | 119.61 MB |
2D Instance Segmentation | 37.79 MB |
Depth Maps | 8.57 GB |
2D Motion Vectors / Optical Flow | 1.42 GB |
3D Modalities | |
3D Point Clouds | 2.24 GB |
3D Object Detection | 820.03 MB |
3D Semantic Segmentation | 27.68 MB |
3D Instance Segmentation | 9.66 MB |
3D Motion Vectors / Scene Flow | 554.15 MB |
Other | |
Calibration Data | 88 KB |
Dataset Name | No. of Sequences | No. of Frames | Remarks |
---|---|---|---|
Normal Weather – Full | 6,343 | 126,680 | Contains clear weather [including cloudy and overcase conditions] images taken at different times of the day. |
Normal Weather – Sampled | 6,343 | 25,372 | Smaller version of the full normal weather dataset with every 5th frame uniformly sampled from each full sequence. |
Adverse Weather – Batch 1 | 258 | 5,160 | Contains foggy, night, and rainy scenes |
Adverse Weather – Batch 2 | 90 | 1,800 | Contains foggy, night, and rainy scenes |
Note 1: The experiments in our paper were conducted using a combined dataset consisting of Normal Weather – Sampled Dataset, Adverse Weather [Batch 1], and Adverse Weather [Batch 2]. If the full Normal Weather dataset is too large for your needs, we recommend using the same sampled version as used in our experiments.
Note 2: Compared to the dataset used in the paper, we have added one additional sequence each to Adverse Weather Batch 1 and Batch 2.
RealDriveSim Label ID | RealDriveSim Label Name | Cityscapes Train ID | Cityscapes Label Name |
---|---|---|---|
20 | OwnCar(EgoCar) | 255 | ego vehicle |
68 | GuardRail | 255 | guard rail |
24 | Road | 0 | road |
27 | RoadMarking | 0 | road |
49 | RoadMarkingSpeed | 0 | road |
57 | RoadMarkingArrows | 0 | road |
58 | RoadMarkingBottsDots | 0 | road |
66 | RoadBoundary(CurbRoadLevel) | 0 | road |
11 | LaneMarking | 0 | road |
41 | LaneMarkingGap | 0 | road |
61 | LaneMarkingSpan | 0 | road |
40 | LaneMarkingOther | 0 | road |
56 | LaneMarking(ParkingIndicator) | 0 | road |
8 | CrossWalk | 0 | road |
59 | StopLine | 0 | road |
12 | LimitLine | 0 | road |
63 | ParkingSpot | 0 | road |
15 | OtherDriveableSurface | 255 | parking |
48 | ParkingLot | 255 | parking |
55 | LaneMarking(Parking) | 255 | parking |
28 | SideWalk | 1 | sidewalk |
26 | RoadBoundary(Curb) | 1 | sidewalk |
54 | RoadBoundary(CurbFlat) | 1 | sidewalk |
64 | RoadBoundary(CurbTop) | 1 | sidewalk |
65 | RoadBoundary(CurbSide) | 1 | sidewalk |
3 | Building | 2 | building |
19 | Overpass/Bridge/Tunnel | 255 | bridge |
69 | Wall | 3 | wall |
25 | RoadBarriers | 3 | wall |
9 | Fence | 4 | fence |
42 | Fence(Transparent) | 4 | fence |
38 | VerticalPole | 5 | pole |
10 | HorizontalPole | 5 | pole |
45 | OtherPole | 5 | pole |
33 | TrafficLight | 6 | traffic light |
34 | TrafficSign | 7 | traffic sign |
37 | Vegetation | 8 | vegetation |
44 | Vegetation(Bush) | 8 | vegetation |
50 | Vegetation(GroundCover) | 8 | vegetation |
52 | Vegetation(Tree) | 8 | vegetation |
51 | Vegetation(Grass) | 8 | vegetation |
31 | Terrain | 9 | terrain |
29 | Sky | 10 | sky |
46 | Powerline | 10 | sky |
22 | Pedestrian | 11 | person |
2 | Bicyclist | 12 | rider |
14 | Motorcyclist | 12 | rider |
18 | OtherRider | 12 | rider |
103 | Van | 13 | car |
5 | Car | 13 | car |
6 | Caravan/RV | 255 | caravan |
36 | Truck | 14 | truck |
104 | ConstructionVehicle(Truck) | 14 | truck |
4 | Bus | 15 | bus |
47 | SchoolBus | 15 | bus |
35 | Train | 16 | train |
23 | Railway | 255 | rail track |
13 | Motorcycle | 17 | motorcycle |
1 | Bicycle | 18 | bicycle |
32 | TowedObject | 255 | trailer |
0 | Animal | 255 | dynamic |
17 | OtherMovable | 255 | dynamic |
30 | TemporaryConstructionObject | 255 | dynamic |
39 | WheeledSlow | 255 | dynamic |
53 | Debris | 255 | dynamic |
60 | ChannelizingDevice | 255 | dynamic |
67 | Water | 255 | ground |
16 | OtherFixedStructure | 255 | static |
21 | ParkingMeter | 255 | static |
43 | StaticObject(Trashcan) | 255 | static |
62 | StaticObject(BikeRack) | 255 | static |
255 | Void | 255 | unlabeled |