3D Layout-Guided Autonomous Driving Scene Generation

Yu Liu

doi:10.1051/itmconf/20268404011

Open Access

Issue		ITM Web Conf. Volume 84, 2026 2026 International Conference on Advent Trends in Computational Intelligence and Data Science (ATCIDS 2026)


Article Number		04011
Number of page(s)		8
Section		Computer Vision, Robotic Systems, and Intelligent Control
DOI		https://doi.org/10.1051/itmconf/20268404011
Published online		06 April 2026

ITM Web of Conferences 84, 04011 (2026)

3D Layout-Guided Autonomous Driving Scene Generation

Yu Liu^*

Huaxin Software College, Tianjin University of Technology, 300387, Tianjin, China

^* Corresponding author’s email: This email address is being protected from spambots. You need JavaScript enabled to view it.

Abstract

Improving the robustness of autonomous driving perception models relies on large-scale, diverse scenario data. However, real-world road data has challenges such as high collection costs, scarcity of extreme scenarios, and complexity in multi-view labeling. Generative AI scene synthesis technology has emerged as a key solution, with diffusion models gradually replacing GAN models as the mainstream. This paper provides a systematic review of autonomous driving scene synthesis technology, outlining the evolution of the technology, clarifying the core features and logic of different generations; it focuses on analyzing the representative solution DrivingDiffusion, the first video generation framework to achieve “3D layout controllability, multi-view coordination, and temporal coherence,” dissecting its architecture and core module design based on latent diffusion models (LDM). It further compares the performance of diffusion-based methods with traditional GAN-based approaches across key metrics like scene fidelity and label consistency. Moreover, it extracts the key issues and challenges in the current field; finally, it looks forward to future development directions, providing a reference for subsequent research on related virtual data generation.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.