| Issue |
ITM Web Conf.
Volume 80, 2025
2025 2nd International Conference on Advanced Computer Applications and Artificial Intelligence (ACAAI 2025)
|
|
|---|---|---|
| Article Number | 01009 | |
| Number of page(s) | 5 | |
| Section | Machine Learning & Deep Learning Algorithms | |
| DOI | https://doi.org/10.1051/itmconf/20258001009 | |
| Published online | 16 December 2025 | |
Conditional Latent Diffusion for Precision-Controllable Image Generation
College of Artificial Intelligence and Automation, Hohai University, Changzhou, China
* Corresponding author: 2424020132@hhu.edu.cn
Some models based on Latent Diffusion Models (LDMs), like Stable Diffusion, have revolutionized the image generation field in recent years. But LDMs’ inherent precision control is often not effective enough to solve practical application problems. This paper reviews and compares five classic or state-of-the-art conditional control mechanisms—ControlNet, T2I-Adapter, Composer, UniControl, and FreeControl—designed to address this limitation. This paper analyze their architectural principles, performance trade-offs (e.g., in average FID score, computational cost, and inference speed), and applicability across different domains. Our comparative analysis demonstrates that while UniControl and Composer excel in dealing with tasks with high-quality requirement for their good performance in fine- grained control, methods like T2I-Adapter and FreeControl offer superior efficiency for mobile deployment due to their low computational demands. As the earliest control mechanism, ControlNet is still an effective mechanism and has certain application value. This overview provides a foundation for selecting appropriate control mechanisms for specific image generation tasks.
© The Authors, published by EDP Sciences, 2025
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.

