| Issue |
ITM Web Conf.
Volume 78, 2025
International Conference on Computer Science and Electronic Information Technology (CSEIT 2025)
|
|
|---|---|---|
| Article Number | 01005 | |
| Number of page(s) | 7 | |
| Section | Deep Learning and Reinforcement Learning – Theories and Applications | |
| DOI | https://doi.org/10.1051/itmconf/20257801005 | |
| Published online | 08 September 2025 | |
Towards Efficient Deep Reinforcement Learning: A Comprehensive Review of Optimization Techniques
School of Computer and Electronic Information /School of Artificial Intelligence, Nanjing Normal University, Nanjing, Jiangsu, China
Deep reinforcement learning (DRL) has achieved remarkable success in various domains, including robotics, game playing, and autonomous systems. However, optimizing DRL models remains a significant challenge due to issues such as sample inefficiency, instability, and high computational costs. To address these challenges, researchers have explored various optimization strategies that enhance learning efficiency and model performance. This paper provides a comprehensive review of optimization methods in DRL, focusing on hyperparameter optimization, structural optimization, and algorithm optimization. It explores common techniques for hyperparameter optimization, including Bayesian optimization, grid search, gradient optimization, and evolutionary algorithms, discussing their strengths, weaknesses, and suitable application scenarios. The paper also examines structural optimization, highlighting key mechanisms such as attention mechanisms and generative adversarial networks (GANs), and their impact on improving model performance in various domains. Additionally, it analyzes algorithm optimization strategies, including Double-Q-learning, Proximal Policy Optimization (PPO), MuZero, and deep evolutionary strategies (DES), comparing their effectiveness in solving complex tasks. Future research directions include the combination of optimization methods, with a focus on generalization and interpretability, as well as exploring real-world applications to further improve existing strategies. This review provides valuable insights for researchers and practitioners aiming to advance DRL technologies
© The Authors, published by EDP Sciences, 2025
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.

