Comparative Evaluation of Mean Cumulative Regret in Multi-Armed Bandit Algorithms: ETC, UCB, Asymptotically Optimal UCB, and TS
ITM Web Conf., 73 (2025) 01026
Published online: 17 February 2025
DOI: 10.1051/itmconf/20257301026