Reinforcement Learning in Medical Settings —A Review of Counterfactual Reward Estimation Methods Based on Causal GraphsZheng Fan and Yue PengITM Web Conf., 78 (2025) 01018DOI: https://doi.org/10.1051/itmconf/20257801018