Focus on the Optimization of the RLHF Algorithm to Enhance the Training Effect After LLM
ITM Web Conf., 84 (2026) 03006
Published online: 06 April 2026
DOI: 10.1051/itmconf/20268403006

