Download citation

Focus on the Optimization of the RLHF Algorithm to Enhance the Training Effect After LLM

ITM Web Conf., 84 (2026) 03006
DOI: https://doi.org/10.1051/itmconf/20268403006