Legal LLM Explainable Reward Engine (GRPO + three-dimensional reward function)
python law reinforcement-learning explainable-ai legal-ai legal-nlp large-language-models rlhf reward-model grpo llm-alignment judicial-reasoning
Updated Jun 4, 2026 - Python