A Survey on the Honesty of Large Language Models

A Survey on the Honesty of Large Language Models [115.8]
正直とは、大きな言語モデル(LLM)を人間の価値と整合させる基本的な原則である。将来性はあるものの、現在のLLMは依然として重大な不正直な行動を示す。
論文参考訳（メタデータ） (Fri, 27 Sep 2024 14:34:54 GMT)
「Honesty is a fundamental principle for aligning large language models (LLMs) with human values, requiring these models to recognize what they know and don’t know and be able to faithfully express their knowledge.」から始まるサーベイ。
リポジトリはGitHub – SihengLi99/LLM-Honesty-Survey

コメントを残す

コメントを残す コメントをキャンセル