Enhancing LLM Planning Capabilities through Intrinsic Self-Critique

Enhancing LLM Planning Capabilities through Intrinsic Self-Critique [34.8]
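The paper shows significant performance gains on planning datasets through intrinsic self-critique, without relying on an external source such as a verifier. It describes how self-critique can substantially improve planning performance.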
Reference translation of the paper (metadata) (Tue, 30 Dec 2025 09:23:25 GMT)
The paper proposes an improvement method based on self-critique: "Each iteration of the self-improvement mechanism comprises two key steps: i) plan generation and ii) self-critiquing, aimed at iteratively refining LLM outputs. In step i), the LLM generates a plan (symbolized by a map) based on a prompt incorporating domain-specific knowledge and instructions (symbolized by the treasure chest). Step ii) involves a self-critique mechanism where the LLM evaluates its own performance, providing correctness assessments and justifications, again leveraging domain knowledge."
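I think this is a fairly widely used technique, but the paper validates it thoroughly, including the effect of iterations, which makes it a very useful reference.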
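The two-step loop described in the quoted passage (plan generation followed by self-critique, repeated) can be sketched roughly as follows. This is only an illustrative sketch, not the paper's implementation: the `llm` callable, the prompt wording, the `CORRECT` / `INCORRECT` verdict format, and the `max_iters` cap are all assumptions made here for the example.

```python
from typing import Callable, List, Tuple

# Hypothetical interface: any function mapping a prompt string to a completion string.
LLM = Callable[[str], str]


def self_critique_loop(
    llm: LLM,
    domain_knowledge: str,
    task: str,
    max_iters: int = 3,  # assumed cap; not a value taken from the paper
) -> Tuple[str, List[Tuple[str, str]]]:
    """Alternate plan generation and self-critique using the same model.

    Returns the last plan and the (plan, critique) history.
    """
    history: List[Tuple[str, str]] = []
    plan = ""
    feedback = ""

    for _ in range(max_iters):
        # Step i: generate a plan from domain knowledge, the task,
        # and (after the first round) the previous plan and its critique.
        gen_prompt = f"{domain_knowledge}\n\nTask: {task}\n"
        if feedback:
            gen_prompt += (
                f"\nPrevious plan:\n{plan}\n"
                f"\nCritique of previous plan:\n{feedback}\n"
            )
        gen_prompt += "\nWrite a plan."
        plan = llm(gen_prompt)

        # Step ii: the same model assesses its own plan and justifies the verdict.
        # The verdict format below is an assumption of this sketch.
        critique_prompt = (
            f"{domain_knowledge}\n\nTask: {task}\n\nPlan:\n{plan}\n\n"
            "Assess whether the plan is correct. Start your answer with "
            "CORRECT or INCORRECT, then justify the verdict."
        )
        critique = llm(critique_prompt)
        history.append((plan, critique))

        # Stop as soon as the model judges its own plan correct.
        # "INCORRECT..." does not start with "CORRECT", so this check is safe.
        if critique.strip().upper().startswith("CORRECT"):
            break
        feedback = critique

    return plan, history
```

To try it, pass any function that wraps a model call as `llm`. No external verifier appears anywhere in the loop; the stopping decision rests entirely on the model's own assessment, which is the "intrinsic" part of the setup.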
