SSR: Socratic Self-Refine for Large Language Model Reasoning

SSR: Socratic Self-Refine for Large Language Model Reasoning [78.6]
Socratic Self-Refine (SSR)は、大規模言語モデル(LLM)のきめ細かい評価と精度向上のための新しいフレームワークである。提案したSSRはモデル応答を検証可能な(サブクエスト,サブサブアンサー)ペアに分解し,ステップレベルの信頼度推定を可能にする。 5つの推論ベンチマークと3つのLCMによる実証的な結果から、SSRは一貫して最先端の反復的自己修正ベースラインを上回っていることが分かる。
論文参考訳（メタデータ） (Fri, 14 Nov 2025 02:00:16 GMT)
「We propose a novel framework, Socratic Self-Refine (SSR), that allows more fine-grained confidence estimation and precise error control over decomposed reasoning steps. By formulating reasoning as a sequence of (sub-question, sub-answer) pairs, SSR overcomes the limitations of existing holistic self-refinement methods.」というフレームワークを提案、効果を確認。
リポジトリはGitHub – SalesforceAIResearch/socratic-self-refine-reasoning

コメントを残す

コメントを残す コメントをキャンセル