Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

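Mechanistic interpretability (MI) has emerged as an important approach for explaining the decision-making of large language models (LLMs). The authors present a practical survey built around the pipeline “Locate, Steer, and Improve.”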
Reference translation of the paper (metadata) (Tue, 20 Jan 2026 14:23:23 GMT)
A survey that frames LLM decision-making through the “Locate, Steer, and Improve” pipeline.
The repository is GitHub – rattlesnakey/Awesome-Actionable-MI-Survey: The Github repo for our survey paper: “Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models”
