2025年12月19日 – arXiv最新論文の紹介

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification [91.2]
本稿では、長い思考の連鎖から要約された結果の合理化過程を検証する、アウトカムベースプロセス検証(OPV)を提案する。 OPV は 76.3 と比較して F1 スコアが 83.1 の Qwen3-Max-Preview など,はるかに大きなオープンソースモデルよりも優れています。
論文参考訳（メタデータ） (Thu, 11 Dec 2025 15:47:38 GMT)
「We introduced the Outcome-based Process Verifier (OPV), which bridges outcome and process verification by operating on summarized solutions from long CoTs. Through an iterative active learning framework with expert annotations, OPV progressively improves its verification capabilities while minimizing annotation costs.」とCoT的な推論過程を検証するアプローチの提案。

Deep Research: A Systematic Survey [118.8]
Deep Research (DR)は、大規模言語モデルの推論能力と検索エンジンなどの外部ツールを組み合わせることを目的としている。本調査は,深層研究システムの包括的かつ体系的な概要を提示する。
論文参考訳（メタデータ） (Mon, 24 Nov 2025 15:28:28 GMT)
Deep Resaerchに関するサーベイ。関連研究を含め幅広いサーベイになっている。引用論文リストからは（当然と言えば当然だが）2025年以降に非常に盛り上がっている状況が分かる。
リポジトリはGitHub – mangopy/Deep-Research-Survey: A Systematic Survey of Deep Research