2025年10月15日 – arXiv最新論文の紹介

Imperceptible Jailbreaking against Large Language Models

Imperceptible Jailbreaking against Large Language Models [107.8]
変分セレクタと呼ばれるUnicode文字のクラスを利用する非受容ジェイルブレイクを導入する。目に見えない変分セレクタを悪意のある質問に追加することで、ジェイルブレイクプロンプトは画面上の元の悪意のある質問と視覚的に同じように見える。本研究では,このような逆接尾辞を生成し,有害な応答を誘導する探索パイプラインを提案する。
論文参考訳（メタデータ） (Mon, 06 Oct 2025 17:03:50 GMT)
目に見えないUnicode文字を使った imperceptible jailbreaksの提案。
リポジトリはGitHub – sail-sg/imperceptible-jailbreaks: [ArXiv 2025] Imperceptible Jailbreaking against Large Language Models

AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning [110.6]
自己進化型エージェント推論システムであるAlphaApolloについて述べる。基礎モデル(FM)における2つのボトルネックに対処することを目的としている。 AlphaApolloは、意図的に検証可能な推論を可能にするために、複数のモデルをプロのツールで編成する。
論文参考訳（メタデータ） (Sun, 05 Oct 2025 15:42:24 GMT)
Tool-augmented reasoningのためのフレームワーク。Appollo計画と共通点があるという主張は若干謎ではある、
リポジトリはGitHub – tmlr-group/AlphaApollo: [arXiv:2510.06261] “AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning”

Artificial Hippocampus Networks for Efficient Long-Context Modeling [17.2]
ロングシーケンス・モデリングは、RNNのようなモデルにおける圧縮固定サイズメモリの効率と、注目ベースのトランスフォーマーにおけるメモリの増大の忠実さとのトレードオフに直面している。認知科学における多段階モデルに着想を得て,人工ニューラルネットワークのメモリフレームワークを導入する。長文ベンチマークのLV-EvalとInfiniteBenchの実験は、AHN拡張モデルがスライディングウインドウベースラインを一貫して上回ることを示した。
論文参考訳（メタデータ） (Wed, 08 Oct 2025 17:59:55 GMT)
「AHNs address the efficiency limitation of standard transformers by maintaining a sliding window of KV cache as lossless memory while transforming out-of-window information into a fixed-size compressed memory This approach enables AHN-augmented models to achieve constant memory and computational complexity per token over long sequences. Experiments」と長文に強い構造の提案。
リポジトリはGitHub – ByteDance-Seed/AHN: AHN: Artificial Hippocampus Networks for Efficient Long-Context Modeling