bias – Introducing the Latest arXiv Papers

Your AI, Not Your View: The Bias of LLMs in Investment Analysis

Your AI, Not Your View: The Bias of LLMs in Investment Analysis [55.3]
In finance, LLMs (Large Language Models) frequently face knowledge conflicts caused by discrepancies between their pre-trained parametric knowledge and real-time market data. The paper presents the first quantitative analysis of confirmation bias in LLM-based investment analysis. The authors observe a consistent preference for large-cap stocks and for contrarian strategies across most models.
Reference translation of the paper (metadata) (Mon, 28 Jul 2025 16:09:38 GMT)
A quantitative analysis of LLM biases in investment decisions.
The paper reports a bias: "The results show that LLMs are not neutral decision-makers, with distinct preferences for certain financial factors depending on the model. While sector preferences varied significantly across models, showing no overall trend, a common bias towards large-size stocks and a consistent preference for a contrarian investment view over momentum were observed." The models also appear quite stubborn: "While the models correctly reversed their decisions when presented only with counter-evidence, their flexibility sharply decreased in situations where supporting and counter-evidence were mixed and conflicting."
Great care is needed whenever an LLM is asked to make a judgment.

MLLMs are Deeply Affected by Modality Bias

MLLMs are Deeply Affected by Modality Bias [158.6]
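Recent advances in MLLMs (Multimodal Large Language Models) have shown promising results in integrating diverse modalities such as text and images. MLLMs are strongly affected by modality bias: they often rely on language and underuse other modalities such as visual input. The paper argues that MLLMs are deeply affected by modality bias and shows how it manifests across a range of tasks.
Reference translation of the paper (metadata) (Sat, 24 May 2025 11:49:31 GMT)
An examination of modality bias, which the paper describes as follows: "Modality bias arises when certain modalities dominate the learning process, while others are underutilized or contribute less effectively".
The explanation that "From a model learning perspective, [49] identifies the differing convergence rates of modalities as a core cause of modality bias. The varying levels of difficulty in fitting category labels across different modalities contribute to this disparity." makes intuitive sense, but the problem looks hard to eliminate. The proverb says that seeing once is better than hearing a hundred times (百聞は一見に如かず), so I wonder how humans cope with this.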

Assessing Judging Bias in Large Reasoning Models: An Empirical Study

Assessing Judging Bias in Large Reasoning Models: An Empirical Study [99.9]
Large reasoning models (LRMs) such as DeepSeek-R1 and OpenAI-o1 have demonstrated remarkable reasoning capabilities. The paper presents a benchmark of judging bias in LLMs and LRMs on both subjective preference-alignment datasets and objective fact-based datasets.
Reference translation of the paper (metadata) (Mon, 14 Apr 2025 07:14:27 GMT)
An examination of the biases LRMs show when acting as judges.
LRMs generally perform well as judges, and the paper states: "Through investigation of bandwagon, authority, position, and distraction biases, we uncover four key findings: (1) despite their advanced reasoning capabilities, LRMs remain susceptible to the above biases; (2) LRMs demonstrate better robustness than LLMs specifically on fact-related datasets; (3) LRMs exhibit notable position bias, preferring options in later positions; and (4) we identify a novel “superficial reflection bias” where phrases mimicking reasoning (e.g., “wait, let me think…”) significantly influence model judgments."
The point that "We identify a novel “superficial reflection bias” in LRMs, where phrases mimicking reasoning significantly influence judging outcomes, demonstrating how reasoning mechanisms can introduce new vulnerabilities in automated evaluation." is interesting, as is the likelihood that this bias stems from the training process.
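The position bias mentioned above (a preference for options in later positions) can be probed by judging each pair twice with the order swapped. The following is only a minimal sketch under stated assumptions, not the paper's benchmark: the `Judge` signature, the "A"/"B" labels, and the function name `position_bias_stats` are all hypothetical and chosen for illustration.

```python
from typing import Callable, Dict, Sequence, Tuple

# Hypothetical judge interface (an assumption, not from the paper):
# takes (question, answer_in_slot_A, answer_in_slot_B) and returns "A" or "B".
Judge = Callable[[str, str, str], str]


def position_bias_stats(
    judge: Judge, items: Sequence[Tuple[str, str, str]]
) -> Dict[str, float]:
    """Judge every (question, answer_1, answer_2) pair in both orders.

    inconsistency_rate: share of pairs where the judge picked the same slot
        in both orders, i.e. the verdict followed position, not content.
    later_slot_rate: share of all verdicts that chose the later slot ("B").
    """
    if not items:
        raise ValueError("items must not be empty")

    inconsistent = 0
    later_slot = 0
    for question, answer_1, answer_2 in items:
        original = judge(question, answer_1, answer_2)  # answer_1 in slot A
        swapped = judge(question, answer_2, answer_1)   # answer_1 in slot B
        # A content-driven judge flips its slot label when the order flips.
        if original == swapped:
            inconsistent += 1
        later_slot += (original == "B") + (swapped == "B")

    n = len(items)
    return {
        "inconsistency_rate": inconsistent / n,
        "later_slot_rate": later_slot / (2 * n),
    }
```

A `later_slot_rate` well above 0.5 combined with a high `inconsistency_rate` would suggest the kind of later-position preference the paper describes, although a real evaluation would wrap an actual model call behind `judge` and use far more pairs.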

Biased AI can Influence Political Decision-Making

Biased AI can Influence Political Decision-Making [64.9]
This paper examines how partisan bias in AI language models affects political decision-making. Participants exposed to politically biased models were significantly more likely to adopt opinions and make decisions that matched the AI's bias.
Reference translation of the paper (metadata) (Tue, 08 Oct 2024 22:56:00 GMT)
The paper points out that "We found that participants exposed to politically biased models were significantly more likely to adopt opinions and make decisions aligning with the AI’s bias, regardless of their personal political partisanship." and that "However, we also discovered that prior knowledge about AI could lessen the impact of the bias, highlighting the possible importance of AI education for robust bias mitigation." Education does seem to help, but I suspect the problem will keep growing.

Bias in large language models (examined using CoDa)

Do ever larger octopi still amplify reporting biases? Evidence from judgments of typical colour [27.8]
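Language models (LMs) trained on raw text have no direct access to the physical world. The paper investigates reporting bias in larger language models through the lens of colour.
Reference translation of the paper (metadata) (Mon, 26 Sep 2022 15:45:23 GMT)
- A paper that examines bias in large language models using colour-related prompts. Interestingly, once language models become very large, their judgments come closer to human scores than Google Ngram does.
- The dataset used is nala-cub/coda: The World of an Octopus: How Reporting Bias Influences a Language Model’s Perception of Color (github.com).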

A survey of bias mitigation in machine learning

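Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey [25.3]
This paper comprehensively surveys bias mitigation methods for achieving fairness in machine learning (ML) models. It collects a total of 234 publications on bias mitigation for ML classifiers and examines the existing mitigation methods.
Reference translation of the paper (metadata) (Thu, 14 Jul 2022 17:16:45 GMT)
- A survey of methods for mitigating bias in machine-learning classifiers. It covers more than 200 papers, and the sheer variety of approaches and methods is surprising.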

Transfer learning and bias

When does Bias Transfer in Transfer Learning? [89.2]
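Using transfer learning to adapt a pre-trained "source model" to a downstream "target task" can dramatically improve performance with seemingly no downside. The paper demonstrates that there is in fact a downside: bias transfer, the tendency for the source model's biases to persist even after the model has been adapted to the target class.
Reference translation of the paper (metadata) (Wed, 6 Jul 2022 17:58:07 GMT)
- The paper reports that when a biased pre-trained model is used via transfer learning, the final model ends up biased even if the target dataset itself is unbiased. This risk needs to be kept in mind.
- The repository is GitHub – MadryLab/bias-transfer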

A survey of gender bias in NLP

A Survey on Gender Bias in Natural Language Processing [22.9]
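The survey covers 304 papers on gender bias in natural language processing and compares contrasting approaches to detecting and mitigating gender bias. Research on gender bias suffers from four core limitations: 1) it treats gender as a binary variable, ignoring its fluidity and continuity; 2) it is conducted in monolingual settings; 3) it neglects ethical considerations; 4) it is fundamentally flawed because it defines gender bias very narrowly and lacks evaluation baselines and pipelines.
Reference translation of the paper (metadata) (Tue, 28 Dec 2021 14:54:18 GMT)
- A survey on gender bias, an issue that cannot be avoided when deploying AI in society. Of the four problems raised, I think the observation that gender and gender bias have not been properly defined is an especially important one.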
