Survey – Page 13 – Introduction to the Latest arXiv Papers

Generative AI and Creative Work: Narratives, Values, and Impacts

Generative AI and Creative Work: Narratives, Values, and Impacts [37.2]
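We review online media outlets and analyze the dominant narratives they convey about AI's impact on creative work. This discourse promotes a creativity freed from its material realization through human labor. It tends to correspond to the dominant techno-positivist vision and to assert power over the creative economy and culture.
Reference translation of the paper (metadata) (Thu, 06 Feb 2025 10:26:56 GMT)
"In this article, we review online media outlets and analyze the dominant narratives around AI’s impact on creative work that they convey."
People will disagree on whether lower barriers to entry are a good thing, and on whether it is desirable for ideas to gain weight relative to execution, but the spread of the technology cannot be stopped. That said, I do wonder whether the following claim is really true: "For example, we believe that five years ago, narratives of generative AI in art emphasized the replacement of artists by technology, whereas current narratives focus more on augmentation and collaboration."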

A Survey of Sample-Efficient Deep Learning for Change Detection in Remote Sensing: Tasks, Strategies, and Challenges

A Survey of Sample-Efficient Deep Learning for Change Detection in Remote Sensing: Tasks, Strategies, and Challenges [46.6]
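The rapid development of deep learning (DL) has enabled automatic, accurate, and robust change detection (CD) on large volumes of remote sensing images (RSIs). Despite advances in CD methods, their practical application in real-world contexts remains limited because of the diversity of input data and application contexts. This paper summarizes the literature methods for various CD tasks, along with the strategies and techniques for training and deploying DL-based CD methods in sample-limited scenarios.
Reference translation of the paper (metadata) (Wed, 05 Feb 2025 02:36:09 GMT)
A survey that describes itself as follows: "this article summarizes the literature methods for different CD tasks and the available strategies and techniques to train and deploy DL-based CD methods in sample-limited scenarios." (CD = Change Detection)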

A Survey on Memory-Efficient Large-Scale Model Training in AI for Science

A Survey on Memory-Efficient Large-Scale Model Training in AI for Science [20.3]
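This survey reviews applications across scientific fields such as biology, medicine, chemistry, and meteorology. It gives an overview of memory-efficient training techniques for large language models (LLMs) based on the transformer architecture. It demonstrates that memory optimization methods can reduce storage demands while preserving prediction accuracy.
Reference translation of the paper (metadata) (Tue, 21 Jan 2025 03:06:30 GMT)
A survey of memory-efficient models focused on applications to science.
It also includes material such as: "Using AlphaFold 2 as an example, we demonstrate how tailored memory optimization methods can reduce storage needs while preserving prediction accuracy."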

A Survey of World Models for Autonomous Driving

A Survey of World Models for Autonomous Driving [63.3]
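Recent breakthroughs in autonomous driving have revolutionized the way vehicles perceive and interact with their surroundings. World models provide high-fidelity representations of the driving environment that integrate multi-sensor data, semantic cues, and temporal dynamics. These world models pave the way for more robust, reliable, and adaptable autonomous driving solutions.
Reference translation of the paper (metadata) (Mon, 20 Jan 2025 04:00:02 GMT)
A survey of world models focused on autonomous driving.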

Generative Physical AI in Vision: A Survey

Generative Physical AI in Vision: A Survey [25.9]
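Generative artificial intelligence (AI) has rapidly advanced the field of computer vision, enabling machines to create and interpret visual data with unprecedented sophistication. As generative AI evolves to integrate physical realism and dynamic simulation, its potential to function as a "world simulator" grows. This survey systematically reviews the emerging field of physics-aware generative AI in computer vision.
Reference translation of the paper (metadata) (Sun, 19 Jan 2025 03:19:47 GMT)
A survey of physics-aware generation, which is expected to evolve into a world simulator.
Repository: GitHub – BestJunYu/Awesome-Physics-aware-Generation: Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and shaping the future!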

A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities

A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities [31.2]
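Embodied AI (EmAI) in healthcare spans diverse fields such as algorithms, robotics, and biomedicine. We present a comprehensive overview of the "brain" of EmAI for healthcare, introducing AI algorithms for perception, actuation, planning, and memory. We discuss technical barriers, explore ethical considerations, and look ahead to the future of EmAI in healthcare.
Reference translation of the paper (metadata) (Mon, 13 Jan 2025 16:35:52 GMT)
A survey of embodied AI in healthcare. It is very broad in scope and cites more than 800 references.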

Harnessing Large Language Models for Disaster Management: A Survey

Harnessing Large Language Models for Disaster Management: A Survey [57.0]
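Large language models (LLMs) have revolutionized scientific research with their exceptional capabilities and have transformed a variety of fields. This study aims to guide the professional community in developing advanced LLMs for disaster management and to strengthen resilience against natural disasters.
Reference translation of the paper (metadata) (Sun, 12 Jan 2025 21:00:50 GMT)
A survey on applying LLMs to disasters, organized along the axes of Mitigation, Preparedness, Response, and Recovery.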

Generative AI for Cel-Animation: A Survey

Generative AI for Cel-Animation: A Survey [40.2]
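GenAI is revolutionizing traditional animation by lowering technical barriers, broadening accessibility, and letting artists focus on creative expression and artistic innovation. Despite this potential, issues such as maintaining consistency, ensuring stylistic coherence, and addressing ethical considerations continue to pose challenges.
Reference translation of the paper (metadata) (Wed, 08 Jan 2025 20:57:39 GMT)
A survey of generative AI in animation.
Repository: GitHub – yunlong10/Awesome-AI4Animation: 🔥🔥🔥 This repository includes latest papers, projects and datasets on GenAI for Cel-Animation.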

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models [33.1]
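Large language models (LLMs) have sparked significant research interest in leveraging them to tackle complex reasoning tasks. Recent studies show that encouraging LLMs to "think" with more tokens during test-time inference can significantly boost reasoning accuracy. The introduction of OpenAI's o1 series marks a significant milestone in this research direction.
Reference translation of the paper (metadata) (Thu, 16 Jan 2025 17:37:58 GMT)
A survey of OpenAI o1-like models, i.e., Large Reasoning Models. It is comprehensive, as the authors' own description indicates: "We begin by introducing the foundational background of LLMs and then explore the key technical components driving the development of large reasoning models, with a focus on automated data construction, learning-to-reason techniques, and test-time scaling."
As I also felt with the paper below, progress really is fast.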

O1 Replication Journey — Part 3: Inference-time Scaling for Medical Reasoning [27.8]
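This study explores the potential of inference-time scaling in large language models (LLMs) for medical reasoning tasks. With a modest training set of 500 samples, the model achieves performance improvements of 6%-11%.
Reference translation of the paper (metadata) (Sat, 11 Jan 2025 07:10:23 GMT)
Project site: GitHub – SPIRAL-MED/Ophiuchus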

Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey

Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey [6.7]
Multimodal vision language models (VLMs) have emerged as a transformative technology at the intersection of computer vision and natural language processing. VLMs demonstrate strong reasoning and understanding abilities on visual and textual data and surpass classical single-modality vision models on zero-shot classification.
Reference translation of the paper (metadata) (Sat, 04 Jan 2025 04:59:33 GMT)
A survey of VLMs, described by the authors as follows: "we provide a systematic overview of VLMs in the following aspects: [1] model information of the major VLMs developed over the past five years (2019-2024); [2] the main architectures and training methods of these VLMs; [3] summary and categorization of the popular benchmarks and evaluation metrics of VLMs; [4] the applications of VLMs including embodied agents, robotics, and video generation; [5] the challenges and issues faced by current VLMs such as hallucination, fairness, and safety."
Repository: GitHub – zli12321/VLM-surveys: A most Frontend Collection and survey of vision-language model papers, and models GitHub repository
