LongVie 2: Multimodal Controllable Ultra-Long Video World Model

LongVie 2: Multimodal Controllable Ultra-Long Video World Model [94.9]
LongVie 2はエンドツーエンドの自動回帰フレームワークで、3段階でトレーニングされている。 LongVie 2は、長距離制御性、時間的コヒーレンス、視覚的忠実さにおいて最先端の性能を達成する。
論文参考訳（メタデータ） (Mon, 15 Dec 2025 17:59:58 GMT)
「LongVie 2 achieves state-of-the-art performance in controllable long video generation and can autoregressively synthesize high-quality videos lasting up to 3–5 minutes, marking a significant step toward video world modeling.」とのこと
プロジェクトサイトはLongVie 2

コメントを残す

コメントを残す コメントをキャンセル