ShowUI-$π$: Flow-based Generative Models as GUI Dexterous Hands

ShowUI-$π$: Flow-based Generative Models as GUI Dexterous Hands [59.2]
そこで我々は,GUI dexterous Handとして最初のフローベース生成モデルである ShowUI-$ を開発した。 ShowUI-$$は、たった450万のパラメータで26.98を達成する。
論文参考訳（メタデータ） (Wed, 31 Dec 2025 16:51:14 GMT)
「ShowUI-π highlights the following architecture: (i) Unified Discrete-Continuous Actions: ShowUI-π casts discrete clicks as drags with negligible movements, and integrates them with continuous drags into a unified modeling. Under this formulation, both action types are represented by a sequence of (x,y,m) triplets, where (x,y) are cursor coordinates and m ∈ {down,up} is the mouse button state. This unified design allows ShowUI-π to handle both drag and click tasks with a single shared model, adapting without task-specific head selection.」と他のGUI Agentとはデータの扱い方が異なるフレームワークの提案。
プロジェクトサイトはShowUI-π: Flow-based Generative Models as GUI Dexterous Hands

コメントを残す

コメントを残す コメントをキャンセル