{"id":6436,"date":"2025-03-20T05:24:00","date_gmt":"2025-03-19T20:24:00","guid":{"rendered":"https:\/\/devneko.jp\/wordpress\/?p=6436"},"modified":"2025-03-20T05:24:00","modified_gmt":"2025-03-19T20:24:00","slug":"plangen-towards-unified-layout-planning-and-image-generation-in-auto-regressive-vision-language-models","status":"publish","type":"post","link":"https:\/\/devneko.jp\/wordpress\/?p=6436","title":{"rendered":"PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models\u00a0"},"content":{"rendered":"\n<ul class=\"wp-block-list\">\n<li><strong>PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models\u00a0<\/strong>[10.3]<br>\u753b\u50cf\u3092\u751f\u6210\u3059\u308b\u524d\u306b\u7a7a\u9593\u914d\u7f6e\u6761\u4ef6\u3092\u4e8b\u524d\u306b\u8a08\u753b\u3067\u304d\u308b\u7d71\u5408\u30ec\u30a4\u30a2\u30a6\u30c8\u8a08\u753b\u3068\u753b\u50cf\u751f\u6210\u30e2\u30c7\u30ebPlanGen\u3092\u63d0\u6848\u3059\u308b\u3002 PlanGen\u306f\u3001\u30ed\u30fc\u30ab\u30eb\u30ad\u30e3\u30d7\u30b7\u30e7\u30f3\u3068\u30d0\u30a6\u30f3\u30c7\u30a3\u30f3\u30b0\u30dc\u30c3\u30af\u30b9\u5ea7\u6a19\u306e\u7279\u5225\u306a\u30a8\u30f3\u30b3\u30fc\u30c7\u30a3\u30f3\u30b0\u3092\u5fc5\u8981\u3068\u305b\u305a\u306b\u3001\u30ec\u30a4\u30a2\u30a6\u30c8\u6761\u4ef6\u3092\u30b3\u30f3\u30c6\u30ad\u30b9\u30c8\u3068\u3057\u3066\u30e2\u30c7\u30eb\u306b\u7d71\u5408\u3059\u308b\u3002 \u3055\u3089\u306b\u3001\u3088\u304f\u8a2d\u8a08\u3055\u308c\u305f\u30e2\u30c7\u30ea\u30f3\u30b0\u306e\u304a\u304b\u3052\u3067\u3001PlanGen\u306f\u30ec\u30a4\u30a2\u30a6\u30c8\u8a98\u5c0e\u306e\u753b\u50cf\u64cd\u4f5c\u306b\u30b7\u30fc\u30e0\u30ec\u30b9\u306b\u62e1\u5f35\u3067\u304d\u308b\u3002<br><a href=\"http:\/\/arxiv.org\/abs\/2503.10127v1\">\u8ad6\u6587<\/a>\u00a0\u00a0<a href=\"https:\/\/fugumt.com\/fugumt\/paper_check\/2503.10127v1\">\u53c2\u8003\u8a33\uff08\u30e1\u30bf\u30c7\u30fc\u30bf\uff09<\/a>\u00a0 \u00a0(Thu, 13 Mar 2025 07:37:09 GMT)<\/li>\n\n\n\n<li>\u753b\u50cf\u751f\u6210\u306e\u524d\u306b\u30ec\u30a4\u30a2\u30a6\u30c8\u8a08\u753b\u53ef\u80fd\u306a\u30e2\u30c7\u30eb\u306e\u63d0\u6848\u3002\u30b3\u30f3\u30c6\u30ad\u30b9\u30c8\u3068\u3057\u3066\u30ec\u30a4\u30a2\u30a6\u30c8\u3092\u53d7\u3051\u53d6\u308b\u3053\u3068\u304c\u53ef\u80fd\u300cPlanGen can complete layout planning and layout-to-image generation in a unified model. Just like thinking about what object each area should be before generating an image, such an explicit planning process allows the model to enjoy more powerful image generation capabilities.\u300d\u3002<\/li>\n\n\n\n<li>\u30ea\u30dd\u30b8\u30c8\u30ea\u306f<a href=\"https:\/\/360cvgroup.github.io\/PlanGen\/\">PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[615],"class_list":["post-6436","post","type-post","status-publish","format-standard","hentry","category-arxiv","tag-615"],"_links":{"self":[{"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/6436","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=6436"}],"version-history":[{"count":0,"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/6436\/revisions"}],"wp:attachment":[{"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=6436"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=6436"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=6436"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}