{"id":7043,"date":"2025-07-07T05:19:00","date_gmt":"2025-07-06T20:19:00","guid":{"rendered":"https:\/\/devneko.jp\/wordpress\/?p=7043"},"modified":"2025-07-06T09:26:32","modified_gmt":"2025-07-06T00:26:32","slug":"ledom-an-open-and-fundamental-reverse-language-model","status":"publish","type":"post","link":"https:\/\/devneko.jp\/wordpress\/?p=7043","title":{"rendered":"LEDOM: An Open and Fundamental Reverse Language Model"},"content":{"rendered":"\n<ul class=\"wp-block-list\">\n<li><strong>LEDOM: An Open and Fundamental Reverse Language Model\u00a0<\/strong>[100.5]<br>We introduce LEDOM, the first purely reverse language model, trained autoregressively on 435B tokens in 2B and 7B parameter variants. This paper presents the reverse language model as a foundation model across general tasks, accompanied by a set of interesting examples and insights. It also introduces Reverse Reward, a new application built on LEDOM.<br><a href=\"http:\/\/arxiv.org\/abs\/2507.01335v1\">Paper<\/a>\u00a0\u00a0<a href=\"https:\/\/fugumt.com\/fugumt\/paper_check\/2507.01335v1\">Reference translation (metadata)<\/a>\u00a0 \u00a0(Wed, 02 Jul 2025 03:52:00 GMT)<\/li>\n\n\n\n<li>\u300cWe introduce LEDOM, the first purely reverse language model, trained autoregressively on 435B tokens with 2B and 7B parameter variants, which processes sequences in reverse temporal order through previous token 
prediction.\u300d This is a reverse language model, an interesting idea.<\/li>\n\n\n\n<li>The statement \u300cGiven a known answer and the corresponding supporting reasons, LEDOM can produce natural, well-formed questions. It is helpful for automatically creating QA datasets and educational content, where starting from answers or known concepts is often more practical than designing questions manually.\u300d is also interesting, and \u300cWe propose Reverse reward, a novel strategy that uses LEDOM to guide forward model outputs via reranking, leading to consistent performance improvements in mathematical reasoning.\u300d suggests that the approach is effective for some tasks.<\/li>\n\n\n\n<li>Bidirectionality can be effective, as the B in BERT shows, and my impression is that a reverse model looks useful as a double check.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[223],"class_list":["post-7043","post","type-post","status-publish","format-standard","hentry","category-arxiv","tag-llm"],"_links":{"self":[{"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/7043","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=7043"
}],"version-history":[{"count":1,"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/7043\/revisions"}],"predecessor-version":[{"id":7044,"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/7043\/revisions\/7044"}],"wp:attachment":[{"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=7043"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=7043"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/devneko.jp\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=7043"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}