Mistral Medium 3, Gemini 2.5 Pro preview, Llama-Nemotron, OpenCodeReasoning

先週注目のニュースはMistralのMistral Medium 3のリリース（Medium is the new large. | Mistral AI）。Claude 3.7 sonnetと競合する性能で「The Mistral Medium 3 API is available starting today on Mistral La Plateforme and Amazon Sagemaker, and soon on IBM WatsonX, NVIDIA NIM, Azure AI Foundry, and Google Cloud Vertex. To deploy and customize the model in your environment, please contact us. 」と各社環境での動作が可能な点が重要に思う。

GoogleのGemini 2.5 Proが使用可能になったよう（Gemini Pro – Google DeepMind）でこちらも注目度が高い。NvidiaのLlama-NemotronやOpenCodeReasoning がダウンロード可能になったことも話題になっていた。

各モデルの（第三者の）性能検証はこれからという感じだろうが、本当にニュースが多い。

Llama-Nemotron: Efficient Reasoning Models [105.8]
ヘテロジニアス推論モデルの開族であるLlama-Nemotronシリーズを導入する。サイズはNano(8B)、Super(49B)、Ultra(253B)の3種類。
論文参考訳（メタデータ） (Fri, 02 May 2025 01:35:35 GMT)
リポジトリはnvidia/Llama-3_1-Nemotron-Ultra-253B-v1 · Hugging Face、nvidia/Llama-Nemotron-Post-Training-Dataset · Datasets at Hugging Face

OpenCodeReasoning: Advancing Data Distillation for Competitive Coding [61.2]
教師付き微調整(SFT)データセットを構築し、様々なサイズのモデルで最先端のコーディング能力を実現する。私たちのモデルは、LiveCodeBenchで61.8%、CodeContestsで24.6%を達成するためにSFTのみを使用しており、強化学習でトレーニングされた代替品を上回っています。
論文参考訳（メタデータ） (Wed, 02 Apr 2025 17:50:31 GMT)

コメントを残す

月	火	水	木	金	土	日
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

コメントを残す コメントをキャンセル

コメントを残すコメントをキャンセル