News

[4/2026] Mythos is a step change compared to previous frontier models, kudos to the team.
[2/2026] Opus 4.6 and its predecessor 4.5 achieve frontier performance, especially powering agentic coding, kudos to the team.
[9/2025] Magistral 1.2 achieves frontier performance on reasoning and coding benchmarks.
[6/2025] Magistral is the first Mistral reasoning model. All credits to the amazing team! See here for details.
[6/2025] LlamaRL is the first large-scale RL stack internal to Llama research. Thank you to my close collaborators!
[5/2025] Our new work on scaling RL to unverifiable domains such as long-form data is out!
[4/2025] Llama 4 is out. It is the first major Llama release trained with a large-scale RL stack.
[12/2024] Llama 3.3 is out. Great to have spearheaded the RL stack which produces a highly performant yet cost-efficient open source model.
[7/2024] Great to be part of the project led by my excellent collaborators to benchmark scalable-oversight protocols.
[5/2024] We investigated the importance of on-policy sampling in language model alignment, check it out here!
[5/2024] Four RL papers are accepted at ICML 2024. Thank you for the heavy lifting to my coauthors.
[3/2024] Gemini 1.5 is out.
[12/2023] Gemini is launched. Great to be a core contributor to Gemini, the most powerful multi-modal large language model developed by Google DeepMind. Check out the tech report here.