News
- [2/2026] Opus 4.6 and its predecessor 4.5 achieve frontier performance across the board, especially agentic coding, kudos to the team.
- [9/2025] Magistral 1.2 achieves frontier performance on reasoning and coding benchmarks.
- [6/2025] Magistral is the first Mistral reasoning model. All credits to the amazing team! See here for details.
- [6/2025] LlamaRL is the first large-scale RL stack internal to Llama research. Thank you to my close collaborators!
- [5/2025] Our new work on scaling RL to unverifiable domains such as long-form data is out!
- [4/2025] Llama 4 is out. It is the first major Llama release trained with a large-scale RL stack.
- [12/2024] Llama 3.3 is out. Great to have spearheaded the RL stack which produces a highly performant yet cost-efficient open source model.
- [7/2024] Great to be part of the project led by my excellent collaborators to benchmark scalable-oversight protocols.
- [5/2024] We investigated the importance of on-policy sampling in language model alignment, check it out here!
- [5/2024] Four RL papers are accepted at ICML 2024. Thank you for the heavy lifting to my coauthors.
- [3/2024] Gemini 1.5 is out.
- [12/2023] Gemini is launched. Great to be a core contributor to Gemini, the most powerful multi-modal large language model developed by Google DeepMind. Check out the tech report here.