A look at the future/present
Andrej Karpathy:βThree days ago I left autoresearch tuning nanochat for ~2 days on depth=12 model. It found ~20 changes that improved the validation loss. I tested these changes yesterday and all of them were additive and transferred to larger (depth=24) models. Stacking up all of these changes,
View on X β
If true, this would be the first of @EpochAIResearch's Frontier Math open problems to be resolved by AI.
"The result emerged from a single GPT-5.4 Pro run and was subsequently refined into Lean with GPT-5.4 XHigh which ran for a few hours."
spicylemonade:βWe believe we have fully resolved, in Lean and python, one of @EpochAIResearch Frontier Math open problems: a Ramsey-style problem on hypergraphs.
The result emerged from a single GPT-5.4 Pro run and was subsequently refined into Lean with GPT-5.4 XHigh which ran for a few
View on X β
So excited to work together! I have a feeling it's going to be a productive summer :)
Acer:βI guess now is as good a time as any to announce that I shall be joining the AI for Science team at @OpenAI this summer. This has been in the works since January, and I thank @SebastienBubeck and @kevinweil for their personal interest in making this happen.
View on X β