Rebecca Bakels
Swyx hot take: “you should all move to Miami”👏👏👏
Sheel Mohnot
Wealth tax math: one-time ∼$50B windfall for California
∼$4B lost every year from current leavers
And that doesn't include the worst part: future founders who never move here in the first place!
CalTax: The wealth tax initiative proposed in CA would reduce ongoing tax revenue $3.53 billion to $4.49 billion per year, California Tax Foundation research finds. The negative effects on California’s tax base "could be substantial and persistent." https://caltax.org/foundation/reports/Revenue-Implications-of-Billionaire-Tax-Act.pdf
Craft > slop
I love using AI to generate things too but craft is in that last 10% where you manually apply your taste to make something you can be proud of.
Many people never bother.
Where can I wire the seed check?
Robleh: so i blocked youtube on my daughters ipad last week. today i learned she went to Replit and built a tool to watch youtube videos just by pasting the link.
and i'm not even mad.
Matthew Yglesias
Re You shouldn’t steal. You shouldn’t murder health insurance executives. You shouldn’t reflexively side with every country that’s hostile to the United States. This is all incredibly stupid.
GPT 5.5 tomorrow would be the best damn birthday gift I could ever ask for
Sam Altman: @Suolar_ @scaling01 🫡
btw in talking to friends the best framing for how to discuss GPT-Image-2-Thinking taking multiple tens of mins for generation and being able to oneshot QR codes and diagrams and logos and foods and faces..
...is that Image-2 is a new Image model, Image-2-Thinking is a new Image AGENT that basically has search and photoshop as a tool to use in an agent loop that can search and composite and review its own work.
the same way Gemini Flash Vision destroyed benchmarks by introducing an agentic loop for image-to-text, now Image-2-Thinking is doing it for text-to-image.
Hewar: Holy shit, I just switched to the thinking model
Rob Henderson
New luxury belief just dropped
Introducing ChatGPT for Clinicians:
Karan Singhal: Today we’re introducing two big steps for health at OpenAI:
- ChatGPT for Clinicians, a free version of ChatGPT designed for clinical work
- HealthBench Professional, a new benchmark to evaluate real clinician chat tasks
We’re excited about what this can unlock for care. ❤️
A Replit user just launched “Vibe Genomics: Sequencing your Whole Genome at Home”
https://vibe-genomics.replit.app
Useful if you’re inspired by Patrick’s AI genomics experiment but don’t want to share your DNA with a company.
Patrick Collison: I'm lucky enough to have a great doctor and access to excellent Bay Area medical care. I've taken lots of standard screening tests over the years and have tried lots of "health tech" devices and tools.
With all this said, by far the most useful preventative medical advice that
Man between this and CrabTrap from @pedroh96 I am going to have a busy week
Tony Dang: http://x.com/i/article/2046844065690607616
You can now call Replit Agent from Gemini Enterprise.
Google Cloud: Today at #GoogleCloudNext, we're bringing 90+ partner-built agents to you in Gemini Enterprise!
From partners like Adobe, Atlassian, Box, Deloitte, Lovable, Oracle, Palo Alto Networks, Replit, S&P Global, Salesforce, ServiceNow, Workday, and more→ https://goo.gle/4cKU4p5
I want to keep everyone updated on the details of the security investigation.
The team performed an in-depth analysis to search for root causes and to better understand the behavior of the threat actor.
We cast a very wide net, pulling and processing nearly a petabyte of logs of the entire Vercel Network and API, extending well beyond the initial Context[.]ai compromise.
We now understand that the threat actor has been active beyond that startup's compromise. Threat intel points to the distribution of malware to computers in search of valuable tokens like keys to Vercel accounts and other providers.
Once the attacker gets ahold of those keys, our logs show a repeated pattern: rapid and comprehensive API usage, with a focus on enumeration of non-sensitive environment variables.
As a result:
◾We've deepened and widened our collaboration with partners across the industry, like Microsoft, AWS and Wiz, to further protect the broader internet.
◾ We've notified other suspected victims of this threat actor, independent of this event, encouraging them to rotate credentials and adopt best practices.
We've also shipped a bunch more product enhancements. I'm extremely thankful to our team and industry partners for working around the clock. For more details on the ongoing investigation, refer to our security bulletin:
https://vercel.com/kb/bulletin/vercel-april-2026-security-incident
Replit Security Agent making the internet a better place one app review at a time.
Matt Beebe: Ok. Took a few minutes to run a few of my projects through the new @Replit security agent. I may write up a deeper dive later, but some quick thoughts: 🧵
tl;dr: Wow. This is good stuff. Very good stuff. 1/
You don’t have access to Mythos 🫵🤭
Doesn’t mean you can just sit around and wait.
Replit published a whitepaper showing you can get significantly better performance from current gen LLMs (90%+ in some cases) by combining with static analysis tools.
https://securing-ai-generated-code.replit.app
Sci-Fi Archives
The first simulated image of a black hole, calculated with an IBM 7040 computer using 1960 punch cards and hand-plotted by French astrophysicist Jean-Pierre Luminet in 1978.
prince
Replit shipping Auto-Protect is the real signal here: the stack is moving from "build with AI" to "have AI maintain prod while you sleep."
If this category works, app security starts looking a lot more like autopilot than dashboards.
https://x.com/Replit/status/2047020966598078863
Replit ⠕: Keeping your apps secure has always required constant oversight from you.
Replit Auto-Protect now keeps watch over your apps 24x7.
We'll monitor threats, proactively prepare fixes and notify you to apply those fixes, even when you are away.
Nan Wang
This is how you build reliable agents.
- The 10-point checklist covers the stack end-to-end — goal fulfillment, trigger logic, overlap detection.
- The deterministic/latent split is the insight most teams learn the hard way. Use code as guardrails. Let the model handle the emergent stuff.
Garry Tan: http://x.com/i/article/2046866228703363072
Harj Taggar
Sandboxing my openclaw made it more secure but managing API keys and env variables has become a real headache. This seems like the right approach.
Tony Dang: http://x.com/i/article/2046844065690607616
hushhh
was reading Garry Tan's article on “thin harness, fat skills” and this idea stuck with me more than expected
the bottleneck isn’t the model being dumb, it’s how we’re using it
most people (me included tbh) think better models = better results
but the 10x vs 100x difference is actually in the setup
fat harness = too many tools, bloated context, slow + noisy systems
thin harness = just enough to run the loop, fetch data, manage context
the real leverage is in “skills”
reusable processes written in plain language that the model can apply again and again
kinda like functions, but for reasoning
add things like resolvers (load the right info at the right time), keep deterministic stuff separate, and it starts feeling less like prompting and more like building an actual system
idk it lowkey changed how i think about “using ai” vs actually designing with it
Guri Singh
Video editors are going to panic.
The team behind browser-use just open sourced an editor that runs inside Claude Code. You drop raw footage in a folder, type "edit these into a launch video," and it ships final.mp4.
It's called video-use.
→ Cuts "umm," "uh," and dead space at word-level timestamps
→ Auto color grades every segment with custom ffmpeg chains
→ Burns 2-word UPPERCASE subtitles and spawns Manim animations in parallel sub-agents
2.2K stars. Self-evals every cut before you see it.
100% open source.
Link in comments.
Daniel Jeffries
More people will die from suppressing AI than from the imaginary AI apocalypse.
They'll die from restricting safe self-driving cars that are 90% better drivers than people who kill 1.5 million people and injure 50 million more every year.
They'll die from the vaccines and cures that never get created.
They'll die from all the myriad of helpful inventions that never get created by geniuses in a datacenter.
They'll die from preventable diseases that they could have asked their chat bots about so they were better informed when they went to see their doctors but who couldn't ask because short-sighted legislators made it so the chat bots had to refuse to answer.
They'll die from the slower economy that stifles robot driven factories over wildly overblown jobs apocalypse fears which will mean we never get a vast array of new and more affordable goods.
They'll die from the cheaper solar panels and batteries that would get made by those automated factories which would slow climate damage and provide cheap energy to undeserved areas.
They'll die from the super smart tele-AI doctors that never get deployed to remote areas.
And they'll die as fanatics from the stop AI movement radicalize their followers to shoot people or throw firebombs.
Max Tegmark: Senator @BernieSanders has invited me and three other AI researchers to a public panel on AI existential risk & international cooperation at the U.S. Capitol 7pm Wednesday April 29th. RSVP here to join us for this important conversation: https://forms.office.com/Pages/ResponsePage.aspx?id=yQ08CVqFVEaBVDu8JMTbft48FJJZ68hMkT0BJFfVBUJUQ0VPS1BJQjFUWlBWNjNHQTI3TThWOE9OMi4u
Daniel Jeffries
"The pessimistic professor...can predict anything he wants, for if his prophecies don't come true now, just wait: failure could be just around the corner, or else his voice of reason has prevented the worst. The prophets of doom sound oh so profound, whatever they spout."
- Human Kind, Rutger Bregman
Yann LeCun
Re Actually, AI already saves lives.
In several countries, mammograms are examined by AI and radiologists. Reliability is improved.
In the EU, every car sold must be equipped with Automatic Emergency Braking Systems. That's AI. They reduce frontal collisions by 40%.
Modern MRI machines are equipped with AI technology that reduces the time of imaging by 4x or more. You can now get a full-body MRI in 40 minutes for about $1000. Reduced time -> reduced cost -> more/earlier detection.
And that's not counting the progress in medicine enabled by modern AI, including Nobel Prize-winning protein structure prediction.
I guess the weird thing these days is I asked my claw to do this and it did it in about 2 min and then I could tune to exactly what I wanted iteratively in the 15 to 20 minutes after
There are certain things that are much better as just in time software
Om Patel: THIS GUY JUST GAVE CLAUDE CODE THE ABILITY TO WATCH VIDEOS
claude code can't natively see video or hear audio
which means every time you want it to "look at this video" you have to screenshot frames manually and transcribe the audio yourself
so this guy built a plugin that
Timothy B. Lee
Not only are these people vile, but this is terrible journalism. There are good arguments against shoplifting, and the Times couldn't be bothered to interview anyone who could articulate them.
Jacob Effron
Always enjoy getting to chat with @swyx on our annual cross-episode with @latentspacepod on the state of AI. We hit on what’s shifted, what surprised us and what’s next.
We covered:
▪️ Whether AI infrastructure has finally stabilized
▪️ Implications of agents buying developer tools
▪️ The AI coding wars
▪️ The foundation model vibe shift
▪️ Why Swyx reversed his view on open models
▪️ When to train your own model
▪️ What's top of mind for the best AI engineers
YouTube: https://youtu.be/A_7WafI9bhE
Spotify: https://bit.ly/3QHcCix
Apple: https://bit.ly/4eJERaa
0:00 Intro
1:17 What the Top AI Engineers Are Thinking About
2:13 Has AI Infra Finally Stabilized?
6:39 When Does Doing RL In-House Make Sense?
11:26 Why Selling Dev Tools to Agents is Different
17:18 AI Coding Wars
29:04 Consumer AI Plateau
30:22 Codex vs Claude Code
44:52 Future of Open Models
Daractenus
In the past 55 days, the president of the US has averaged about 20 social media posts a day, ranging from Bruce Lee videos to AI content of him beating up Canadians. To highlight this man's increasing descent into madness, I've put together a timeline of his Iran War posts.🧵
Rohan Paul
Yann LeCun (@ylecun ): Sillion Valley is "completely LLM-pilled"
"In the end, if you’re interested in building systems that have the intelligence of, let’s say, a cat, let alone humans, you need common sense. You need the ability to predict the consequences of your actions.
You need the ability to plan. You need the ability to reason.
And you’re not going to get this with VLA, VLM, or LLM or any generative architectures."
---
From 'AI House Davos" YT channel (full link in comment)
Here's @rywiggs (VP Mercury) walking through how he built his Claude Code second brain at work:
"I pulled together almost 5M words from my five years at Mercury and built that as a knowledge base.
At the start of everyday, I get a brief of what's on my calendar, Linear, Slack and at the end of the day it summarizes [everything]."
📌 Watch him talk more about it here: https://youtu.be/KzqpK1uCczw?si=LMiu7tLQbpovz6yX&t=805
Peter Yang: "In the 1950s, we met users at a bank. In the 70s, an ATM. In the 90s and 2000s, a website and a mobile app. Today, it's APIs and MCPs."
Here's my new episode with @rywiggs (Mercury's VP of Product) where he shares:
✅ How to build great APIs + MCPs for agents
✅ How to create
Peter Yang
Here's @rywiggs (VP Mercury) walking through how he built his Claude Code second brain at work:
"I pulled together almost 5M words from my five years at Mercury and built that as a knowledge base.
At the start of everyday, I get a brief of what's on my calendar, Linear, Slack and at the end of the day it summarizes [everything]."
📌 Watch him talk more about it here: https://youtu.be/KzqpK1uCczw?si=LMiu7tLQbpovz6yX&t=805
Peter Yang: "In the 1950s, we met users at a bank. In the 70s, an ATM. In the 90s and 2000s, a website and a mobile app. Today, it's APIs and MCPs."
Here's my new episode with @rywiggs (Mercury's VP of Product) where he shares:
✅ How to build great APIs + MCPs for agents
✅ How to create
GStack /codex skill calls now fully explain what is going on. The non-verbal 200IQ Codex is now properly interpreted for us ADHD CEO types
Andrew Weinstein
The most dangerous thing in Washington right now isn't just the corruption—it's the retaliation against the people who expose it. When government power is used in ways that appear designed to intimidate a journalist for revealing potential abuses, the message is unmistakable: be quiet or be targeted. That’s how fear replaces truth. That’s how a free press is pushed toward silence. And that’s how democracies start to collapse.
Ravid Shwartz Ziv
Do you want to know why LLMs feel sharp on surface semantics but hollow on the fine-grained stuff?
“From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning”
Come by the poster tomorrow at #ICLR2026 - Fri, Apr 24, 6:30–9:00 AM PDT (10:30AM local time!), Pavilion 3, P3-#1017 to know the answer!
We think it's because they overcompress. Humans keep "inefficient" concepts due to nuance. LLMs discard those for cleaner, information-theoretic compression. Different objectives yield different representations.
Using an Information Bottleneck lens across 40+ models, we also found that encoders align with humans better than decoders many times their size, and that during training, semantic processing migrates from deep layers to mid-network as the model discovers sparser encodings.
@ChenShani2 Liron Soffer @jurafsky @ylecun
Video editing is a huge market - creators are willing to pay hundreds to get a great clip.
That's why I'm excited about products like Cappy from @trymirage, the first ai video editor that you can simply text to create or edit a video.
It also adds engaging captions and voiceovers. Below's a clip I made of a corgi as an example.
Check out Cappy here: https://captions.ai/features/text-to-edit?utm_source=twitter&utm_medium=influencer&utm_campaign=cappy_april_2026&utm_content=standard&utm_ad_set=tech&utm_ad=peteryang
Mirage: This is the first AI video editor you can text. Meet Cappy.
Just text Cappy like a friend… if your friend happened to be a really good editor.
No apps to download. No learning curve.
Send an idea, some clips, or just say hi.
Then chat until it’s exactly right.
Kenneth Roth
The Trump administration suppresses a study showing that Covid vaccines cut by half visits to emergency rooms and admissions to the hospital. That doesn't mesh with their anti-vaccine ideology. https://trib.al/oAvHSLr
Did a video on GStack to introduce you to the first few skills you might use, /office-hours and /design-shotgun
Y Combinator: GStack is an open-source toolkit built by YC President & CEO @garrytan that turns Claude Code into an AI engineering team — with skills for office hours, design, code review, QA, and browser testing.
In this video, Garry walks through how GStack works, starting with Office
Luong NGUYEN
Re @ycombinator @garrytan gstack is a great toolkit,
> it is my daily tool for quite sometime now
> I have written about that
https://x.com/luongnv89/status/2038386097563001284?s=20
Luong NGUYEN: http://x.com/i/article/2036187606866862082
Derek Thompson
In Mere Christianity, CS Lewis has an awesome opening riff about how most people know the difference between right and wrong, but they justify acting immorally by appealing to "special exception." They know they shouldn't hit a friend, but what if that friend was being so mean? They know they shouldn't steal a seat a bus, but what if that person got up and created a moment's confusion and then the seat was up for grabs? Etc.
When I read this section, I thought a lot about contemporary politics and the way that people justify their politics, not by appealing to higher principles, but rather by appealing to "special exception" to argue that their admitted indecency is justifiable in context.
A lot of MAGA vice is justified by special exception. Trump's defenders rarely defend his crookedness directly. They don't say "it's wonderful to use trade policy to enrich the Oval Office, it's really awesome." They say: Well, look, it doesn't really matter, because the left is so dangerous, Biden maybe did something similar 3 years ago, Democrats would do the same in power, and so forth.
I heard something similar in that NYT conversation everybody's talking about. You even see it in the headline: ‘The Rich Don’t Play by the Rules. So Why Should I?’ Why, hello, special exception. When you start arguing that stealing food and French paintings is justifiable in the context of political protest in an age of prevailing distrust, you're similarly not arguing *for* any kind of a universal principle. Nobody actually wants 300 million people stealing fruit from the grocery store. Nobody actually wants every Louvre visitor trying to rip a Manet off the walls. These virtues don't scale. (Because they're not virtuous!)
Sap that I am, I want us to get to a place where politics is about fighting for what is right and decent, not about justifying what sort of indecent behavior might be somewhat understandable or technically justifiable given the other side's vice or the prevailing levels of indecency. The point is to build the kind of goodness that scales.
https://www.nytimes.com/2026/04/22/opinion/shoplifting-political-protest-microlooting-whole-foods.html
California will lose 15.7% of its public school students by 2031. Nearly 3x the national rate. Idaho and Florida are growing.
This isn't an inevitability — it'a consequence of what happens when housing costs too much and schools stop working.
https://gli.st/2rmnc27p
Phosphen
This 2 hour lecture by Yann LeCun (Turing Award winner) will teach you why the next trillion dollar AI company won't be built on LLMs.
He trashes the $100 Billion LLM race, attacks Musk and Amodei, declares scaling dead.
Bookmark & watch tonight after work, skip to 7:00.
we love seeing our users win.
we want to give you the best tools, lots of compute, and watch you do the magic.
Images 2.0 really got over some important qualitative threshold for me that I didn't know existed.
Justin Gordon
If you’re a Democrat who isn’t calling out your own side when CA is failing in numerous categories, you don’t belong in office. Mahan is the only Democrat willing to stand up to the party when it’s wrong and course correct when necessary.
Brb cat on fire
Re @garrytan
Honestly, im kind of surprised the gstack thing is actually decent. Prompts can take you pretty fair with the current SOTA models.
I was quite skeptical but after watching the yc video and willing to give it shot, it ran on one of my existing weekend projects and it came up with some good suggestions and structure.
It is mostly optimized for claude code while im using opencode but its a minor adjustment. Though sometimes the plan doesnt fit into the selection window in terminal.
Going to test this later on some projects ive been sitting on.
Gregor Zunic
http://x.com/i/article/2047356771229134848
Kulveer
Re @sama spoke to the Spring batch the other day.
Met him back when we were both founders, and he stood out then too.
Now we're both back at @ycombinator two decades later, helping the next generation build.
Kane 謝凱堯
The funniest part of @NewYorker writer Jia Tolentino running a "shoplifting is good" story is also learning that her parents were indicted for human trafficking and money laundering.
New York Post: Anti-capitalist New Yorker writer brags she stole from Whole Foods 'on several occasions' in NYT podcast https://trib.al/zh6jiuh
Brandon Veiseh
Super excited to finally announce our raise! (We're hiring)
We'll now be able to move much faster on these fronts:
- increase our infra+capacity to serve a huge growth in demand
- build out the team as we continue to develop new security agents for new verticals (cloud/network/remediations)
- expand our applied research into offensive security LLMs
Read more here on how we are thinking about the future of autonomous security agents and how MindFort is leading the push to a future where all companies are protected by teams of agents, inside and out.
Huge thanks to @ycombinator @Soma_Capital and all our other early investors and angels for believing in us early.
If you want to join us in building the future, send me a message, we are hiring across all roles in: Growth/Sales/Engineering/Research!
http://mindfort.ai/blog/seed-anno…
cc @AkulGupta30 @mindfort
Can 5.5 drop soon it's distracting to wait
Resolvers in GStack for the win!
ByteCrafter: @ycombinator @garrytan been running skills and subagents for about 3 weeks in claude code. the big unlock wasn't the individual skills, it was the subagent picking which skill to load so context doesn't blow up on a 400 line task. single prompt runs doing what used to take 4-5 rounds.
Turns out migration hardening matters a lot when you have 50k markdown files in your brain repo
ClaudeDevs
Over the past month, some of you reported Claude Code's quality had slipped. We investigated, and published a post-mortem on the three issues we found.
All are fixed in v2.1.116+ and we’ve reset usage limits for all subscribers.
World Labs
2 Weeks. New Tools. Infinite Worlds🚀
The World Jam is LIVE. Build the future of interactive 3D with Marble 1.1 + Spark LoD.
Join our Discord to start building.
More info below 👇
http://x.com/i/article/2047054312472207360
I’ve been using GPT-5.5 for the last few weeks.
It’s a MASSIVE leap forward.
But the weird thing is: for 99% of users, it probably won’t matter.
And there's one BIG, incredibly frustrating regression.
Read more in my review:
Matt Shumer: http://x.com/i/article/2047054312472207360
Derya Unutmaz, MD
I’d been part of OpenAI early tester group for GPT-5.5. I believe with GPT-5.5 Pro we reached another inflection point-comparable to the original release of o1-preview & then with 5.0 Pro, I had felt. It’s that feeling of crossing a milestone threshold that pushes us to new era🔥
Pietro Schirano
Re One day while testing GPT-5.5, I had my first taste of AGI.
We had a branch with hundreds of visual and front-end changes, plus complex refactors.
At the same time, main had changed a lot too.
Conflicts everywhere.
Internet Archive
This study is an important part of Vanishing Culture, the new book OUT TODAY from @internetarchive. 🕳️
Read for free or purchase in print ➡️ https://archive.org/details/vanishing-culture-2026
Sawood Alam: A @pewresearch study found that 38% of webpages from a decade ago and 25% of pages sampled across the decade are now inaccessible; @internetarchive's analysis shows that the @waybackmachine has rescued roughly 15% of those otherwise dead pages.
https://blog.archive.org/2026/04/23/gone-but-not-forgotten-recovering-the-dead-web/
#linkrot
Also, a ton of new Codex features coming soon! Fun little bundle w/the new model.
looks like new Pareto frontiers across everything:
- Context: 400K context in Codex and a 1M in API
- API Pricing: $5/m input and $30/m output tokens.
- Codex improved its own inference speed 20% lol
- First generation co-designed with GB200 and GB300 NVL72
- 82.7% on Terminal-Bench 2.0
- 73.1% on Expert-SWE (new internal eval).
- 58.6% on SWE-Bench Pro
- 84.9% on GDPval (knowledge work ~solved?).
- 98.0% on Tau2-bench Telecom (support).
- 80.5% on BixBench (bioinformatics and data analysis)
OpenAI: Introducing GPT-5.5
A new class of intelligence for real work and powering agents, built to understand complex goals, use tools, check its work, and carry more tasks through to completion. It marks a new way of getting computer work done.
Now available in ChatGPT and Codex.
1. We believe in iterative deployment; although GPT-5.5 is already a smart model, we expect rapid improvements. Iterative deployment is a big part of our safety strategy; we believe the world will be best equipped to win at the team sport of AI resilience this way.
2. We believe in democratization. We want people to be able to use lots of AI; we aim to have the most efficient models, the most efficient inference stack, and the most compute. We want our users to have access to the best technology and for everyone to have equal opportunity. We have been tracking cybersecurity as a preparedness category for a long time, and have built mitigations we believe in that enable us to make capable models broadly available.
3. We love you and we want you to win. We want to be a platform for every company, scientist, entrepreneur, and person. (My whole career has largely been about the magic of startups, and I think we are about to see that magic at hyperscale.)
Noam Brown
I'm a manager at @OpenAI, but with GPT-5.5 I'm a more effective IC than I've ever been. I can now write CUDA kernels like a pro. I can rely on it to run my research experiments. And we know how to make it much more powerful from here.
Andrew Ambrosino
New in the Codex app:
- GPT-5.5
- Browser control
- Sheets & Slides
- Docs & PDFs
- OS-wide dictation
- Auto-review mode
Enjoy!
GPT-5.5 is a new class of intelligence.
This intelligence makes it intuitive to use; it completes challenging tasks with little micromanagement. Also very token efficient, and runs with low latency and at scale.
A real step toward a new way of getting computer work done.
OpenAI: Introducing GPT-5.5
A new class of intelligence for real work and powering agents, built to understand complex goals, use tools, check its work, and carry more tasks through to completion. It marks a new way of getting computer work done.
Now available in ChatGPT and Codex.
Really excellent work by the inference team to serve this model so efficiently!
To a significant degree, we have to become an AI inference company now.
important (and very jakub-coded) jakub quote:
tae kim: OpenAI Unveils GPT-5.5. Company Says Expect a Faster Model Release Pace
👀 OpenAI: "We see pretty significant improvements in the short term, extremely significant improvements in the medium term" "I would say the last few years have been surprisingly slow.”"
roon
there are early signs of 5.5 being a competent ai research partner. several researchers let 5.5 run variations of experiments overnight given only a high level algorithmic idea, wake up to find completed sweep dashboards and samples, never having touched code or a terminal at all
Codex + 5.5 is incredible for the full spectrum of computer use. No longer just for coders, but for anyone who does computer work (including creating spreadsheets, slides, etc).
OpenAI Developers: With GPT-5.5, Codex now gets more of the job done across the browser, files, docs, and your computer.
We've expanded browser use so Codex can interact with web apps, and test flows, click through pages, capture screenshots, and iterate on what it sees until it completes the
embers
Sebastien Bubeck: GPT-5.5, not fully saturating the TikZ unicorn test yet but getting awfully close ...
(yes this is actual TikZ code, I personally find it so unbelievable that I'm putting the code below for anyone to verify for themself)
We tried a new thing with NVIDIA to roll out Codex across a whole company and it was awesome to see it work.
Let us know if you'd like to do it at your company!
we're rolling codex out to whole companies/enterprises. ping me gdb@openai.com if of interest!
Sam Altman: We tried a new thing with NVIDIA to roll out Codex across a whole company and it was awesome to see it work.
Let us know if you'd like to do it at your company!
Jeffrey Wang
A huge issue with subagents is compounding of errors, even with frontier models.
With Exa Highlights, you can treat Exa as a full web subagent, but there's no hallucination (we only use text from the actual webpage), minimal latency (we compute the text in tens of milliseconds), and we also save you a ton of money.
Exa: Exa now reduces input tokens for web agents by 96%.
We trained a text extraction "Highlights" model to dynamically select only the most relevant tokens from a webpage, for a given query. Only 500 tokens of highlights are needed to match the RAG performance of a full 10K token
"don't retweet this, don't retweet this, don't retweet this..."
ah fuck it, life imitates art.
Andon Labs: In Vending-Bench Arena (the multiplayer version of Vending-Bench with competition dynamics), GPT-5.5 actually beats Opus 4.7.
Opus 4.7 showed similar behavior to Opus 4.6: lying to suppliers and stiffing customers on refunds. GPT-5.5's tactics were clean, and it still won.
Congrats to @AlecRad @Luke_Metz and @soumithchintala!
It's really cool to see work produced by a group of 20-somethings without a single PhD between them winning this award.
I hope to see these three collaborate again one day :)
ICLR 2026: We are honored to announce the Test of Time awards for #ICLR2026 🏆 This award recognizes papers published 10 years ago at ICLR 2016 that have had a lasting impact on the field:
https://blog.iclr.cc/2026/04/22/announcing-the-test-of-time-awards-from-iclr-2016/
Test your app on iOS and Android devices with one-click.
jordwalke: Introducing real iOS and Android device testing, right from your browser.
If you've ever:
👉built a @Replit mobile app and had no way to test it on a phone
👉 been solely on Windows/Android and don't touch iOS
…this is for you!
Browser Use
The new paradigm: dialects
MCP -> CLI -> custom harness -> dialect (skill + helper files)
Gregor Zunic: http://x.com/i/article/2047356771229134848
Sherry Jiang
stoked that we'll have the legend @thsottiaux speaking at @aiDotEngineer singapore!
@thsottiaux is the engineering lead on @OpenAI 's codex app, and famously resets the tokens.
can't wait to have you here!
@swyx @agrimsingh @aimuggle @unprofeshme
Republicans against Trump
Trump: “I took a lot of heat for saying drugs were going down 500%, 600%, 700%. But we also say sometimes 50%, 60%, it’s a different kind of calculation, and people understand it better.”
The dumbest president ever.
Re ai researchers rocket scientists
🤝
managing burn
https://x.com/shyamalanadkat/status/2047419602267988344?s=46
shyamal: researchers will start giving an ai a vague hunch, compute budget, and access to tools; it will return ablations, failed runs, plots, counterexamples, revised hypothesis, etc. the unit of progress shifts to experiment throughput. and then ai research becomes more about
Two years since Replit left SF.
10x valuation and 200x ARR later we’ve taken over much of the old IBM campus in Foster City and we’re still expanding.
There’s something poetic about it: IBM helped create the industry. We’re helping reinvent how people create software.
San Francisco is a beautiful city. But it wasn’t the right place for us to focus and rebuild. To transform both our company, and the future of programming itself.
We went from worrying about employee safety to designing a campus people actually want to spend time in.
Better problems.
Foster City gave us space both literally and figuratively. Open horizons, fewer distractions, and more room to think long-term.
It’s a hidden gem in Silicon Valley.
And we’re just getting started.
Amjad Masad: Replit left San Francisco for Foster City.
The "why" we're leaving is boring, sad, and predictable (crime, dysfunction, etc), so instead let me tell you why we chose Foster City.
Foster City embodies the American post-war optimism and the long-lost California pro-growth
Michael Bronstein @ICLR2026
illustrious researchers interested in MD @ylecun
It's odd to be living through what feels like one of the most critical periods in human history and to feel all of the weight of it from the inside.
Manny Bernabe
"Industry leaders […] like Replit have chosen Gemini 3.1 Pro." Thomas Kurian (@ThomasOrTK), CEO Google Cloud, at the Google Cloud Next opening keynote. 🔥
Huge thanks to the @googlecloud team for a phenomenal event and the recognition.
Looking forward to empowering the next billion builders together. 🤝 🚀
Ravid Shwartz Ziv
I have mixed feelings about it. All this great group of researchers, who are now leading the field, "own" their success and name recognition to the openness (in code and publication) of the last decade in Google Brain/FAIR/OpenAI. I wish that now, when they have so much power, they would push more for open research
Fazl Barez @ICLR 🇧🇷: Alex Radford (@AlecRad) just won one of the test of time awards at #ICLR 2026
So great to see this! Proof that impactful work doesn’t have to depend on having a PhD 🙌🙌
So hyped for @demishassabis visiting Y Combinator tomorrow! What a legend!
THANK YOU DEX
Open Source is amazing.
dex: anyways I’ll put my money where my mouth is and drop a pr today
M. Nolan Gray 🥑
I don't think the average Californian is registered that, if the state can't make itself affordable to working- and middle-class families in the next few years, there will be mass school closures.
Garry Tan: California will lose 15.7% of its public school students by 2031. Nearly 3x the national rate. Idaho and Florida are growing.
This isn't an inevitability — it'a consequence of what happens when housing costs too much and schools stop working.
https://gli.st/2rmnc27p
Samuel Spitz
Replit being in Foster City is why you never see me at VC happy hours anymore
Amjad Masad: Two years since Replit left SF.
10x valuation and 200x ARR later we’ve taken over much of the old IBM campus in Foster City and we’re still expanding.
There’s something poetic about it: IBM helped create the industry. We’re helping reinvent how people create software.
San
Niall Stanage
“In the end, the Party would announce that two and two made five, and you would have to believe it.” — George Orwell, “1984”
Aaron Rupar: RFK Jr: "A Democratic senator claimed it's mathematically impossible to have a drug drop by 600%. I said, 'Well, if the drug was $100 and it raises to $600, that would be a 600% rise. If it drops from $600 to $100, that's a 600% savings.'"
Trump: "Right"
Foster City review:
Adam Ballai: @amasad Foster City is incredibly underrated.
Out of all the Bay Area campuses, it has the most value for employees of any age. The team truly thought deeply about this and bet well on the future.
We love being able to have quiet fresh air for 1-1s, a run 🏃🏻♂️ after work with your team
Bill Madden
Trump completely passed out, reawakened, then passed out again during today's pressor in the Oval Office. 😂🤣👇