← 2026-05-10

Daily Edition

2026-05-11

2026-05-12 →

AI Builders 日报 — 5月11日

追踪 AI 领域真正在做事的人,而不是空谈者。

今日思考

过去几年,AI 开发者把大量精力放在模型本身。但随着 GPT-5.5、Claude Opus 4.7、Replit Parallel Agents 这一代产品成熟,一个新共识正在形成:模型已不再是瓶颈,工程能力才是。Gopi Krishna 说得很直接:"I've built skills and commands around it — the model may fail, but the system will ensure the outputs don't." Meta-meta-prompting(用 AI 来生成和优化 AI 提示词)正在成为 2026 年最重要的技能。与此同时,Sam Altman 宣布 OpenAI 成立Deployment Company,$40亿初始资金、150名前线工程师——这标志着基础模型之战已基本结束,落地和集成正在成为真正的商业战场。


产品与发布

Replit Parallel Agents

Replit 推出并行 Agent 功能,最多可同时运行 10 个 Agent,每个 Agent 有独立计算环境,完成后自动合并。amasad 强调,真正的突破不是"多"而是"编排":Orchestrate correctly, merge back seamlessly,才能让项目进度提升 10 倍。faviconx.com

OpenAI Daybreak

OpenAI 公布 Daybreak 计划,将最强大的模型、Codex 和安全合作伙伴整合,为网络防御团队提供前沿 AI 能力。gdb 表示这是"防御性加速伞形努力"(defensive acceleration umbrella effort)。faviconx.com

OpenAI Deployment Company

OpenAI 成立部署公司,首期投资 $40亿,19 家合作伙伴,150名前线部署工程师,并收购了 Tomoro 团队。swyx 调侃:"连服务端渲染都顾不上做好了"(意指他们太忙),但也承认这是英国 AI 的重大胜利。faviconx.com

Consensus 融资 $3000万

AI 研究平台 Consensus 宣布获 $3000万新融资,致力于打造"研究领域的 AI 操作系统",每月有 250万研究者在平台开始工作。faviconx.com


观点与判断

amasad(Replit 联合创始人)

  • "如果你只是用过 Agent,还没有体验过大规模并行 Agent,你就还没感受到 AI 的进步" 单纯运行多个 Agent 很爽,但真正的突破在于正确的编排和顺畅合并。faviconx.com

Garry Tan(Y Combinator CEO)

  • 设计的最高形式是把人类的痛苦转化 "The highest and most important form of design is actually pure transmutation of human pain and suffering." 这是他 5 月 11 日的原创帖,没有引用,纯个人观点。faviconx.com

  • 不要坐别人的无马马车,造你自己的跑车 Garry Tan 转发了关于"个人 AI"的长帖,并加了自己的判断:Personal AI 时代已来,每个人都应该打造属于自己的 AI 系统,而不是在别人的 ChatGPT 里填空 prompt。faviconx.com

  • 模型已死,AI 工程的黄金时代已来 Garry Tan 转发 Gopi Krishna 的分析:自去年 11 月 Opus 发布后,模型能力已不是瓶颈,回到工程问题。他本人作为 solo founder,用 Claude Code + meta-meta-prompting 构建了完整的 GTM 系统——自动生成 Instagram 帖子、Google Ads 自动化、电邮。核心结论:"learn to create the harness around the model. Model may fail, but the system will ensure the outputs don't." faviconx.com

Sam Altman(OpenAI CEO)

  • "你会把它叫超级 App 吗?" Sam Altman 转发了一个人在 Codex 里完成所有工作的帖子后,发了一句意味深长的反问。联想到他同时宣布的 Deployment Company,这或许暗示 OpenAI 正在从模型公司向平台公司转型。faviconx.com

  • 新的 ChatGPT 模型+人格+个性化,感觉像是新东西 Sam Altman 认为 ChatGPT 最新的模型、性格和个性化组合已跨越某个阈值,是一个质变。faviconx.com

karpathy(AI 教育家、OpenAI 创始成员)

  • 让 AI 输出 HTML 而非 Markdown karpathy 建议:当你想让 AI 输出时,在问题结尾加一句"structure your response as HTML",然后直接在浏览器打开。他进一步展开了一个观点框架——人类偏好的 AI 输入是音频,但 AI 偏好的输出是视觉(图像/动画/视频)。人类大脑约三分之一是视觉处理系统,是信息的十车道超高速公路。他的预测演进路径:1) 原始文本 → 2) Markdown → 3) HTML(当前萌芽的新默认)→ ... → n) 可交互的神经生成视频/模拟。faviconx.com

petergyang(Cursor 联合创始人)

  • AI 应该帮家长读懂学校通知 "你知道什么是一个好的 AI 自动化吗:当我收到孩子学校 10 页的周报时,让 AI 告诉我有没有提前放学或其他需要注意的事。"这个需求看似简单,却精准击中家长日常痛点。faviconx.com

swyx(AI Engineer 创始人)

  • Thinky Machines 重振了 Omnimodel 梦想 swyx 对 Thinky Machines 的实时多模态模型给出了强烈评价:"basically everyone's definition of 'realtime' just got a massive fucking upgrade",认为他们"brutally framemogged gdm and oai"。这是对近期多模态模型军备竞赛的一个有趣注脚。faviconx.com

技术动态

gdb(Geoffrey Hinton)

  • /goal 指令被低估 Hinton 记录了一次实验:一个 gnarly 代码重构任务,所有前沿模型(包括 o1)都没能解决,但 GPT-5.5 开启 extra high thinking 模式并配合 /goal 指令,成功完成。"I am floored." /goal 作为元认知指令的价值值得进一步探索。faviconx.com

X / Twitter

35
gdb
gdb @gdb
agents make for a surprisingly great product
petergyang
petergyang @petergyang
You know what would be a good AI automation:

When I receive those 10 page weekly newsletters from my kid's school I want AI to tell me if there's early dismissal or anything I should pay attention to.
garrytan
garrytan @garrytan
Retweeted
Shaun Maguire Shaun Maguire
I know this sounds insane
I read "Extraordinary Popular Delusions: And the Madness of Crowds" as a teenager
I thought AI would likely have a bubble phase 18 months ago
But now I think this is 1000x bigger than the Industrial Evolution
This is an evolution of our species
Geiger Capital: Have you considered the possibility that it’s not a bubble and the world is indeed changing at a pace humanity has never seen before, anon.
garrytan
garrytan @garrytan
The highest and most important form of design is actually pure transmutation of human pain and suffering.
swyx
swyx @swyx
Retweeted
Stephen Stephen
My talk at @aiDotEngineer is now online.
I talked about our research and where @bfl_ml is heading.
Thanks @swyx for the invite
https://youtu.be/x8Yb4RidLgM?is=ad8jtmUrL5boUbTU
gdb
gdb @gdb
/goal is underrated

james yu: We have a gnarly refactor in our codebase that I test every frontier model on. I've been doing this since the release of o1. Nothing has been able to solve it.

GPT-5.5 extra high thinking with /goal just did it.

I am floored.
ylecun
ylecun @ylecun
Retweeted
Rodney Brooks Rodney Brooks
What will we be like when he is gone? Can we return to mutual respect? Can we believe we are all on the same team as Obama and McCain did? Can we imagine the mutual respect of those two, competitors but compatriots? See this at 12:11 for what our country should be as stated by McCain and reported by Obama. https://www.youtube.com/watch?v=eLIlOQuzTnU
swyx
swyx @swyx
Retweeted
Alex Lupsasca Alex Lupsasca
I recently joined @latentspacepod to talk about AI for physics.
We dug into recent work on scattering amplitudes with GPT, and what it suggests about how AI will accelerate theoretical discovery in a rapidly evolving field.
Latent.Space: 🔬Doing Vibe Physics
The full story of how GPT‑5.x derived new results in theoretical physics and quantum gravity, live on our Science pod today!
https://latent.space/p/lupsasca
our conversation with @ALupsasca, an award winning theoretical physicist on his AGI-pilling journey
garrytan
garrytan @garrytan
Don’t ride someone else’s horseless carriage

Time to build your own Ferrari

Personal AI is here

阿绎 AYi: 我一直觉得,现在 99% 的人使用 AI 的方式都是错的。

他们还在和 ChatGPT 聊天,
还在纠结哪个模型更好,
还在网上抄各种“神奇 prompt”。

就像汽车刚发明的时候,所有人还在研究怎么把马养得更壮、跑得更快。
而 Garry Tan 已经造好了属于自己的跑车。
我们正站在一个历史性的转折点:
从 Prompt

garrytan
garrytan @garrytan
My viral meta-meta-prompting post about personal AI is directly a consequence of the thinking about and building with metaprompting in January and February of this year 👇

Garry Tan: The one skill everyone who uses AI needs to master in 2026

Meta-prompting: using an LLM to generate, refine, and improve the very prompts you use to get work done.

https://garryslist.org/posts/metaprompting-is-a-skill-everyone-who-uses-ai-needs-to-master-in-2026
amasad
amasad @amasad
Retweeted
Nadia I Nadia I
Part of why I love building with @Replit is because the fingerprints of a mother builder are all over it. And as a mother myself, I can tell.
@HayaOdeh, the co-founder and “mama” of @Replit, said this perfectly:
“You did not become less capable when you became a mother. You became more capable in ways that don't always get named.”
https://blog.replit.com/mothers-who-build
#WomenInTech #MothersWhoBuild #MomFounder #BuildInPublic #EdTech #Replit
swyx
swyx @swyx
Retweeted
Sherry Jiang Sherry Jiang
“we gonna yolo our way into running the biggest conf in town.” and somehow… we actually did it with @aiDotEngineer singapore
a year ago, @agrimsingh @unprofeshme and i joked about the idea of running our own conference (after finishing an escape room).
the idea started from frustration -
that there wasn't a quality conference built for builders, by builders in asia.
that there's so much untapped talent here yet ppl think the "scene is dead."
it’s not dead. there’s an absurd amount of untapped talent here. the right people just weren’t in the same room.
and instead of waiting around for someone else to step in, we decided to do it ourselves.
5 ppl, all part-time, running on agents and fumes.
and this week, somehow, that turned into minister vivian balakrishnan keynoting alongside many of the best ai companies and builders in the world.
thank you @swyx for trusting us to bring @aiDotEngineer to asia.
and thank you to our team, volunteers, speakers, sponsors, and everyone who believed in this before it was obvious.
let’s make a little history this week.
petergyang
petergyang @petergyang
Retweeted
Peter Yang Peter Yang
My top 5 takeaways from Moritz on building a personal OS with Claude Code:
1. Set up the right folders and files first
Create the following files and make claude(dot)md aware of them:
- soul md: Your agent’s personality and tone.
- user md: What the agent knows about you.
- tools md: A running list of CLI/MCP/APIs.
- memory folder: Daily notes from past chats and a long-term memory file.
📌 Read more below or watch the full tutorial: https://youtu.be/ACRd0Ikg_KI
Peter Yang: This is the most complete setup I've seen yet to turn Claude Code into your personal OS.
Here's my new episode with @moritzkremb where he shared the system that runs his email, content, and even grocery shopping.
We talked about:
→ The 4 layers: folders, tools, skills,
garrytan
garrytan @garrytan
Retweeted
Erica Sandberg 舊金山的神奇女俠 Erica Sandberg 舊金山的神奇女俠
I don't sugarcoat SF for international press: “I used to be enormously proud of my city, but now I sometimes turn away from people on the tram [cable car] because I’m so afraid they’ll be disappointed.”
The American freelance journalist has lived in San Francisco almost her entire life. She calls the city “my first love.”
“Now it’s like watching your child get cancer, and you want so desperately to help.”
The Mission is sick, so I gave these wonderful Norwegian reporters a tour of health. the amazing restaurants of Hayes Vally ("that's Absinthe, Chez Mama, A Mano!"), then Nob Hill (dreamy views, clean, charming), and made sure they drove down Lombard to visit North Beach.
https://www.vg.no/sport/i/Pdvjj0/san-francisco-vm-byen-som-doer
swyx
swyx @swyx
the inside story of the legendary Cog House.

i believe there have not been any public photos of this place until now (bc we were explicitly not allowed to lol)

as an advisor its been awe inspiring to see this company grow into a well oiled product and gtm machine that will be worth $100B by EOY (imo)




Colossus: Scott Wu is the co-founder of Cognition AI, one of the fastest-growing companies in history. He’s also the greatest competitive programmer the US has ever produced. You may have seen him doing impossible card tricks and mental math.

You’ve never seen him asked about weed,

swyx
swyx @swyx
openai's new $10b forward deployed company is so locked in they can't even bother to server side render properly

(jk congrats, nice win for UK AI 🇬🇧)




OpenAI: We’ve also agreed to acquire Tomoro, which will bring 150 experienced Forward Deployed Engineers and Deployment Specialists to the OpenAI Deployment Company from day one.
swyx
swyx @swyx
Retweeted
Consensus Consensus
Today, we're announcing $30M in new funding to build the AI OS for Research.
2.5M researchers start their work with Consensus every month. Their work is the foundation that all progress is built upon.
We could tell you our story. We'd rather they did👇
garrytan
garrytan @garrytan
Retweeted
Kayvon Jafarzadeh Kayvon Jafarzadeh
karpathy said “i don’t think i’ve typed a line of code since december” and everyone treated it like a meme.
garry tan treated it like a design prompt: what does it look like when one person runs like a whole software team?
gstack is the first oss repo in a while that actually feels like that answer.
not ai as autocomplete.
ai as CEO + staff eng + qa + security + design + release + browser operator + parallel execution layer, all wired through workflows.
and the number is wild: garry claims ~810x higher pace vs 2013, normalized for logical changes (not fake ai loc).
the shift isn’t “faster coding”.
it’s directing + reviewing + orchestrating a swarm without shipping garbage.
stuff that stood out:
→ /office-hours challenges your product before you build
→ /autoplan runs the CEO/design/eng pass
→ /qa literally drives a browser, finds bugs, fixes them
→ /review catches prod-tier issues before you ship
→ /pair-agent + parallel sprints across projects
we’re moving from “ai helps devs code”
to “devs operate systems of ai workers”.
Suryansh Tiwari: http://x.com/i/article/2053414177054842880
karpathy
karpathy @karpathy
This works really well btw, at the end of your query ask your LLM to "structure your response as HTML", then view the generated file in your browser. I've also had some success asking the LLM to present its output as slideshows, etc.

More generally, imo audio is the human-preferred input to AIs but vision (images/animations/video) is the preferred output from them. Around a ~third of our brains are a massively parallel processor dedicated to vision, it is the 10-lane superhighway of information into brain. As AI improves, I think we'll see a progression that takes advantage:

1) raw text (hard/effortful to read)
2) markdown (bold, italic, headings, tables, a bit easier on the eyes) <-- current default
3) HTML (still procedural with underlying code, but a lot more flexibility on the graphics, layout, even interactivity) <-- early but forming new good default
...4,5,6,...
n) interactive neural videos/simulations

Imo the extrapolation (though the technology doesn't exist just yet) ends in some kind of interactive videos generated directly by a diffusion neural net. Many open questions as to how exact/procedural "Software 1.0" artifacts (e.g. interactive simulations) may be woven together with neural artifacts (diffusion grids), but generally something in the direction of the recently viral https://x.com/zan2434/status/2046982383430496444

There are also improvements necessary and pending at the input. Audio nor text nor video alone are not enough, e.g. I feel a need to point/gesture to things on the screen, similar to all the things you would do with a person physically next to you and your computer screen.

TLDR The input/output mind meld between humans and AIs is ongoing and there is a lot of work to do and significant progress to be made, way before jumping all the way into neuralink-esque BCIs and all that. For what's worth exploring at the current stage, hot tip try ask for HTML.

Thariq: http://x.com/i/article/2052796100608974848
garrytan
garrytan @garrytan
Retweeted
Tero Tasanen Tero Tasanen
Just fired up DS4 by @antirez on my Mac Studio M3 Ultra 256GB and man, it’s seriously impressive. A clean, purpose-built engine for DeepSeek V4 Flash that actually makes frontier-level reasoning feel usable locally.
1M context, strong coherence, and solid speed on consumer hardware. This is the kind of focused, no-bullshit effort that finally brings real frontier models to regular machines instead of just giant GPU clusters. Huge respect @antirez — thank you for building this 🔥
https://github.com/antirez/ds4
amasad
amasad @amasad
Retweeted
My First Million My First Million
Replit's founder made $3M in 2 days shipping a half-ready product.
Amjad Masad's team wasn't ready to launch Replit Agent.
He said: &#34;I don't care if it's semi-broken. If it works 50% of the time, it'll wow the world.&#34;
For the first time ever, an AI agent could write code, debug it, create a database, and deploy to the cloud.
End-to-end.
He shot a video on his iPhone. Posted it.
Andrej Karpathy quote-tweeted: &#34;I feel-the-AGI moment.&#34;
OpenAI and Anthropic researchers reached out: &#34;We didn't know our models could do that.&#34;
Day 1: $1M ARR Day 2: $2M ARR
In 48 hours, he made more than 8 years of trying.
Amjad: &#34;Product-market fit feels like stepping on a landmine. We pivoted for years. That moment was the landmine.&#34;
If it doesn't feel like a landmine, you haven't found it yet.
Full episode: https://www.youtube.com/watch?v=ddSucXf0CuY&t=1661s
@amasad @thesamparr @ShaanVP
sama
sama @sama
Retweeted
Greg Brockman Greg Brockman
Introducing the OpenAI Deployment Company, which will help businesses maximally succeed with their deployments of AI.
Starting with 150 Forward Deployed Engineers and Deployment Specialists, and $4 billion of initial investment from 19 partners.
OpenAI: Today we’re launching the OpenAI Deployment Company to help businesses build and deploy AI.
It's majority-owned and controlled by OpenAI. It brings together 19 leading investment firms, consultancies, and system integrators to help organizations deploy frontier AI to production
swyx
swyx @swyx
Retweeted
Black Forest Labs Black Forest Labs
The next generation of models won't just generate images - they'll understand worlds, motion, interaction, and action. We've been building toward this for a while.
Visual intelligence is becoming real-time. @stephenbtl spoke about where we're headed at @aiDotEngineer:
amasad
amasad @amasad
Retweeted
luca. ∆ИƉЯƐΛ luca. ∆ИƉЯƐΛ
I will not confirm nor deny whether I love these Parallel Agents
Replit ⠕: Meet Replit Parallel Agents
Build faster by running up to 10 agents in parallel
Each agent gets its own copy of your app
They work on their own computer
Then merge their work agentically
amasad
amasad @amasad
You haven’t felt AI progress if you’ve merely use agents and haven’t experienced massively parallel agents.

Merely running multiple agents is fun, but the breakthrough is having them orchestrated correctly, merge back seamlessly, and making 10x faster progress on your projects.

Replit ⠕: Meet Replit Parallel Agents

Build faster by running up to 10 agents in parallel

Each agent gets its own copy of your app
They work on their own computer
Then merge their work agentically

amasad
amasad @amasad
Retweeted
Samuel Spitz Samuel Spitz
Fun fact: I made the storyboard for this video with Replit Canvas
Replit ⠕: Meet Replit Parallel Agents
Build faster by running up to 10 agents in parallel
Each agent gets its own copy of your app
They work on their own computer
Then merge their work agentically
garrytan
garrytan @garrytan
Retweeted
Gopi Krishna Gopi Krishna
this is such an important take on so many levels from @garrytan
I've been working on GPT models creating different products for over 3 years now.
first, models are now not the bottleneck. Not at least since last Nov when Opus was out.
it is back to engineering problems now. I've built something so cool as a solo-founder running all the GTM myself via Claude Code - quite deterministically.
My posts on instagram look stunning - because I have built skills and commands around it containing python pipelines and markdown files - ad reports come from my Google Ads MCC account, which auto-improves campaigns. Emails - same thing.
Images and Videos - easy peesy. The key is meta-meta-prompting.
in simple words, learn to create the harness around the model. Model may fail, but the system will ensure the outputs don't.
Garry Tan: http://x.com/i/article/2052898104039657472
swyx
swyx @swyx
Retweeted
Prince Canuma Prince Canuma
My @aiDotEngineer talk is live: &#34;On-device Intelligence using MLX&#34; 🎥
Huge thanks to @swyx and the team for having me — hands down the best tech event I've been to.
And a shoutout to the community shipping with our packages and pushing the ecosystem forward: @MaziyarPanahi, @adrgrondin, @nopmobiel, @lllucas, @KarnikShreyas, @beshkenadze and many more. You make this fun.
https://youtu.be/zTLJNHj0DeQ?si=cjIPK4ZDm7TUuN_o
ylecun
ylecun @ylecun
Retweeted
Internet Archive Internet Archive
News organizations are increasingly blocking the Wayback Machine even as their reporters still depend on it 📰
In PRESERVING THE WEB IN THE AGE OF AI, Mark Graham, Director of the Wayback Machine at the Internet Archive, explains how major newsroom staff rely on archived web history because their internal archives often miss the deeper public record.
That contradiction sits at the heart of the preservation crisis unfolding online.
🎧 Listen on the Future Knowledge #podcast ⤵️
https://futureknowledge.transistor.fm/episodes/preserving-the-web-in-the-age-of-ai
📚 Read VANISHING CULTURE free from the Internet Archive ⤵️
https://archive.org/details/vanishing-culture-2026
#WaybackMachine #AI #WebArchive #FutureKnowledge @MarkGraham
amasad
amasad @amasad
Retweeted
Replit ⠕ Replit ⠕
20 builders. Week 3 of 8. No more dreaming, only shipping.
See who's chasing their first dollar this Wednesday.
Episode 3 of Race to Revenue ⠕
gdb
gdb @gdb
Daybreak: our umbrella effort for defensive acceleration, equipping cyber defenders with the best possible frontier AI capabilities.

OpenAI: Introducing Daybreak: frontier AI for cyber defenders.

Daybreak brings together the most capable OpenAI models, Codex, and our security partners to accelerate cyber defense and continuously secure software.

A step toward a future where security teams can move at the speed

swyx
swyx @swyx
I believe the kids call this "@thinkymachines just brutally framemogged gdm and oai".

basically everyone's definition of "realtime" just got a massive frciking upgrade


swyx 🌉: lowkey the funniest videos of the batch. thinky has some comedians!!

congrats to @thinkymachines on reviving the omnimodel dream that others could not


sama
sama @sama
would you call it

a superapp?

Rex Sorgatz: After being a Claude Code devotee for a year, I finally tried Codex on a new project this weekend. Once again, in the matter of a few months, it feels like the world changed.

I can see myself doing *everything* inside of Codex this week.
sama
sama @sama
speaking of things that have gotten over a threshold for me, the combo of the new ChatGPT model, personality, and personalization feels like a new thing
garrytan
garrytan @garrytan
Retweeted
FBI SanFrancisco FBI SanFrancisco
#FBI CASE UPDATE: Three men have been indicted on robbery, kidnapping, and conspiracy charges related to a $6 million cryptocurrency robbery spree throughout the Bay Area and LA. Elijah Armstrong, Nino Chindavanh, and Jayden Rucker - all from Tennessee- were charged on Conspiracy to Commit Hobbs Act Robbery, Conspiracy to Commit Kidnapping, Attempted Hobbs Act Robbery, and Attempted Kidnapping relating to a violent robbery spree targeting cryptocurrency owners. Armstrong and Rucker were arrested in Los Angeles on December 31, 2025, and Chindavanh was arrested on December 22, 2025, in Sunnyvale. Chindavanh made his initial appearance in federal court in San Francisco on April 14, 2026, and Armstrong and Rucker made their initial appearances in federal court in San Francisco earlier today. According to the indictment filed March 31, 2025, Armstrong, Chindavanh, and Rucker are alleged to have conspired to kidnap and rob individuals in San Francisco, San Jose, Sunnyvale, and Los Angeles in efforts to steal cryptocurrency from the victims. The defendants traveled from Tennessee to commit the alleged crimes and posed as delivery persons to gain access or attempt to gain access to the victims’ residences. They then used firearms, duct tape, and zip ties to assault their victims, including by binding and restraining a victim in order to force him to divulge his account information. In one of the incidents perpetrated by the conspiracy, the victim was forced at gunpoint to sign into his cryptocurrency accounts so that a co-conspirator could transfer approximately $6.5 million from his cryptocurrency accounts to a wallet controlled by the co-conspirators. Armstrong, Chindavanh, and Rucker are currently in federal custody. Armstrong and Rucker are next scheduled to appear on May 12, 2026, for appointment of counsel before U.S. Magistrate Judge Thomas S. Hixson. Chindavanh is next scheduled to appear on June 26, 2026, for a status hearing before U.S. District Judge Trina L. Thompson.

YouTube

0

No recent videos fetched on this date.