← 2026-04-24

Daily Edition

2026-04-25

2026-04-26 →

AI Builders 日报 — 4月25日

追踪 AI 领域真正在做事的人,而不是空谈者。

今日思考

Sam Altman 说 GPT-5.5 有一种"小引擎愿意相信"(little engine that could)的气质——这句话放在当下的 AI 竞赛里意外地准确。xAI 的 Grok-3、DeepSeek-V4、Google Gemini 3 Ultra,所有人都在快速追赶闭源模型的能力边界。但真正值得注意的信号是 DeepSeek-V4-Flash:284B total / 13B active params,在消费级成本下实现接近顶级闭源模型的性能。这是开源社区又往前走了一步的明确证据。

与此同时,Peter Yang 用 GPT 5.5 + Codex 在 15 分钟内做出了可玩的 Star Fox 原型。这不是演示,而是对"多快能做出一个完整游戏"的重新定义。当工具本身已经不再是瓶颈,行业里最紧迫的问题变成了:你的想象力够不够用,而不是你的工程团队够不够大。


产品与发布

GStack v1.13

Garry Tan(Y Combinator CEO)发布 GStack v1.13,新增跨模态合成功能。他目前更多使用 GPT-5.5 配合 Conductor 工作流,并发现有时需要在推进前获得 Opus 4.7 的反馈意见——跨模态综合已成为他日常依赖的核心能力。faviconx.com

GBrain v0.21

Garry Tan 同时发布 GBrain v0.21,新增代码图谱(code graph)支持,整合到现有的检索体系中:图谱检索、向量检索、混合 RRF(倒数排名融合)以及 grep 全文搜索。他正在推进下一版 GStack 的修订,目标是将代码会话中的记忆和"梦境"(dreaming)能力纳入 GBrain 实例。faviconx.com

GPT-5.5 登陆 Databricks

OpenAI GPT-5.5 现已在 Databricks 上线,通过 Unity AI Gateway 实现完整的模型推理治理。企业用户可以基于企业数据构建定制化 Agent,并利用 Codex 等编码工具实现工作流自动化。faviconx.com


观点与判断

Garry Tan(Y Combinator CEO)

  • YC 官方警告:别把 booking 当 revenue,别把 pilot 当 revenue YC 明确要求创始人在描述业务指标时保持准确和诚实——特别是 pilot、booking、revenue 和 recurring revenue 这几个层级不要混淆。YC 认为精确度是信任的基础,也是衡量业务真实进展的前提。faviconx.com

  • 科学需要开源和开放数据,比以往任何时候都更迫切 Garry 转发了关于 AI 生成论文质量的担忧,引用了"除非对代码、数据和实验记录建立极其严格的开放标准,否则可信但实为欺诈的论文潮即将到来"的判断。这与他作为 YC 掌舵人对开源生态的长期支持一致。faviconx.com

Peter Yang(@petergyang)

  • ChatGPT 是残缺版 Codex,Claude 是残缺版 Claude Code Peter 认为 ChatGPT 相对于 Codex,就像普通 Claude 相对于 Claude Code:前者缺少"构建和发布"的能力。他判断"说到底,一切不过是给 AI 下 prompt",而能直接执行和交付的工具链价值远高于对话式界面。faviconx.com

  • Cursor 迟早成为 Xcode Peter 评价 Xchat"是个好东西"(banger),并预测 Cursor 将逐步演变成类似 Xcode 的 IDE——深度集成 AI 能力的开发者工具,这代表了 AI 原生开发环境的演进方向。faviconx.com

swyx(AI Engineer 联合创始人)

  • swyx 与 hhua 正在筹划重要发布 swyx 预告正在与 hhua 联合制作某个项目,喊话"在 ICML Seoul 之后保持日历开放"——具体内容尚未最终确定,但暗示这将是一场面向 AI 工程师社区的重要发布。faviconx.com

技术动态

DeepSeek-V4 Preview

DeepSeek 发布 V4 Preview 并正式开源,标志性特性是成本效益极高的 100 万 token 上下文长度。具体规格:DeepSeek-V4-Pro 为 1.6T total / 49B active params,性能直逼全球顶级闭源模型;DeepSeek-V4-Flash 为 284B total / 13B active params,在消费级成本下实现接近顶级闭源模型的性能。这代表了开源模型阵营在长上下文能力和成本效率上的又一次突破。faviconx.com

X / Twitter

59
garrytan
garrytan @garrytan
Retweeted
Dan McAteer Dan McAteer
Browser Harness:
Self-healing harness that enables LLMs to complete any task.
mattshumer_
mattshumer_ @mattshumer_
With all this talk about "intelligence per token"... well...

Matt Shumer: We need a new term like "Intelligence per second" to capture the magic of some of these newer models
garrytan
garrytan @garrytan
Truly an honor and blessing to host @demishassabis at YC today 🙏
garrytan
garrytan @garrytan
Retweeted
Dimitri Dadiomov Dimitri Dadiomov
It might feel like we’re living in the steep end of the J-curve, but we ain’t see nothing yet. These two are doing as much as anyone to steepen the curve and bend the future closer - fascinating chat today with @garrytan and @demishassabis at @ycombinator
garrytan
garrytan @garrytan
Retweeted
Bruno Koba Bruno Koba
Demis Hassabis at YC today:
"We're only one or two technical breakthroughs away from AGI. But all the other parts are already in place."
petergyang
petergyang @petergyang
Xchat is a banger

Only a matter of time before Cursor becomes Xcode amirite? 🙂
garrytan
garrytan @garrytan
Retweeted
Teknium 🪽 Teknium 🪽
I literally run 12 hermes agent instances every day in parallel to build Hermes Agent, and its now a top 100 GitHub repositories of all time. Agents do bring value and do create substantive software and work.
David Cramer: Everyone is slowly coming to this realization, and I assure you, no one is running multitudes of agents overnight. No one that is doing anything of substance at least.
There _are_ people pretending to be scientists, or fully caught up in their drug infused AI overdose, that
garrytan
garrytan @garrytan
Retweeted
Demis Hassabis Demis Hassabis
Thanks for inviting me @garrytan, was awesome to chat and loved the inspirational space! Great to see so many startups building with @googlegemma models!
Garry Tan: Truly an honor and blessing to host @demishassabis at YC today 🙏
petergyang
petergyang @petergyang
Retweeted
Peter Yang Peter Yang
Asked GPT 5.5 and Codex to make Star Fox. This is after 15 min of prompting 🔥
Peter Yang: Should I ask Codex to make Star Fox next
garrytan
garrytan @garrytan
Retweeted
Davit Davit
"What would the developer experience even be like for someone using a continual learning model? Any idea how you’d actually steer it?" @garrytan asked, @demishassabis answered.
"Not having continual learning right now is one of the things holding agents back from doing full end-to-end tasks.
They’re really useful for aspects of tasks today. You can compose them together and do some really cool things, but they don’t adapt well to the specific context you’re operating in.
That’s the missing piece for them to become truly fire-and-forget agents that can just figure it out themselves.
They need to be able to learn about the specific context you put them in. So I think we have to crack continual learning to reach full general intelligence."
Davit: Demis Hassabis and Garry Tan on the importance of context engineering and memory for continual learning!
garrytan
garrytan @garrytan
Change the party to murder all capitalists (Piker) and destroy the schools (Bowman), brought to you by a tech centimillionaire who is using his money to buy an election (Chakrabarti)

No thanks!
gdb
gdb @gdb
the openai team ships
garrytan
garrytan @garrytan
Retweeted
Kane 謝凱堯 Kane 謝凱堯
The Insurance Commissioner is a down ballot election but has a lot of impact on cost of living.
Jane Kim is a horrible candidate from the Bad Days of SF’s recent past. She’s trying to use this niche election to re-enter politics.
Leave her vote blank. Tell your friends.
Jane Kim 金貞妍: The numbers just dropped and we have great news.. it's all because of you. THANK YOU!
swyx
swyx @swyx
inside me there are two wolves

John 🔱 Li: @aiDotEngineer @mattpocockuk 🤭

garrytan
garrytan @garrytan
Retweeted
Paul Graham Paul Graham
Hamming's talk is so important that I reproduced it on my site. It's one of the only things on my site written by someone else.
https://paulgraham.com/hamming.html
Ihtesham Ali: A mathematician who shared an office with Claude Shannon at Bell Labs gave one lecture in 1986 that explains why some people win Nobel Prizes and other equally smart people spend their whole lives doing forgettable work.
His name was Richard Hamming. He won the Turing Award. He
swyx
swyx @swyx
Retweeted
Paul Graham Paul Graham
Hamming's talk is so important that I reproduced it on my site. It's one of the only things on my site written by someone else.
https://paulgraham.com/hamming.html
Ihtesham Ali: A mathematician who shared an office with Claude Shannon at Bell Labs gave one lecture in 1986 that explains why some people win Nobel Prizes and other equally smart people spend their whole lives doing forgettable work.
His name was Richard Hamming. He won the Turing Award. He
swyx
swyx @swyx
Re fun to think about what the pm thinks vs what the engineer thinks in this scenario
garrytan
garrytan @garrytan
Retweeted
Ananyo Bhattacharya Ananyo Bhattacharya
23 years old with no advanced mathematics training solves Erdős problem with ChatGPT Pro. "What’s beginning to emerge is that the problem was maybe easier than expected, and it was like there was some kind of mental block.”-Terence Tao https://www.scientificamerican.com/article/amateur-armed-with-chatgpt-vibe-maths-a-60-year-old-problem/
garrytan
garrytan @garrytan
Here's YC's official advice about being truthful and precise about what is pilot, bookings, revenue and recurring revenue.

Founders, particularly first time founders, need to sear this into their brains. Don't mistake one tier for another. Be precise, and always be truthful.
garrytan
garrytan @garrytan
Science needs open source and open data now more than ever

Crémieux: Science is about to get absolutely nuked.

Unless we get extremely strict about providing and opening up code and data and documenting lab experiments rigorously, a torrent of credible-looking but fraudulent papers is upon us.
garrytan
garrytan @garrytan
Retweeted
Daeshawn Daeshawn
Garry Tan: Here's YC's official advice about being truthful and precise about what is pilot, bookings, revenue and recurring revenue.
Founders, particularly first time founders, need to sear this into their brains. Don't mistake one tier for another. Be precise, and always be truthful.
petergyang
petergyang @petergyang
My next guest is making $1M+ a month (!) from 5 AI products that he built as a solo founder.

Tomorrow, Tibo will break down:

→ How to validate and kill ideas fast
→ Why SEO is still his top channel in the AI era
→ The pricing sweet spot for AI products

Tibo also shared how 9 of his products failed before one hit $600K/month.

📌 Subscribe to get our full interview tmr: https://www.youtube.com/@peteryangyt?sub_confirmation=1
petergyang
petergyang @petergyang
Retweeted
Peter Yang Peter Yang
My next guest is making $1M+ a month (!) from 5 AI products that he built as a solo founder.
Tomorrow, Tibo will break down:
→ How to validate and kill ideas fast
→ Why SEO is still his top channel in the AI era
→ The pricing sweet spot for AI products
Tibo also shared how 9 of his products failed before one hit $600K/month.
📌 Subscribe to get our full interview tmr: https://www.youtube.com/@peteryangyt?sub_confirmation=1
garrytan
garrytan @garrytan
Retweeted
Erik Bernhardsson Erik Bernhardsson
Humble request to techies to stfu about AI mass unemployment and start to talk about using GPUs to cure cancer and find new materials and all the other amazing opportunities
Polymarket: JUST IN: An AI data center moratorium is now projected to pass this year as protests intensify nationwide.
85% chance.
gdb
gdb @gdb
GPT-5.5 raises the ceiling of ambition for what you can do with AI:

Simon Smith: Riley's recent tests and various posts about GPT-5.5 have revealed something about model progress. Models are already so good, you need to raise your ambitions or you won't realize just how good they've become. If you don't raise your ambitions, you'll think they've stagnated.
gdb
gdb @gdb
what are you building with codex?

Paul Solt: What app are you making this weekend with GPT 5.5 and Codex?
sama
sama @sama
5.5 is so earnest

"little engine that could" energy
gdb
gdb @gdb
GPT Image 2 is great for learning

OscarAI: GPT Image 2 is also great for summarizing books or scientific essays through highly visual, detailed infographics.

Here I asked it for an infographic on On the Origin of Species by Charles Darwin.

ylecun
ylecun @ylecun
Retweeted
Internet Archive Internet Archive
When MTV News shut down, it felt like decades of culture vanished overnight 🕳️
But over 470,000 pages were already preserved.
That history didn’t disappear.
It was archived.
📚 Read VANISHING CULTURE to see why it matters
📖 Download & read: https://archive.org/details/vanishing-culture-2026
🛒 Purchase in print: https://www.betterworldbooks.com/product/detail/vanishing-culture-a-report-on-our-fragile-cultural-record-9798995425014/new
#VanishingCulture #DigitalMemory #InternetArchive #MTVNews #BookTwitter
amasad
amasad @amasad
Retweeted
Y Combinator Y Combinator
Replit is the leading no-code app builder for consumers and enterprise, letting anyone with an idea build real, deployed software using natural language. The company just raised a $400 million Series D at a $9 billion valuation.
In this episode of Founder Firesides, co-founder and CEO Amjad Masad (@amasad) sat down with YC's Andrew Miklas (@amiklas) to talk about @Replit's 10-year journey from browser IDE to vibe coding platform, why the people getting the most value aren't traditional devs but founders and domain experts closest to the problem, and what Agent 4 unlocks with parallel agents, built-in design, and the ability to run your entire company on Replit.
0:28 - Anyone Can Build Software
2:14 - The Rise of AI-Native Builders
4:52 - Not Just Developers Anymore
7:18 - What People Are Actually Building
10:36 - How Replit Is Spreading Everywhere
14:02 - What You Can Build (and What You Can’t)
19:22 - YC, Growth, and Early Lessons
23:18 - From Vibe Coding to Autonomous Agents
29:44 - The Future: Everyone Becomes a Builder
36:12 - What Skills Matter Now
petergyang
petergyang @petergyang
ChatGPT is just a less useful version of Codex that can't build and ship stuff.

Same for regular Claude vs. Claude Code.

It's all just prompting the AI at the end of the day.
petergyang
petergyang @petergyang
Making stuff is fun
garrytan
garrytan @garrytan
Retweeted
Liz4SF Liz4SF
if you haven't heard, YC launched a zine machine "where san francisco goes, so goes the world" w/ determination and deliberation that cant be ignored
https://thevoicesf.org/y-combinator-launches-its-zine-machine
swyx
swyx @swyx
Retweeted
Matt Carey Matt Carey
My talk at AI Engineer “Every API is a Tool for Agents” is out on YouTube
https://youtu.be/YBYUvGOuotE
Thanks to @swyx and the @aiDotEngineer crew for having me :)
garrytan
garrytan @garrytan
I’m having my OpenClaw read my notes and writings from my time as an undergrad at Stanford and this was a particularly unusual find
garrytan
garrytan @garrytan
Retweeted
David Mlcoch David Mlcoch
I'm glad @ycombinator is advising founders to be truthful about their revenue numbers.
I remember our first office hours with @harjtaggar who I believe gave us a demo day goal of $10k MRR, a few paying customers who love us and more importantly figuring out who exactly our customers are. (Getting revenue is easier than that:)
I felt $10k MRR is nothing, I saw Twitter filled with exponentials of $$$ in 5 days. I felt YC was lowering our ambitions. Only later when we reached $10k MRR, I realized how hard the actual 0 -> $10k MRR in 3 months is and that it's a great achievement.
Garry Tan: Here's YC's official advice about being truthful and precise about what is pilot, bookings, revenue and recurring revenue.
Founders, particularly first time founders, need to sear this into their brains. Don't mistake one tier for another. Be precise, and always be truthful.
garrytan
garrytan @garrytan
GStack v1.13 shipped - simple feature, I'm spending a lot more time using GPT-5.5 with Conductor and so I realized I sometimes wanted to get Opus 4.7's take on a change before I kept going. Cross-modal synthesis is something I find myself relying on more and more.
petergyang
petergyang @petergyang
People on X:
“Looks fun, do a barrel roll!”

People on Threads:


Peter Yang: Asked GPT 5.5 and Codex to make Star Fox. This is after 15 min of prompting 🔥

ylecun
ylecun @ylecun
Retweeted
The Philadelphia Inquirer The Philadelphia Inquirer
OPINION: "Trump’s mismanagement has resulted in death, destruction, wasted tax dollars, and a huge loss in respect for the United States — all to accomplish nothing." https://ebx.sh/V5TJjQ
garrytan
garrytan @garrytan
GBrain v0.21 now adds code graph support to our existing retrieval for non-code: graph, vector, hybrid RRF, and of course grep

I'm working on revisions to GStack to support this code graph AND incorporate memory and dreaming from your code sessions into your GBrain instance
garrytan
garrytan @garrytan
Retweeted
Adish Jain ☕️ Adish Jain ☕️
i just don't understand this take.
most people going thru @ycombinator are first-time technical founders that are tinkering and building.
it's not like you become a founder and then you're suddenly instilled with an understanding of the differences between LOIs, cARR, annualized revenue run rate, ARR, etc.
most of YC is spent building something people want and most of being a founder is about learning as you go.
and in a world where founders are incentivized to inflate their revenue to look like they're keeping up with the speed of AI and ever-moving goal post of "success" to unlock funding, laying things out in simple terms is not shocking. it is necessary.
Anthony Lombari: ngl its kind of shocking we need to lay this out like this?
This should already be known to founders.
swyx
swyx @swyx
btw we are cooking something with @hhua_

(not final yet but keep calendar open after ICML in Seoul)


DeepSeek: 🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.

🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params.

garrytan
garrytan @garrytan
Retweeted
Chamath Palihapitiya Chamath Palihapitiya
On page twenty-six of “The Billionaire Tax” proposal in California, it explains how the state legislature can convert from a Billionaire Tax to an Everyone Tax without voter approval.
They can also adjust the tax to be a yearly tax, not just one time…again, without your approval.
Intelligence test for you: if this was meant to just target Billionaires, why did they write this in?
Chamath Palihapitiya: The Billionaire Tax is actually an Everyone Tax.
The Billionaire Tax is a new tax proposal written by four professors who don't believe in the American dream. Some of them aren’t even American…go figure.
Despite its name, it applies to every California resident who currently
petergyang
petergyang @petergyang
I often have to get Codex to fix my OpenClaw set up 🥲
amasad
amasad @amasad
Retweeted
بدر الحناكي بدر الحناكي
سويت لعبة تأثير ستروب مرة ثانية
garrytan
garrytan @garrytan
Retweeted
Adish Jain ☕️ Adish Jain ☕️
here's an "explain it like im 5" explanation of this for the 5-year old founders that will be joining the next @ycombinator batch
one-shotted with @motion_so
Garry Tan: Here's YC's official advice about being truthful and precise about what is pilot, bookings, revenue and recurring revenue.
Founders, particularly first time founders, need to sear this into their brains. Don't mistake one tier for another. Be precise, and always be truthful.
garrytan
garrytan @garrytan
Retweeted
Vox Vox
gbrain 0.21 lets your OpenClaw / Hermes read code through a code graph.
→ sharper debug: ask about one function, get the body + who calls it + what it calls + parent class scope. no more isolated chunks
→ see what gets affected before you change a line
→ PR review can ask which call sites a change actually touches
→ sharper code search
→ language coverage works for us: TS/TSX/JS/Python fully covered
plain version: help your agents actually understand code structure now.
Garry Tan: GBrain v0.21 now adds code graph support to our existing retrieval for non-code: graph, vector, hybrid RRF, and of course grep
I'm working on revisions to GStack to support this code graph AND incorporate memory and dreaming from your code sessions into your GBrain instance
gdb
gdb @gdb
GPT Image 2 for learning about endangered animals

Harboris: GPT Image 2 on ChatGPT

Prompt
Create a visually rich infographic about an endangered animal. Start by finding one online, research its habitat, diet, and unique traits. Present information through annotated visuals and structured callouts, not generic sections. Style it like a

garrytan
garrytan @garrytan
Retweeted
Jason Freedman Jason Freedman
This is the same advice I heard 10+ years ago. YC hasn’t changed much in this type of guidance.
When founders mix one or another up…(usually because they’re young and inexperienced) it takes like 90 seconds of questioning to get to the accurate perspective of the revenue progress. They often fix the slide live on our call with them.
Is everyone else really struggling here? Or do people just enjoy the rage baiting at YC?
Also FWIW, revenue metrics (of all kinds) are not very predictive at this early moment. John and Patrick were in my batch and had no revenue (nor CARR/pipeline/ARR) for dev/payments (Stripe). A lot of startups were ‘farther along’
Garry Tan: Here's YC's official advice about being truthful and precise about what is pilot, bookings, revenue and recurring revenue.
Founders, particularly first time founders, need to sear this into their brains. Don't mistake one tier for another. Be precise, and always be truthful.
sama
sama @sama
how can they write code so fast?!

henrique cunha: codex app is trending to be the best software i've ever used

ridiculous how fast it got so good
gdb
gdb @gdb
GPT-5.5 for the enterprise:

Databricks: .@OpenAI GPT-5.5 is now available on Databricks, with Codex coding workflows and model inference fully governed through Unity AI Gateway.

With GPT-5.5 on Databricks, you can:
- Power coding workflows with Codex or other coding agents
- Build custom agents grounded in enterprise

swyx
swyx @swyx
Retweeted
Kevin Smith Kevin Smith
Such a fun @snipd_app user meetup in SF today :)
Today, we had a Snipd user join us who remixes parts of podcasts into music tracks 🤯 -> new feature idea? 🤔
Thx @swyx & @cognition for hosting us!
gdb
gdb @gdb
oh, *that’s* what tenet was about

Umesh: ChatGPT Images 2.0 explains “Tenet” in a simple way!

gdb
gdb @gdb
oh, *that’s* what tenet was about

Umesh: ChatGPT Images 2.0 explains “Tenet” in a simple way!

garrytan
garrytan @garrytan
Retweeted
Gustaf Alströmer Gustaf Alströmer
Shortage of housing is the source of many of the most critical problems in the San Francisco and the bay area. Scott Weiner have done more than anyone to fix it. I'm voting for him on June 2nd. @Scott_Wiener
SB 35 (2017) — Streamlined housing approvals in cities failing to meet state housing goals, his signature housing law.
SB 9 (2021) — Upzoned most of California to allow up to 4 units per single-family lot.
SB 79 (2025) — Legalized mid-rise apartments around major transit stops, a decade-long fight finally won.
Kim-Mai Cutler: There’s only one candidate in the SF Congressional race with the receipts to show they’ve been working on housing for 10+ years.
There are finally towers being proposed near me in the last month.
Would have been impossible without him.
gdb
gdb @gdb
GPT Image 2 for changing the style of any photo of yourself or your family

Linus ✦ Ekenstam: You can try this:

Turn any photo into a beautiful woodcut/linocut style, GPT Image-2 does a great job with details, expressions.

Perfect thing for the profile picture or the family photo. Or why not a gift?

Try for yourself, full prompt below ⤵️


gdb
gdb @gdb
GPT Image 2 for changing the style of any photo of yourself or your family

Linus ✦ Ekenstam: You can try this:

Turn any photo into a beautiful woodcut/linocut style, GPT Image-2 does a great job with details, expressions.

Perfect thing for the profile picture or the family photo. Or why not a gift?

Try for yourself, full prompt below ⤵️


gdb
gdb @gdb
GPT Image 2 for reimagining damaged photos:

WasifAI: Age-worn, damaged images can now turn into 4k with just one prompt using Chatgpt. & Its Free 💸

Made on Chatgpt, GPT-2

Prompt: 👇🏻


gdb
gdb @gdb
GPT Image 2 for reimagining damaged photos:

WasifAI: Age-worn, damaged images can now turn into 4k with just one prompt using Chatgpt. & Its Free 💸

Made on Chatgpt, GPT-2

Prompt: 👇🏻


YouTube

0

No recent videos fetched on this date.