← 2026-05-01

Daily Edition

2026-05-02

2026-05-03 →

AI Builders 日报 — 5月2日

追踪 AI 领域真正在做事的人,而不是空谈者。

今日思考

Replit 十周年免费日活动吸引了大量开发者涌入,用户的反馈揭示了一个清晰信号:AI 编程工具的「民主化」拐点已至。当一个人可以在 24 小时内用自然语言同时跑 22 个并行 agents 从零构建应用,「编程」这件事的门槛已经降到了前所未有的低点。Sam Altman 的判断也印证了这一点——尽管他一直在推文说想要更便宜更快的模型,但现实是更聪明仍然是第一优先级。这两件事放在一起看:基础设施在卷性价比,应用层在卷智能上限。2026 年的 AI Builder 战场,两个方向都在狂奔,但聪明才是真正的护城河。


产品与发布

Replit 十周年:24小时全功能免费 + Buildathon

Replit 迎来十岁生日,宣布 5 月 2 日 5am PT 起全功能免费 24 小时,同时启动 Buildathon,奖池超过 $100K。amasad(Replit CEO)发推回顾公司使命——让编程对所有人可及,2011 年起步至今已服务数百万用户。社区反响热烈,有用户用 WisprFlow + Replit 20 分钟做出一个技术资源导航站,有人称这是「软件史上最强的生产力提升」,甚至有人跑出 22 个并行 agents 同时构建应用。

faviconx.com

GBrain v0.25.1:Book Mirror Skillpack 上线

Garry Tan 转发 Vox 报道了 GBrain 新功能:用户可以上传一本 EPUB,AI 会根据你「大脑里的知识」将每章内容映射到你的实际工作、兴趣和思考上——就像一个读过你笔记的治疗师在给你讲这本书。Garry 本人评价:「它还处于 Homebrew Computer Club 阶段,但已经能看到远方。」

faviconx.com

ChatGPT Images 爆发式增长

gdb 引用 Nick Turley(OpenAI)数据:新版 ChatGPT Images 发布后两周内用量增长超 50%,60% 的日活用户是新注册用户,应用场景横跨室内设计、学习、工作图形和创意创作。

faviconx.com


观点与判断

Amjad Masad(Replit CEO)

  • Replit 工程师在高强度负载下维持系统稳定 大量用户正在「运行 agents 大军去构建他们梦想中的一切」,Amjad 发推感谢工程和支持团队在峰值压力下保持服务可用。faviconx.com

Garry Tan(Y Combinator CEO)

  • AI-native App 的定义边界:不是「用了 AI」,是「AI 从底层重构了交互」 他转发 gbrain 代码分析,提出 AI-native app 区别于传统 app 的核心差异,引发社区讨论。faviconx.com

  • 软件工程 JD 应标注是否含 /fast mode 暗指 AI 编程工具已经将「正常速度」和「加速模式」变成两种截然不同的工作体验。faviconx.com

  • Hamming 早在几十年前就指明了方向:神经网络的本质是解决编程问题 Garry 发推引用 Hamming 的判断,配合 Ilya 提出的 scaling 路线图,认为这条路已经被验证。faviconx.com

  • AI 时代的新职业正在成型:Personal Agent Designer、Second Brain Engineer、Context Editor Garry 转发了 Prem Makeig 的观察,指出未来人类会对自己的 personal agent 产生强烈需求,职业市场正在形成。faviconx.com

  • Homebrew Computer Club 阶段是一切最有趣的时光 Garry 在评论 GBrain 时表示,「 Ferrari 让你体验极致,但你需要自己带扳手」——当前 AI 工具的不完善恰恰是这个阶段最迷人的地方。faviconx.com

Sam Altman(OpenAI CEO)

  • 5.5 xhigh 在 fast mode 表现优异 Sam 发推表示自己在 fast mode 下使用 GPT-5.5 的高频模式「真的很不错」,并承认之前在 Medium 上被关于「AI 中等能力」的说法短暂影响过判断。faviconx.com

  • 更聪明仍然是第一优先级,尽管我一直以为更便宜/更快才是 Sam 反思自己的直觉判断与现实数据之间的落差,认为尽管人们普遍期待性价比提升,但模型智能程度的提升带来的价值依然超越成本与速度的优化。faviconx.com

  • 重回 Twitter 是正确的选择 Sam 发推表示「回到 Twitter 感觉很好,青春时期练就的技能找到了用武之地」。faviconx.com

  • 未来发布会的规模会更大 Sam 承认 GPT-4.5 发布的体验活动参与人数远超预期,表示会在下次发布时计划更大的活动。faviconx.com

Peter Yang(AI 产品研究者)

  • 三层上下文系统让 AI 生成的应用质量大幅提升 Peter 转发 Ravi 的研究:当前大多数用户给 AI 的提示词过于简单(单行),而 3 层上下文系统(功能层+视觉层+数据层)可以显著提升 AI 原型应用的质量和灵活性。faviconx.com

  • 用 Codex 和 Claude Code 做「断舍离」式文件整理 Peter 分享了自己的用法:给 AI 完全访问电脑和 Google Workspace CLI,然后让它先出计划再执行——整理启动项、清理下载文件夹、整理 Google Drive,「现在我的文件和 Drive 终于能让人感到愉悦」。faviconx.com

  • MacBook Pro 买来跑本地模型,实际在用 AI 编程 Peter 自嘲花近 $3000 买 MacBook Pro 的借口是跑本地模型,但实际主要用途还是 AI 辅助编程。faviconx.com

swyx(AI Engineer)

  • Braintrust + The Trainline 生产实践 workshop:拆分 LLM 调用链是关键 swyx 转发 AI Engineer 活动,展示了欧洲下载量第一的铁路 App(27M MAU,£5.3B 年票务额)如何在生产环境中将单一 LLM 调用拆分为 triage、policy review、reply generation 等专门阶段,配合 golden sets 做回归测试。faviconx.com

  • Chrome Extension 应在所有图片输入框加入 TLDraw 绘图 + AI 生成功能 swyx 向 Devin.ai 提需求:希望在所有网页的图片输入框增加文字生成、图片绘制(无 AI)、以及用文字/绘图生成符合尺寸要求的图片三种能力。faviconx.com

Garry Tan(继续)

  • Seattle 科技公司正在加速撤离 Garry 转发了多位连续创业者关于 Seattle 新市长 Katie Wilson 对科技企业征税的态度——包括「bye」的回应——导致 8 对创业夫妻全部计划搬离,28 年创业老兵宣布离开。Gary 转发并感叹「再见西雅图,你将成为自己的墓碑」。faviconx.com

技术动态

Yann LeCun(Meta 首席 AI 科学家)

  • Stanford 研讨会:World Model 从重建转向隐空间预测 Yann 转发了 Stanford 最新 seminar,聚焦 JEPA(Joint Embedding Predictive Architecture)、Causal JEPA 和 LOWER Model,探讨世界模型从传统重建方法向隐空间预测的范式转移。faviconx.com

  • Paul Dirac 在第五届索尔维会议上的发言:宗教是假想的产物 Yann 转发 Dirac 关于科学与宗教的评论,原始出处为第五届索尔维国际会议记录。faviconx.com

X / Twitter

83
garrytan
garrytan @garrytan
How to drive the billionaires out and ruin the California tax base in one fell swoop: propose asset seizure measures

Make it make sense! Middle class taxpayers will take on all the billions in tax revenue lost

Blake Byers: How the 5% California wealth tax is a 67% wealth tax for Sergey Brin:
>Owns 3% of GOOG.
>Holds 25.3% of voting rights
>Wealth tax is assessed as the greater of his ownership or voting rights. So his tax is 5% * his 25.3% voting ownership = 1.27% of the value of GOOG.
>1.27%
petergyang
petergyang @petergyang
Feels so good when you have a breakthrough with Codex
swyx
swyx @swyx
Re now there is a mac app for it https://github.com/darrylmorley/whatcable
petergyang
petergyang @petergyang
I spent close to $3,000 for a Macbook Pro so that I can try running local models.

At least, that was my excuse. In reality, I am running...😅
garrytan
garrytan @garrytan
Retweeted
Ethan Mollick Ethan Mollick
I was quoted a couple times in this Atlantic article, but that isn’t (the only) reason I think it is good. It lays out the reasons why we whipsawed from “AI is a bubble” to “there are not enough data centers” in less than six months. Spoiler: its agents. https://www.theatlantic.com/economy/2026/05/ai-bubble-revenue-anthropic/687022/
sama
sama @sama
man its good to be back on twitter

there is comfort in the skills of a wasted youth
sama
sama @sama
/hatch clippy
swyx
swyx @swyx
Retweeted
AI Engineer AI Engineer
People are really enjoying our full workshops showing end to end walkthroughs of real production workflows!
This is a rare double header with @braintrust's Giran Moodley and @OussamaHaff walking though the real life AI engineering behind @thetrainline, Europe's #1 most downloaded rail app with 27m MAU and £5.3B in ticket sales!
the workshop bundles several important lessons:
- break down monolithic LLM calls into specialized stages (e.g., triage, policy review, and reply generation)
- how to monitor latency, token usage, and costs effectively with end-to-end tracing of agentic flows
- using "golden sets" (a curated set of test inputs) to identify failure modes
- how to move from local development to a managed environment where prompts and scoring functions are version-controlled
- how to allow non-technical team members to collaborate and update model parameters without code changes
- how to identify production regressions, replay failures, and apply targeted fixes to improve system reliability continuously
enjoy!
Braintrust: Watch here → https://braintrustdata.link/AI-engineer-session
petergyang
petergyang @petergyang
First broken Codex feature I've come across
amasad
amasad @amasad
Retweeted
Zaid Jilani Zaid Jilani
Why is Randy Fine branching out in his racism, did an Armenian buffet kick him out because he violated the three hour time limit?
ANCA: “We don’t want Armenians to be able to serve in Congress.”
The ANCA condemns this racist anti-Armenian rant by US Rep. Randy Fine (R-FL), cosponsor of a reckless Congressional resolution to ship US arms and aid to genocidal Azerbaijan
garrytan
garrytan @garrytan
Retweeted
Jared Friedman Jared Friedman
Software engineering job descriptions should really start saying whether they include /fast mode or not.
garrytan
garrytan @garrytan
Retweeted
Jared Friedman Jared Friedman
Datacenters in space are making more and more sense.
Garry Tan: Per one-gigawatt data center complex: 5,322 permanent jobs, $157M per year in state taxes, $248M per year in local taxes. During construction: $2.67B in combined investment
But nobody managed to tell Seattle this, so it's banned.
sama
sama @sama
we will plan bigger parties for future releases.

a lot more people wanted to come than we expected. thank you!

gonna try to think of a really good idea for the next one.
garrytan
garrytan @garrytan
Retweeted
Steven Pu Steven Pu
Been reading through gbrain code for a few days. I may be late to the party but this looks like a new class of AI-native apps.
What differentiates an AI-native app vs. an app that uses AI? Here's what I surmised from gbrain.
garrytan
garrytan @garrytan
Retweeted
Nick Davidov Nick Davidov
I love California, but If the CA wealth tax passes I’m likely to leave. I’m not anywhere close to a billionaire but none of the taxes our companies or family pays would go to support this lunacy. I think I won’t be alone. Bankruptcy and austerity might actually be better for California in long term even though it will hurt a lot of people not deserving this in the short term. People who turn on their cars praying not to see a check engine light. While bureaucrats throw billions of public money around on waste, fraud, and destroying the markets.
ylecun
ylecun @ylecun
Retweeted
𝗿𝗮𝗺𝗮𝗸𝗿𝘂𝘀𝗵𝗻𝗮— 𝗲/𝗮𝗰𝗰 𝗿𝗮𝗺𝗮𝗸𝗿𝘂𝘀𝗵𝗻𝗮— 𝗲/𝗮𝗰𝗰
Stanford's latest seminar is a deep dive into the evolution of world modeling in AI.
Focuses on the shift in the world model from traditional reconstruction methods toward latent space prediction.
Covers topics like:
- Introduction to JEPA & World Models
- Causal JEPA
- LOWER Model
- Practical Applications & Planning
- Future Outlook
garrytan
garrytan @garrytan
Retweeted
Timur Kuran Timur Kuran
So appalling one doesn’t know where to start. Taxpayer-funded public schools have no business meddling in politics, turning kids into activists, or wasting classroom time on causes that serve teachers—not students.
Corey A. DeAngelis, school choice evangelist: "We are organizing our school so that the kindergarten through 6th grade students are also going to be able to march."
swyx
swyx @swyx
request for chrome extension that augments all image input boxes on the web:

- lets me generate a simple word text thing (no ai) OR
- draw something with @tldraw (no ai) OR
- use either words or drawings to generate something of the required proportions

@devinai do it pls
garrytan
garrytan @garrytan
Retweeted
Jesse Proudman Jesse Proudman
Had dinner tonight with 8 entrepreneurial couples. Every single
one is moving.
Good bye @MayorofSeattle.
You’re the nail in the coffin of Seattle and you will be memorialized for it.
amasad
amasad @amasad
Retweeted
Raouf Chebri Raouf Chebri
What's new this week on Replit
- Replit turns 10 🎉and Agent is free for everyone on May 2
- App Monitoring with Agent: real-time uptime checks and downtime alerts on every paid plan
- Build full slide decks with Agent and export to PPTX, Google Slides, or PDF
amasad
amasad @amasad
Replit, turned 10 🎂

To celebrate we’re making it totally free for 24 hours starting at 5am PT.

But our work—to make coding accessible for all—goes back to 2011.

Watch the highlights from the journey:

It’s been an honor to help millions learn & ship. Here is to the next 10!
garrytan
garrytan @garrytan
Retweeted
Amjad Masad Amjad Masad
Replit, turned 10 🎂
To celebrate we’re making it totally free for 24 hours starting at 5am PT.
But our work—to make coding accessible for all—goes back to 2011.
Watch the highlights from the journey:
It’s been an honor to help millions learn & ship. Here is to the next 10!
amasad
amasad @amasad
Retweeted
Amjad Masad Amjad Masad
Replit, turned 10 🎂
To celebrate we’re making it totally free for 24 hours starting at 5am PT.
But our work—to make coding accessible for all—goes back to 2011.
Watch the highlights from the journey:
It’s been an honor to help millions learn & ship. Here is to the next 10!
garrytan
garrytan @garrytan
Retweeted
💥Susan Dyer Reynolds🗞️ 💥Susan Dyer Reynolds🗞️
“Supervisor Jackie Fielder, who is currently on a leave of absence following a personal health crisis, was also in attendance.”
Wait. WHAT? ⁦@RafaelMandelman⁩ if she’s well enough to attend an airport protest she’s well enough to work. https://missionlocal.org/2026/05/s-f-supervisors-past-and-present-arrested-at-sfo-anti-ice-protest/
ylecun
ylecun @ylecun
Retweeted
Captain Insight Captain Insight
Neural networks were declared scientifically dead in 1987.
A French PhD student bet his entire career on them anyway ~ and won. 🤯
>Meet Yann LeCun 🇫🇷
>Paris-born. PhD from Sorbonne in 1987.
>Joined Bell Labs in 1988. Kept building. Alone.
>In 1989, built Convolutional Neural Networks
>By the late 90s, his CNN was reading 10% of US bank checks
>The industry called it a niche trick. Ignored him for over a decade.
>Then 2012 hit. Deep learning exploded.
>His “dead” research became the blueprint for everything
> ChatGPT. Gemini. Claude. Grok. All standing on his shoulders. 🚀
>Won the 2018 Turing Award ~ computing’s Nobel Prize
>Became Chief AI Scientist at Meta “Godfather of AI.”
>Now publicly says LLMs are a dead end.
>Fights the entire industry.
>Left Meta in late 2025 to build AMI Labs in Paris
>Already valued at $3.5B before launching. World models, not LLMs.
The industry ignored him for over 20 years.
Now he’s ignoring the industry.
Absolute Legend 🐐
ylecun
ylecun @ylecun
Retweeted
Maryam Maryam
2022: “Stop overreacting, they won’t overturn Roe.”
They did.
2023: “Stop overreacting, they won’t let women die rather than get an abortion.”
They did.
2024: “Stop overreacting, they won’t arrest women for miscarriages.”
They did.
2025: “Stop overreacting, they won’t turn women into incubators.”
They did.
2026: “Stop overreacting, they won’t attack mifepristone.”
They did, today.
Now: “Stop overreacting, they won’t go after birth control next.”
They will.
amasad
amasad @amasad
Retweeted
Yuli Kay Yuli Kay
You can build apps, games, websites, ANYTHING FOR FREE on http://replit.com 🤯
But you have only 24 HOURS starting 5 AM PST | 1 PM UTC
Check out what I've already built with @Replit
Replit ⠕: Replit Agent is free tomorrow for everyone starting at 5am PST
Show use what you can build in 24 hours
And Replit is turning10! A trip down the memory lane on what got us here
amasad
amasad @amasad
Retweeted
Replit ⠕ Replit ⠕
Countdown to Free Agent for 24 Hours + Buildathon Kickoff: $100K+ in Prizes https://x.com/i/broadcasts/1rxmqomNPNwxy
amasad
amasad @amasad
Retweeted
Francisco Cruz Mendoza Francisco Cruz Mendoza
Huge shoutout to the @Replit engineering team for sticking around all night ahead of the 24 hour Replit 10 Buildathon 🎉
See you all live at 4:30amPT!
amasad
amasad @amasad
Retweeted
Jennie Littleton Jennie Littleton
Hour 1 of 24 for the @replit 10 year anniversary buildathon! Huge thanks to @amasad @raymmar @MannyBernabe @Franciscocrz for the incredible opportunity. I'll be working on http://getthew.app and a few side projects. 👀 livestream link dropping later
👉What are you building?
garrytan
garrytan @garrytan
Retweeted
aviel aviel
Ok, I’ve finally processed how I experienced this, and it’s a big deal. I grew up hearing stories from my father and grandfather about the Soviet Union, but this was the first time I truly understood them, like the difference between hearing stories about having a child and actually holding your own newborn. Like an ancestral alarm. It’s devastating. Everything in me is screaming to divest from the region that I’ve poured my adult life into to survive. The feeling of loss is immeasurable, and the casual “bye” just makes it worse. The damage from the lack of empathy here will create a cycle of attacks that take decades to undo and will bloody the hands of everyone around me, there are no sidelines in my line of work. This also isn’t about taxes, the cost of reorienting my life is infinitely greater. It’s now primal and existential.
Brandi Kruse: INSANE. Seattle's Socialist Mayor responds to exodus of wealth from Washington state by saying "BYE" ... then laughing. We're doomed.
ylecun
ylecun @ylecun
Retweeted
Republicans against Trump Republicans against Trump
This is insane
Three Trump judicial nominees refused, over and over, to say Joe Biden won the 2020 election. They either believe Trump’s lie that the election was stolen, or they’re too afraid of him to tell the truth.
This isn’t about their political views. It’s about recognizing reality and being part of an independent judiciary
amasad
amasad @amasad
Retweeted
George Ohan George Ohan
Re @Replit agent just worked 25 minutes for free.
One more attempt at #georgiejobsapp
garrytan
garrytan @garrytan
Many such cases

Just Samuel: @garrytan Been using GBrain. It’s da best experience so far. 👍🏻
amasad
amasad @amasad
Retweeted
Dave Gambrill Dave Gambrill
Using @WisprFlow and @Replit I made this in like 20 minutes just by literally talking about it. If you aren't playing with these tools, you are missing out. You need zero tech ability to do this.
https://dave-tech-resource-hub.replit.app/
garrytan
garrytan @garrytan
Retweeted
Bhargav Gajjar Bhargav Gajjar
My token usage has become very efficient after using GBrain @garrytan
petergyang
petergyang @petergyang
Most people give AI one-line prompts and wonder why their app looks like slop.

My next guest, Ravi, has built a 3-layer context system that fixes this:

→ Functional: What the app does.
→ Visual: What the app looks like.
→ Data: How the data structure works.

The data layer is the most underrated and including it in your prompt lets you create much more flexible prototypes and apps.

📌 Subscribe to get our full episode tmr: https://www.youtube.com/@PeterYangYT?subscribe
petergyang
petergyang @petergyang
Retweeted
Peter Yang Peter Yang
Most people give AI one-line prompts and wonder why their app looks like slop.
My next guest, Ravi, has built a 3-layer context system that fixes this:
→ Functional: What the app does.
→ Visual: What the app looks like.
→ Data: How the data structure works.
The data layer is the most underrated and including it in your prompt lets you create much more flexible prototypes and apps.
📌 Subscribe to get our full episode tmr: https://www.youtube.com/@PeterYangYT?subscribe
garrytan
garrytan @garrytan
Retweeted
Robert Pondiscio Robert Pondiscio
A valuable look at how politicians are stage managed by their handlers. Keeping the camera rolling and posting it publicly, awkward silences and all, is an act of civic hygiene.
Ari Hoffman: Staffers for Seattle Socialist Mayor Katie Wilson abruptly end an interview with KOMO News Senior Reporter Chris Daniels when she can't answer basic questions
Wilson has been criticized for dodging the press & being unable to answer basic questions since she came into office
garrytan
garrytan @garrytan
Retweeted
Will Manidis Will Manidis
I don’t think any of you have processed at any level how widespread and profound the ai water libel is
petergyang
petergyang @petergyang
My OpenClaw is going to have a very poor performance review this quarter
garrytan
garrytan @garrytan
Professor Hamming called it: neural nets were the solution to the programming problem. Ilya had to point to scaling for it to work.
garrytan
garrytan @garrytan
Retweeted
Jesse Proudman Jesse Proudman
I started my first company from my bedroom in Tacoma when I was 13, dreaming of one day building something like the tech companies I watched flourish in Seattle. I remember being awed that someone could create a company from nothing and I knew that's what I wanted to do with my life.
After 28 years of building, it's heartbreaking to watch Seattle's leaders shift from celebrating entrepreneurs to making clear we're the problem. The tax bill is just the price tag on their contempt.
https://www.foxnews.com/media/seattle-ai-founder-looks-leave-taxes-rise-everybody-i-know-process-leaving
garrytan
garrytan @garrytan
Retweeted
Pejman Pour-Moezzi Pejman Pour-Moezzi
Run gstack's /office-hours right in the web, no terminal needed!
Skillet uses Anthropic's new Managed Agents to spin up a Claude Agent SDK with skills installed exactly like Claude Code.
Perfect for non-technical people that don't want to mess with terminals.
Chat with @garrytan now (no signup or API keys needed): https://skilletweb.com/office-hours/new
garrytan
garrytan @garrytan
Patrick Wolff just got endorsed by the SF Chronicle

Common sense is winning

https://www.sfchronicle.com/opinion/editorials/article/patrick-wolff-insurance-commissioner-california-22103717.php

Garry Tan: Californians can’t get proper insurance

Why? Because it’s been wholly mismanaged by machine politicians who aren’t very smart

What do we do about it? Elect someone smart who can fix it

That’s Patrick Wolff
garrytan
garrytan @garrytan
Retweeted
Chief Nerd Chief Nerd
Sam Altman Says CEO’s Who Talk About AI Taking Everyone’s Jobs Are ‘Tone Deaf’
“Someone said to me just yesterday that … GPT 5.5 in Codex can accomplish in an hour what would have taken me weeks two years ago … and I have never been busier in my life.”
amasad
amasad @amasad
Retweeted
Jennie Littleton Jennie Littleton
Hour 4 & 5 update of the @Replit 24 hour buildathon. Links below to the two live builds 👇
@raymmar @MannyBernabe check out the second link, it's a new buildathon timer site that I'm super excited to watch evolve 🐳
garrytan
garrytan @garrytan
Retweeted
Eric Ries Eric Ries
The ability of an LLM to help readers make connections events from their own life is going to unlock a lot of interesting new forms of reading and - even better - understanding texts.
Here's a great example from @garrytan
Garry Tan: book-mirror is the flagship.
Hand it a book, get a personalized two-column analysis. Left shows the author's idea. Right maps every idea to your actual life using your own words from the brain.
Here's the example based on a yet-unreleased book by @ericries Incorruptible
petergyang
petergyang @petergyang
It's very satisfying to get Codex or Claude Code to "marie kondo" your local files and Google Drive.

I give these apps full access to my computer and gws (google workspace cli), then prompt things like:

"Tell me what apps load on computer bootup. Give me a plan to clean this up."

"Look at my downloads folder. Give me a plan to clean up and organize it."

"Help me organize my Google Drive. Let's review your plan first before doing anything."

Note that I always ask it for a plan first. These are semi-dangerous operations so try them at your own risk.

Anyway, my files and Drive now spark joy 🤣
petergyang
petergyang @petergyang
Retweeted
Peter Yang Peter Yang
It's very satisfying to get Codex or Claude Code to "marie kondo" your local files and Google Drive.
I give these apps full access to my computer and gws (google workspace cli), then prompt things like:
"Tell me what apps load on computer bootup. Give me a plan to clean this up."
"Look at my downloads folder. Give me a plan to clean up and organize it."
"Help me organize my Google Drive. Let's review your plan first before doing anything."
Note that I always ask it for a plan first. These are semi-dangerous operations so try them at your own risk.
Anyway, my files and Drive now spark joy 🤣
garrytan
garrytan @garrytan
The Homebrew computer club phase of anything is the most fun

I am savoring it

Garry Tan: One note: GBrain is not batteries included. It is experimental and has rough edges. It, like OpenClaw, is a Ferrari that lets you experience insanely cool things but you better bring your wrench!

It will not be like that forever but for now it still in Homebrew Computer Club
garrytan
garrytan @garrytan
It’s not gonna be void deer or boson cutter it turns out

Prem Makeig / premm.eth: @garrytan Some new job titles:

- Personal agent designer
- Second brain engineer
- Context editor

People are going to care a lot about their personal agents, and they will want help designing them.
garrytan
garrytan @garrytan
Many such cases

snowblue: @garrytan It's been incredibly helpful, though. I'm not a coder, but openclaw has turned me into one. I'd started building something of my own that was similar when I came across GBrain. The scaffolding and principles it embodies has supercharged my efforts and I really appreciate it!
garrytan
garrytan @garrytan
Retweeted
The Seattle Times The Seattle Times
They say a gaffe is when a politician tells the truth. Seattle Mayor Katie Wilson saying "bye" to the wealthy upset about taxes is not the kind of truth Seattle needs right now, writes columnist Danny Westneat. https://www.seattletimes.com/seattle-news/politics/the-gaffes-are-becoming-a-pattern-for-seattles-new-mayor/?utm_medium=social&utm_campaign=owned_echobox_tw_m&utm_source=Twitter#Echobox=1777738154-2
amasad
amasad @amasad
Retweeted
TommyYipxyz TommyYipxyz
All the tasks seem to be coming in. Everything's going well. Excited to push it to deployment. Let's go at Replit. Happy birthday once more! Shout out to the support team for getting it under control. @ReplitSupport @Replit
ylecun
ylecun @ylecun
Retweeted
Saganism Saganism
"If we are honest — and scientists have to be — we must admit that religion is a jumble of false assertions, with no basis in reality. The very idea of God is a product of the human imagination. It is quite understandable why primitive people, who were so much more exposed to the overpowering forces of nature than we are today, should have personified these forces in fear and trembling. But nowadays, when we understand so many natural processes, we have no need for such solutions. I can't for the life of me see how the postulate of an Almighty God helps us in any way."
— Paul Dirac, Remarks made during the Fifth Solvay International Conference
garrytan
garrytan @garrytan
Retweeted
Austen Allred Austen Allred
Hahahaha I love this app
Martin Shkreli: Ro pretends he is a modest and humble guy. Over the course of the next few months, I will reveal much more about Ro Khanna. He's just a rich guy lying to everyone about virtually everything.
At the end of this, I predict his wife will file for divorce.
amasad
amasad @amasad
Retweeted
Shaun Willis Shaun Willis
i'm buzzing. this is about the most insane productivity boost in my life that i've ever experienced. @Replit full throttle is by far the greatest thing in software right now. I can't believe there is not more people on the app right now building
amasad
amasad @amasad
Retweeted
Shaun Willis Shaun Willis
Re @replit building in replit right now feels like this
gdb
gdb @gdb
gallery for codex pet sharing:

Hunter ♠️: Built Petdex, a public gallery to discover, share, and install Codex pets with one curl.

Submissions open at link below 👇

amasad
amasad @amasad
Shout out to @replit engineers and support team keeping everything together as users run armies of agents building everything they ever dreamed of 😅

Shaun Willis: @replit building in replit right now feels like this

amasad
amasad @amasad
22 ACTIVE PARALLEL AGENTS…. and 13 in draft 😭

Shaun Willis: Full steam ahead! @Replit

garrytan
garrytan @garrytan
Retweeted
Browser Use Browser Use
We're creating SKILL files for all websites
Contribute a domain skill to browser-harness
Saurav Panda: domain skills are the most fun PRs i've ever merged.
you don't hand-write them. the agent does the task in your browser, figures out the selectors and edge cases, and writes the skill itself. you just open the PR.
linkedin, amazon, expenses, whatever you do daily - contribute
sama
sama @sama
this is great

Boaz Barak: My colleagues have been posting so many cool research results on the @OpenAI alignment blog! A few examples in 🧵

https://alignment.openai.com/
sama
sama @sama
5.5 xhigh in fast mode is

really good

i think i got psyoped by twitter on medium for a bit
sama
sama @sama
never thought id be watching F1 via the kids broadcast

cannot imagine being happier
gdb
gdb @gdb
codex for improving your ergonomics

jason liu: With codex I don’t need a second monitor I turned it into a standing desk

ylecun
ylecun @ylecun
https://youtu.be/kYkIdXwW2AE?si=hV2ANEl-wPh1MSU1
swyx
swyx @swyx
Retweeted
Ron Alfa Ron Alfa
Loved the vibes with @latentspacepod, was a lot of fun.
Latent.Space: 🔬 Training Transformers to solve 95% failure rate of Cancer Trials
the AI for Science pod is back with @RonAlfa, CEO of @NOETIK_ai, and Daniel Bear, VP Research at Noetik, explaining exactly how their team of top AI x Bio researchers and engineers (shoutout @owl_posting) will
sama
sama @sama
i keep thinking i want the models to be cheaper/faster more than i want them to be smarter

but it seems that just being smarter is still the most important thing
garrytan
garrytan @garrytan
Retweeted
Sam Altman Sam Altman
Re @hsu_steve mogging
garrytan
garrytan @garrytan
This is correct


Hugo Amsellem: http://x.com/i/article/2049920112707137536
garrytan
garrytan @garrytan
Retweeted
Lisan al Gaib Lisan al Gaib
I think returns to intelligence are nonlinear because decisions are path-dependent
early choices in code, experiments, or strategy can compound positively or negatively over time
for example by avoiding dead ends or preserving optionality
it's why I am a big fan of very long running tasks and massive benchmarking budgets
GPT-5.5 and Mythos Preview are only marginally more intelligent than previous models and have pretty much the same performance up to 10M tokens, but after that they go absolutely ballistic
Sam Altman: i keep thinking i want the models to be cheaper/faster more than i want them to be smarter
but it seems that just being smarter is still the most important thing
amasad
amasad @amasad
“Prompt” took on an entirely new meaning but somehow many things stayed the same.


Amjad Masad: Replit, turned 10 🎂

To celebrate we’re making it totally free for 24 hours starting at 5am PT.

But our work—to make coding accessible for all—goes back to 2011.

Watch the highlights from the journey:

It’s been an honor to help millions learn & ship. Here is to the next 10!

garrytan
garrytan @garrytan
Retweeted
Danielle Fong 🔆 Danielle Fong 🔆
Protesters shut down Berkeley Forum event hosting @jeffdean is some emblematic circular firing squad stuff come on. Jeff has been out spoken about human rights repeatedly, but instead of even engaging i guess the whole event was shut down. typical!! https://www.dailycal.org/news/campus/protesters-shut-down-berkeley-forum-event-hosting-google-ai-scientist/article_9dd82646-3c37-48b5-8dd4-61a5050646ce.html?utm_medium=social&utm_source=twitter&utm_campaign=user-share
garrytan
garrytan @garrytan
Retweeted
Bexly Bexly
Coined this last year: @garrytan wields the OG switch
“The great inversion is almost here.
More non-technical people exist than traditionally “technical”
Understand this simple economic factor. Your only job is to facilitate this reality faster by way of niche distribution”
Hugo Amsellem: http://x.com/i/article/2049920112707137536
garrytan
garrytan @garrytan
Retweeted
Vox Vox
gbrain 0.25.1 shipped a feature i think is genuinely powerful.
feed your openclaw / hermes a book you've been reading, the agent uses the real you in your brain to map every chapter to what you're actually working on.
drop in Atomic Habits and the agent maps every chapter against your brain's actual reflections on your morning routine, writing, running streak. reads like a therapist who's been reading your notes, scribbling in the margins.
this is where long-term logging compounds. the more complete your brain, the more the book reads you.
those dozen unfinished books finally have a reason to come back out
Garry Tan: GBrain v0.25.1 now ships with the book-mirror skillpack by default. Yes, you can upload an epub and if your brain is full of knowledge about you, it'll relate each idea to something you are working on, care about, or are thinking about.
garrytan
garrytan @garrytan
Whoever at Android forced this mess of how work profiles and personal profiles work together (really they don’t) should have been fired

It’s a textbook example of PMs making the wrong call and letting the bad decision fester for years
garrytan
garrytan @garrytan
Retweeted
Noah Smith 🐇🇺🇸🇺🇦🇹🇼 Noah Smith 🐇🇺🇸🇺🇦🇹🇼
I'm in favor of taxing the ultra-rich. But California's "billionaire tax" is a poorly-designed piece of slopulism.
https://www.noahpinion.blog/p/californias-billionaire-tax-is-the
amasad
amasad @amasad
Retweeted
Gatis Gatis
I’m on @Replit Pro plan now, parallel task is very impressive, after almost 12 hour in it’s working like 🔥 @Franciscocrz
garrytan
garrytan @garrytan
Retweeted
Kane 謝凱堯 Kane 謝凱堯
Silicon Valley congressman @RoKhanna is trying to impose more taxes on Californians who made their wealth instead of getting it the Correct way like him: being handed it by family and trading on congressional insider information.
Arthur MacWaters: > be Ro Khanna
> #2 most active “trader” in Congress
> literally $600m in trades
> net worth 10s-100s of millions
> “it’s not my money it’s my wife’s”
>…
> “tax the billionaires”
> asset seizure with explicit direct path to every citizen
> already caused the largest wealth
mattshumer_
mattshumer_ @mattshumer_
There are no good bagels on the UWS.

If someone opens a solid shop, they’re gonna make a killing.
garrytan
garrytan @garrytan
Retweeted
小盖 小盖
强烈推荐大家看看DeepMind CEO Demis的最新判断。
真的,Google DeepMind 的 CEO Demis Hassabis 每一期访谈我觉得值得都花时间看看。这哥们讲东西很实在,而且通俗易懂。
早上边跑步边听完了他和 YC CEO Garry Tan 的最新一期播客。
刚刚把笔记写完,也给大家分享下。
多说一句,好多人问我这种笔记是不是 AI 写的。我说下自己的流程。
我会先完整听完播客,然后用语音输入法把感触尽量充分地讲出来,再让 AI 帮着整理初稿,最后自己逐字修改优化。
如果全部交给 AI 做总结,那等于把思考和理解的能力让渡给了 AI,对自己理解这件事其实没有任何价值。
OK,咱们进正题。
1
Demis 的态度非常明确,现在的大模型范式(大规模预训练 + RLHF + CoT)一定会是 AGI 最终架构的一部分,他不认为这会是条死路。
但要实现 AGI,还有几个关键问题要解决。这几个问题包括:持续学习、长程推理和记忆系统。
先从最容易看到的现象讲起,Context Window。
现在大模型处理长信息,最常用的招就是把 Context Window 一直撑大。一开始 8k,后来 32k,再后来 100 万 Token。听起来很厉害,但本质上是暴力堆砌。
Context Window 其实就相当于人脑里的 Working Memory,工作记忆。人的工作记忆能同时装多少东西?心理学里有个经典数字,7 个左右。背电话号码能记住 7 位上下,再多就溢出了。
大模型呢?已经做到 100 万 Token。
按理说,模型的工作记忆比人大几十万倍,应该比人聪明几十万倍才对。但显然不是。
问题也恰恰就出现在这。把所有东西都塞进 Context Window 里,里面包含了不重要的东西、错的东西、过时的东西。看起来信息很多,其实是一团乱麻。
那人为什么 7 个数字的工作记忆就够用?
因为人脑背后还有另一套机制在工作。我们记得几年前的事,记得童年的事,记得几小时前发生的事。这些都不塞在工作记忆里,而是另一套系统。
具体来说这套系统是海马体,大脑里负责把新知识整合进已有知识库的那个部分。
研究发现,人睡觉的时候,特别是 REM 睡眠阶段,大脑会重放白天重要的片段,让大脑从中学习。新东西在睡觉的过程里,温柔地融进了旧的知识体系。
这个把新东西融进旧知识库的过程,就是持续学习。
模型现在没有这套机制。每一次对话结束,刚学到的东西就会忘记。下次重新打开,还是上次那个模型,没长进。
2
再聊聊长程推理的问题。英文表达是 Long-term Reasoning。我翻译为了长程。
长程推理这个词太抽象了。Demis 讲了一个特别具体的故事,听完会立刻明白他说的是什么。
他说自己喜欢跟 Gemini 下国际象棋。下棋的过程里能看到模型的 thinking trace,也就是它在那里到底想了什么。
然后他发现一件怪事。
模型考虑一步棋的时候,思考链里清清楚楚写着,这步是个昏招。但接下来,它没找到更好的走法,于是又走回这步昏招。
明明知道是错的,还是把错的那一步走出去了。
这个细节比任何 benchmark 数据都说明问题。因为它暴露的是模型缺少对自己思考过程的某种内省能力。
正常人下棋,意识到一步是昏招之后,脑子里会有一个反应,停一下,再想想。停一下、再想想这个能力,模型现在没有。它能在每一步局部判断对错,但没法基于整盘棋的局势去调整整体策略。
这就是长程推理还没搞定的样子。模型可以一步一步往前走,每一步看起来都合理,但走到后面整盘棋的方向其实是错的。它没有那种退回到当前思考的上一层、重新审视一下的能力。
说到底,模型缺的是一种内省。
3
学习、长程推理、记忆,这是 Demis 在播客里点出来的三个 AGI 鸿沟。
除此之外,他还反复提到了创造力。
2016 年 AlphaGo 跟李世石下棋,第二局走出了著名的 Move 37。那一步棋走出来的瞬间,全世界的围棋高手都看呆了。
所有人类几千年下围棋积累的经验都告诉它不该下那里,但 AlphaGo 下了。下完之后大家发现,是一步神来之笔。
很多人觉得,这就是 AI 的创造力来了。
但 Demis 说,对他自己来说,Move 37 只是起点。他真正想看到的是另一件事。AI 能不能发明围棋这件事本身。
这两件事的区别非常关键。
Move 37 是在围棋这个现成的规则里,找到了一步人类没想到的招。但围棋的规则、棋盘的形状、黑白子的对弈方式,是人类发明出来的。AI 在已有的框架里非常厉害,但能不能自己造一个框架,是另外一回事。
Demis 给了一个具体的设想。
如果给 AI 一个高层次的描述。造一个游戏,五分钟能学会规则,要好几辈子才能精通,棋局有审美,一下午能下完一局。AI 能不能根据这个描述,自己倒推出围棋?
目前做不到。
为了把这件事讲得更清楚,Demis 还提了一个测试,他自己叫爱因斯坦测试。
用 1901 年人类已有的全部知识训练一个模型,看它能不能在 1905 年那个时间点,自己推出狭义相对论。
爱因斯坦在 1905 年那一年里,连写了几篇改变物理学的论文,后来叫爱因斯坦奇迹年。那些工作不是从已有的物理学论文里通过拼接得到的,是基于已有材料做了一次全新的概念跳跃。
爱因斯坦测试想问的就是这件事。AI 能不能做这种跳跃。
目前的大模型主要在做两件事,pattern matching 和 extrapolation。一个是从大量数据里找规律,一个是把规律往外延伸一点。但发现新东西需要的是类比推理的能力。从一个领域里抽出深层结构,搬到另一个全新的领域去用。
这个能力,模型现在还没有。也可能是有,但用法不对所以激发不出来。
4
除此之外,Demis 还分享了一个让我特别出乎意料的判断,他说未来 6 到 12 个月,真正的价值不在更大的模型,在更小的模型。
这一部分内容我反复听了好几次,确实突破我的已有认知。
不知道大家的想法,反正我自己,这一年来并没有怎么关注小模型的进展。毕竟行业的焦点就是把模型做大嘛。
那小模型的价值到底在哪?
最直接的是成本。同样一个任务,小模型的推理价格可能只是前沿模型的十分之一甚至更少。
但 Demis 说,比成本更重要的其实是速度。
这里有一个前提得先说清楚。Demis 不是在说速度可以替代智能。
他的原话是,当小模型的能力已经达到前沿模型的 90% 到 95%,也就是已经相当不错的时候,剩下那 5% 到 10% 的能力差距,比不上速度带来的好处。
比如现在工程师用 AI 写代码,已经形成了一种新的工作节奏。一个想法冒出来,几秒之内就能看到结果,不行就改,再不行再改。
这个一改再改的循环跑得越快,做出来的东西就越好。如果每次调用都要等十秒,整个工作流就被打断了。
更关键的是,快到一定程度,工程师在这种节奏里能进入心流。一个想法、一次尝试、一个反馈、再来一个想法,思维不被打断。
这件事写过代码的人都懂,进入心流和频繁掉出心流,产出的差距是数量级的。
Agent 也是同样的逻辑。一个 Agent 跑完一个任务可能要调几十次模型,每次慢一秒,整个任务就慢一分钟。慢到一定程度,Agent 就从一个能用的东西变成鸡肋。
小模型不是大模型的廉价替代品。有些事只有小模型能做。
比如手机、眼镜、家用机器人,需要的就是一个能在本地跑起来的模型。本地跑除了反应快,还有一个特别重要的好处,隐私。
家里机器人看到的视频、听到的对话,全部在设备本地处理,根本不上云。这件事对很多用户来说不是加分项,是底线。
成本、速度、边缘部署,这是小模型的价值。
5
讲完小模型的价值,接下来一个更关键的问题是,能力被压到这么小的参数里,会不会有上限?
Demis 的判断是,目前没看到信息密度有任何理论上限。小模型的智能天花板还远没看到。
支撑这个判断的,是 DeepMind 在蒸馏这件事上的积累。蒸馏简单说就是先训练一个超大的模型,然后用这个超大模型去教一个小模型。教完之后,小模型用极少的参数,能复现原来 95% 以上的能力。
为什么 DeepMind 这么重视蒸馏?因为要把 AI 能力放进谷歌的头部产品中,前提是低延迟、低成本。前沿模型再强,每次推理花几秒钟、花几毛钱...这条路,恐怕很难走得通。
一个前沿模型发布之后,6 到 12 个月内,他们就能把这个模型的能力蒸馏到边缘设备能跑的小模型上去。这个时间表比很多人想的要快。
在很多场景中,小模型和大模型会相互配合。
举个例子,一个端到端的智能助手,绝大部分日常任务在本地的小模型上跑。智能眼镜看到的画面、家里机器人听到的对话、手机里的私人助理,模型直接在设备里读懂,不需要往云端传一遍。
只有遇到特别复杂、本地搞不定的问题,才向云端的前沿模型发起请求。
也就是说小模型在边缘做主力,前沿模型在云端做后援。
不过,这个构想对小模型的要求也比较高,它不能只会处理文字,还得能理解物理世界。
这就是为什么 Gemini 从一开始就坚持多模态,不光处理文字,也处理图像、视频、声音。
一开始这么做比只做文本要难得多,但眼镜也好,机器人也好,需要的是一个能看懂周围世界的模型,不是一个只会聊天的模型。
讲到这里,小模型这条路的轮廓就完全清楚了。它独立成立,不是前沿模型的廉价替代品,而是另一条同样重要的路。
嗯,很有启发。

YouTube

0

No recent videos fetched on this date.