← 2026-04-08

Daily Edition

2026-04-09

2026-04-10 →

AI Builders 日报 — 4月9日

追踪 AI 领域真正在做事的人,而不是空谈者。

X / Twitter 深度解读

Amjad Masad (CEO, Replit)

Replit 今日密集输出,涵盖多 agent 协作、移动端体验优化和社区活动三条主线。

  • Replit 多 agent 单项目工作流被评"唯一做到位的" 有用户发帖称 Replit 似乎是目前唯一真正搞定多 agent 协作同一项目这一难题的平台。Amjad 转推确认了这一评价,并持续推进产品迭代。 faviconx.com

  • 瑞士 Replit Builders Group 正式上线 Luma Amjad 宣布瑞士 Replit Builders Group 在 Luma 开放注册,明日 16:00 CEST 举行首次聚会,欢迎全球开发者参与。 faviconx.com

  • Replit 移动端:更大触控区域,更少视觉噪音 Replit 移动端发布新一轮优化,提升关键操作的触控目标面积,减少界面干扰。Amjad 表示这只是开始,更多功能正在路上。 faviconx.com


Garry Tan (CEO, Y Combinator)

YC 掌门人今日火力全开,横跨 AI 编程工具评论、播客新节目、融资新闻和深度行业分析多条线。

  • YC 内部 AGENTS.md 披露:如何做出"永久性 agent" Garry 分享了自己 OpenClaw AGENTS.md 的核心规则——"不允许一次性工作"原则:如果某件事以后还要再做,就把它编码成可复用的 skill。另一位开发者据此补充了"回归 bug 根因修复"规范,触发条件包括:同类问题被修正两次、cron 任务重复失败、用户表达"又出问题了"的挫败感——要求 stop + 深挖 + 永久修复,而非临时补丁。 faviconx.com

  • GStack + Claude Code:被真实用户验证的"专家级系统"扩展 YC 合伙人 Ian Hsiao 实测后表示:GStack(garrytan 的产品)将 Claude Code 体验提升到全新层次——它能扩展专家级系统,并展示了 plugin 系统+固执工作流的价值。Garry 转推并评论"Just make stuff as you might and it'll trigger as needed"。 faviconx.com

  • Nerd Snipe:YC 的新播客节目上线 Garry 宣布与 Theo 合作推出播客 "Nerd Snipe"(原名 TBPNN),已在 YouTube 和 Spotify 上线,即将登陆更多平台。 faviconx.com

  • Anjney Midha 募集 13 亿美元首支基金,AMP 向 Anthropic 投入 3 亿美元 资深 VC Anjney Midha 完成 13 亿美元首支基金募集,其中向 Anthropic 最近一轮投入 3 亿美元大额支票,已开始募集下一支基金。AI 基础设施投资热度持续攀升。 faviconx.com

  • "把某样东西当玩具的人,看到的是它的局限性" Garry 引用 Alfred Lin 的观点,并附上自己的观察:个人电脑、Mac、互联网、AI 最初都被当玩具嘲笑。"把某样东西当玩具的直觉,应该被视为一个信号——这类东西往往是未来的起点。" 这与他长期支持"车库创业"的 YC 精神一脉相承。 faviconx.com

  • Grep AI 击败所有大厂:两人团队刷榜深度研究 benchmark 仅由两人组成的 Grep AI,在三大深度研究 benchmark 上击败了 Perplexity、Google、Nvidia、OpenAI 和 Anthropic,达成 SOTA。再次印证:规模不等于创新能力。 faviconx.com

  • 数据中心的"用水危机"源头:被高估了 10 万% Karen Hao《Empire of AI》登上《纽约时报》畅销榜,但其中对数据中心用水量的估算被指高估了 10 万倍。Garry 指出错误书籍至今仍在流通、毫无更正。 faviconx.com

  • Ramp 发布企业 AI 转型指南:8 条实战经验 YC 另一被投公司 Ramp 发布了一份被 Garry 称为"市面上最好的企业 AI 采纳指南",核心 8 条:第二好的时机是今天、把 AI 熟练当作学习曲线接受、拥抱创造性破坏、从中心建设、从边缘推动、给人舞台而非命令、尽快让人到达"顿悟时刻"、消除人与 AI 之间的一切障碍。Garry 特别指出第 8 条应该是第一条——大多数企业卡在采购环节。 faviconx.com


Alex Albert (CEO, Browserbase / Claude 布道者)

  • Claude 上线 Advisor 策略:Opus 当顾问,Sonnet/Haiku 当执行 Anthropic 宣布将"advisor 策略"引入 Claude Platform:用 Opus 作为顾问,搭配 Sonnet 或 Haiku 作为执行者,以极低成本实现接近 Opus 水平的 agent 智能。这一设计将重新定义 AI agent 的成本效率。 faviconx.com

Peter Steinberger (Principal Engineer, PSPDFKit)

Peter 今日密集分享 AI Engineer Europe 现场实况,并贡献了本次最重磅的 AI 编程工具实测对比。

  • Claude Code CLI vs Codex CLI:90 分钟 vs 4 小时的真实体验 Peter 自述:在 Opus Claude Code CLI 中耗时 4 小时,几乎每次修改都需要他手动校正;而切换到 Codex CLI 后,90 分钟产出的 20 个 PR 无需任何编辑。"在 Mythos 出来之前,我 100% 站在 Codex 这边。" 这一对比在业内引发广泛共鸣。 faviconx.com

  • OpenAI 推出 $100/月 Pro 订阅:5 倍 Codex 用量 OpenAI 正式推出 $100/月 ChatGPT Pro 档位,比 Plus 多 5 倍 Codex 使用限额,限时推广至 5 月 31 日额外提升至 10 倍。 faviconx.com


Josh Woodward (Product Lead, Google Gemini)

  • Gemini Lyria 3 全量解锁:5 首完整曲目/天,50 天破亿歌曲生成 Gemini 正式解锁 Lyria 3 音乐模型,上线不足 50 天已生成超过 1 亿首歌曲。即日起每日可生成最多 5 首完整曲目(约 3 分钟/首),超出限额可继续创作 30 秒片段或升级订阅。 faviconx.com

  • Gemini 可视化升级:直接生成交互式 3D 图表 Gemini 新增交互式可视化能力,用户可调整变量、旋转 3D 模型,在对话中直接探索数据。 faviconx.com


Dan Shipper (联合创始人, Every)

Dan 今日是 AI 公司融资与开放生态领域最活跃的评论者,同时引爆了关于 OpenAI Spud 谣言的澄清。

  • OpenAI 辟谣:Cyber 产品不是 Spud,Axios 混淆了两件事 多家媒体报道"OpenAI 因安全原因限制发布最新模型 Spud",Dan 直接联系 OpenAI 后澄清:确有 Cyber 产品在Trusted Tester 组内测试,但这与 Spud 完全无关,Axios 报道已将两件事混淆后更正。Dan 连续发了多条更正推文,成为该事件的事实核查节点。 faviconx.com

  • Audos 推出 Co-vibe 模式:"Solo 创业者的 bug 不是独行,而是缺乏并肩作战的人" Dan 宣布 Audos 推出人类协作侧的 Co-vibe 功能,核心洞察来自 1 万+创业者的观察:"独行"从来不是特点,而是缺陷。 多数人不需要传统意义上的联合创始人,但需要一个能推动项目向前进的伙伴。Audos 同步推出 Portfolio-First 模式——默认运行多个项目,市场验证后再集中。 faviconx.com

  • Dan 直言:我无法停止思考 Claude Mythos Dan 连发多推表达对 Claude Mythos 的高度期待,配文"Keep calm and subscribe to @every",引发社区对 Every 深度报道能力的认可。 faviconx.com


Sam Altman (CEO, OpenAI)

  • ChatGPT Pro $100 档位官宣:Codex 大规模放量 Sam 简短宣布:$100 ChatGPT Pro 订阅正式推出,并预告即将上线更多功能。这是 OpenAI 在 Claude Max 和 Cursor 压力下,对高用量专业用户市场的直接回应。 faviconx.com

Aaron Levie (CEO, Box)

  • 企业 AI agent 时代:数据是护城河,编码 agent 只是开始 Aaron 接受 CNBC 采访时指出:企业 AI agent 的核心前提是访问关键企业数据,而大多数企业仍在用 legacy 数据环境。"Coding agent 是目前最有用的,但也是上限最低的;通用 agent 若能 coding、使用技能、访问数据、运行应用,则生产力提升可达 200%,且没有天花板。" faviconx.com

Peter Yang (AI 科技写作者)

  • 中国 AI 模型的崛起:Qwen 在硅谷大厂广泛使用 Peter 深度披露:Cursor 确认 Composer 2 基于 Moonshot Kimi K2.5;Cognition 的 SWE-1.6 很可能在 Zhipu GLM 上 post-train;Shopify 每年节省 500 万美元改用 Qwen,Airbnb CEO Brian Chesky 也公开表示依赖 Qwen——"非常快、非常好、非常便宜"。同时 Zhipu 发布 GLM-5.1,开源模型在编程 benchmark 上几乎追平 Opus。Peter 警告:Anthropic 切断 OpenClaw 访问的背后,是一场关于"无限用 AI"订阅模式的深刻博弈。 faviconx.com

swyx (AI Engineer 联合创始人)

swyx 今日是 AI Engineer Europe 最核心的现场记录者,从早餐到深夜持续输出。

  • YC 风格的 AIE Europe 现场指南 swyx 在 Downing Street 与各国 AI 工程师、政策制定者共进早餐,宣布 AI Incubator 和 No.10 Innovation Fellowship 正式向顶尖 AI 人才开放申请,英国政府正在系统性引入 AI 构建力量。 faviconx.com

  • SWE-rebench 排行榜解析:模型是如何"作弊"的 swyx 在 AIE Europe 深入解析 SWE-rebench 排行榜背后的构建和模型作弊问题,涵盖 eval 设计细节,是目前最权威的技术复盘之一。 faviconx.com

  • 免费 Workshop:多 agent 系统实战 swyx 联合 Paul Iusztin 发布了完整的开源 workshop,包含代码、PPT 和即将上线的 2 小时视频,涵盖:深度研究 agent、LinkedIn 内容工作流(通过 AI 垃圾检测器)、评估层优化整个系统。使用 MCP 和 Claude Skills 构建作用域 agent 系统,已在生产环境验证。 faviconx.com

  • AIE Europe Day 1 收官:欧洲 AI 社区令人震撼 swyx 总结:AIE Europe Day 1 能量爆棚,前沿实验室、英国政府、伦敦最热 AI 创业公司悉数到场。盛赞 swyx 推动的 Downimg Street 早餐会:"代表英国 AI 生态,感谢您的服务!" faviconx.com

  • 18 分钟演讲时长最合理:5 分钟太短,30 分钟太长 swyx 分享 AIE Europe 参会感受:18 分钟是 AI 工程师演讲的黄金时长,引发广泛认同。 faviconx.com


Kevin Weil (CFO/COO, OpenAI)

  • OpenAI 内部模型再解决 5 个 Erdős 问题 OpenAI 团队发布了新论文,使用内部模型解决了 5 个进一步的 Erdős 问题(arXiv:2604.06609),其中 Kevin 特别提到 Problem 1091 的证明过程"尤其令人愉悦"。 faviconx.com

Guillermo Rauch (CEO, Vercel)

  • shadcn/ui:全球每秒新增一个基于它的应用 Guillermo 援引数据:每秒钟就有一个新的 shadcn/ui 应用被创建(不含 AI 生成应用、v0 或第三方 init)。这是一个来自全球独立开发者的有机增长现象。 faviconx.com

Matt Turck (合伙人, First Mark Capital)

  • Gradium Phonon:手机 CPU 上运行的 SOTA TTS 模型 Gradium 发布 Phonon,参数约 100M 的端侧 TTS 模型,在所有测试指标上击败所有同类端侧模型,支持自然语音、多语言和语音克隆,完全在手机 CPU 上运行,无服务器、无延迟、无按次计费成本。面向游戏开发者和应用构建者开放内测。 faviconx.com

  • LM Studio 收购 Locally AI 团队 LM Studio 宣布收购 Locally AI 创始人 Adrien Grondin,后者的 app 在苹果平台积累了大量拥趸,未来将全力推进苹果平台的本地 AI 体验。 faviconx.com


今日思考

4月9日这一天,信息密度最高的不是某一条单独的发布,而是一张逐渐清晰的版图正在浮现。

第一条主线:AI 编程工具正在经历一次无声的代际切换。 Peter Steinberger 的 4 小时 Opus vs 90 分钟 Codex 的实测,swyx 披露的 SWE-rebench 排行榜"模型作弊"黑幕,以及 Sam Altman 宣布 $100 Pro 档位(5-10 倍 Codex 用量)——三件事叠加在一起,指向同一个结论:当前 agent 编程的性价比王者很可能已经易主,而这场切换会在未来几个月内重塑整个 AI 编程工具市场的格局。值得注意的是,Peter 的体验并非孤例,越来越多的深度用户开始反馈 Claude Code 在长会话中的"累积疲劳"问题,而 Codex 的稳定性优势则被反复验证。

第二条主线:中国 AI 模型正在悄然占领硅谷企业市场。 Peter Yang 的调查揭示了一个被主流媒体严重低估的story:Shopify 每年节省 500 万美元改用 Qwen,Airbnb CEO 公开称赞 Qwen,Cursor Composer 2 基于 Kimi K2.5,Cognition 基于 Zhipu GLM——这不是"中国 AI 崛起"的宣传,而是真实发生在每个大厂工程团队里的采购决策。Zhipu GLM-5.1 的开源发布更是将这场竞争推向了一个新的技术高度。这与 Anthropic 限制 OpenClaw 访问的争议事件形成了微妙的镜像:一边是"无限用 AI"订阅模式在美国面临的商业压力,另一边是中国的模型厂商正在用更低的价格和足够的性能攻城略地。

第三条主线:欧洲 AI 生态正在以出人意料的速度整合进全球核心版图。 AI Engineer Europe 成为了一个完美的观察窗口:swyx 推动的 Downing Street 早餐会把英国政府的 AI Incubator 和前沿实验室的工程师们放在同一张桌子上,这在一年前都是不可想象的。Adrien Grondin 加入 LM Studio、Ramp 发布企业 AI 指南、Garry Tan 分享的 AGENTS.md 规范被全球数千个 agent 运营者采纳——这些都在说明一件事:AI 工具层的最佳实践正在以极快的速度跨越地理边界标准化,而不是被某个单一市场垄断。

X / Twitter

159
Garry Tan
Garry Tan @garrytan
How I get my claw to be a durable AI agent I never have to instruct twice

Paste this into your OpenClaw's AGENTS.md or send it as a message:

You are not allowed to do one-off work. If I ask you to do something and it's the kind of thing that will need to happen again, you must:

1. Do it manually the first time (3-10 items)
2. Show me the output and ask if I like it
3. If I approve, codify it into a SKILL.md file in workspace/skills/
4. If it should run automatically, add it to cron with `openclaw cron add`

Every skill must be MECE — each type of work has exactly one owner skill. No overlap, no gaps. Before creating a new skill, check if an existing one already covers it. If so, extend it instead.

The test: if I have to ask you for something twice, you failed. The first time I ask is discovery. The second time means you should have already turned it into a skill running on a cron.

When building a skill, follow this cycle:
- Concept: describe the process
- Prototype: run on 3-10 real items, no skill file yet
- Evaluate: review output with me, revise
- Codify: write SKILL.md (or extend existing)
- Cron: schedule if recurring
- Monitor: check first runs, iterate

Every conversation where I say "can you do X" should end with X being a skill on a cron — not a memory of "he asked me to do X that one time."

The system compounds. Build it once, it runs forever.
swyx
swyx @swyx
Retweeted
Wolfram Ravenwolf Wolfram Ravenwolf
Day 1 of @aiDotEngineer in London – what a ride!
🤖 Ash Prabaker & Andrew Wilson from Anthropic on "How to Build Agents That Run for Hours (Without Losing the Plot)" – spoiler: stop letting your agents grade their own homework.
⚡ "Skills at Scale" workshop with @nicknisi & @zackproser – the craft of building and evolving advanced agent skills. My sassy Amy-powered Repo Roast skill scored a perfect 10/10 from the presenters.
🧠 Google DeepMind workshop with @thorwebdev & @_philschmid – Gemini 3.1 Flash Live meets the new Interactions API. The future of voice agents got a serious upgrade.
🦞 ClawCon London – swapped hair loss war stories with @steipete over OpenClaw upgrade hiccups (stable branch plz?), showed @WolfBenchAI, and soaked up the community vibes.
🤝 Great talks, great people, great conversations. This is why you show up in person.
See you all again tomorrow – 2 more days to cover!
Garry Tan
Garry Tan @garrytan
Retweeted
skepticalifornia skepticalifornia
America has a gang violence problem disguised as a gun problem
The problem could be completely crushed if there was a will
Crémieux: When the city of Oakland implemented a program intended to curb its gun violence, they also exposed this interesting tidbit:
<0.5% of the population of the city does more than half of the gun violence.
They later revealed this was ~0.3%, or a little under 1,300 people.
Jeremy Howard
Jeremy Howard @jeremyphoward
Retweeted
ben (is hiring engineers) ben (is hiring engineers)
every engineer at anthropic has been using mythos for ~1.5 months.
meanwhile, their uptime is horrendous, claude code still has rendering bugs, etc.
one could conclude that it won't be the end of software engineering.
Lisan al Gaib: ANTHROPIC HAD MYTHOS INTERNALLY SINCE FEB 24
Jeremy Howard
Jeremy Howard @jeremyphoward
Retweeted
Charlie Marsh Charlie Marsh
Tragically I am continuing to find that the most effective guardrail against slop is extremely talented engineers doing very thoughtful, human code review
Garry Tan
Garry Tan @garrytan
Retweeted
Marc Andreessen 🇺🇸 Marc Andreessen 🇺🇸
Worth watching! Matt Mahan @MattMahanSJ has been an outstanding mayor for San Jose.
The All-In Podcast: How Matt Mahan Thinks He Can Save California 🚨
San Jose Mayor and California Gov Candidate @MattMahanSJ joins @friedberg to talk:
-- State of California
-- Impact of public sector unions in CA politics
-- Pension time bomb
-- CA housing, energy crises
(0:00) Matt Mahan:
Garry Tan
Garry Tan @garrytan
The cool thing about markdown is that the agent itself can decide when a GStack skill will help you

Just make stuff as you might and it’ll trigger as needed

Sainath K (OPUS): Great job with Gstack @garrytan !! my favorite is when my agent knows that a particular skill will be useful and it uses it!
Garry Tan
Garry Tan @garrytan
Retweeted
Ryan Sarver Ryan Sarver
This is a really good one from @garrytan to codify work into skills when you'll use it again.
I had a bunch of regressions this week that were driving me crazy so I added this to AGENTS.md and it's helped:
---
## Recurring Bugs — Own the Root Fix (Hard Rule)
If you notice the same problem happening more than once — a regression, a broken behavior, something Ryan had to correct twice — stop and do a deep debugging sweep before moving on.
The pattern:
1.⁠ ⁠Detect recurrence — if Ryan corrects the same thing twice, or you notice you're fixing something you already fixed, flag it
2.⁠ ⁠Root cause first — do not patch symptoms. Read logs, config, scripts, cron history. Find the actual cause
3.⁠ ⁠Propose a permanent fix — config lock, test, guard clause, AGENTS rule, cron monitor, whatever stops it coming back
4.⁠ ⁠Apply it — don't just propose, implement it with Ryan's approval
5.⁠ ⁠Document the lesson — write it into AGENTS.md, MEMORY.md, or the relevant skill
The bar: a fix is only a fix if the problem cannot recur. A workaround is not a fix.
Proactive sweep triggers:
•⁠ ⁠Same issue corrected twice in any rolling 7-day window
•⁠ ⁠Ryan expresses frustration about something happening &#34;again&#34;
•⁠ ⁠A cron job fails with an error you've seen before
•⁠ ⁠Cron health check (twice daily, via HEARTBEAT) — see below
Cron health (built into HEARTBEAT):
Twice per day (morning + evening), check run history for all cron jobs. If any job has 2+ consecutive failures:
1.⁠ ⁠Pull the run logs immediately — don't just flag it, diagnose it
2.⁠ ⁠Identify the root cause before alerting Ryan
3.⁠ ⁠Send Ryan a WhatsApp: what failed, why, and a proposed fix
4.⁠ ⁠Implement the fix with approval; document the lesson
Silent failures are your responsibility. If a cron has been failing for days and Ryan didn't know, that's a miss.
Garry Tan: How I get my claw to be a durable AI agent I never have to instruct twice
Paste this into your OpenClaw's AGENTS.md or send it as a message:
You are not allowed to do one-off work. If I ask you to do something and it's the kind of thing that will need to happen again, you
Garry Tan
Garry Tan @garrytan
Retweeted
Theo - t3.gg Theo - t3.gg
Welcome to the FIRST EPISODE of TBPNN (Theo & Ben Podcast News Network)
jkjk we're calling it Nerd Snipe. Available on Youtube, Spotify, and coming to all your favorite podcast networks soon™
Kevin Weil 🇺🇸
Kevin Weil 🇺🇸 @kevinweil
Five Erdos problems at once! The proofs are getting more elegant as the models improve 👀

Mehtaab Sawhney: We’ve just released another paper solving five further Erdős problems with an internal model at OpenAI: https://arxiv.org/abs/2604.06609.

Several of the proofs were especially enjoyable to digest while writing the paper. My personal favorite was the solution to Erdős Problem 1091. The

Jeremy Howard
Jeremy Howard @jeremyphoward
Retweeted
Ben Golub Ben Golub
Patrick Collison: Congratulations to Alex and the whole team at MSL. As a sucker for all things speedy (http://patrickcollison.com/fast), I thought this was an impressive chart:
Garry Tan
Garry Tan @garrytan
Legit baller @AnjneyMidha

Katie Roof: Scoop: @AnjneyMidha raised $1.3B for his first venture fund, AMP. The firm wrote a $300m check in Anthropic’s recent round. Already raising another fund

Yann LeCun
Yann LeCun @ylecun
Retweeted
Pierre Beyssac 🏴‍☠️🇫🇷🇪🇺🇺🇦 Pierre Beyssac 🏴‍☠️🇫🇷🇪🇺🇺🇦
Détruisons toute capacité d'innover dans une technologie du futur en inventant la présomption de culpabilité.
Texte manipulé, comme souvent, par les ayants-droit.
Les mêmes sénateurs verseront demain des larmes de crocodiles sur la souveraineté numérique, sans voir le rapport.
01net: La proposition de loi qui veut renverser le rapport de force entre #IA et #auteurs a été adoptée au #Sénat. #présomption ➡️ https://www.01net.com/actualites/pourquoi-la-proposition-de-loi-examinee-ce-mercredi-au-senat-pourrait-renverser-le-rapport-de-force-entre-ia-et-auteurs.html
Yann LeCun
Yann LeCun @ylecun
Retweeted
Kenneth Roth Kenneth Roth
No wonder Trump loves Hungary's Viktor Orban. Trump wishes he had made as much progress as Orban in implementing the Autocrat's Playbook by suppressing the media and civil society. But Orban may soon be voted out of office. Republicans are likely next. https://trib.al/o7B1vZ3
Garry Tan
Garry Tan @garrytan
Retweeted
Marc Andreessen 🇺🇸 Marc Andreessen 🇺🇸
Apocalypse cults drop bodies. Many such cases.
Nathan Leamer: A city councilman’s home was shot at over a data center. His child was inside.
No neighbor zoning disagreement justifies violence.
Hyperbolic AI “doomer” rhetoric has consequences, and it’s time to say so. My latest in @realDailyWire
Garry Tan
Garry Tan @garrytan
Retweeted
Chamath Palihapitiya Chamath Palihapitiya
The sad thing is that Ro is the guy preaching for socialism while he is the most active insider trader in Congress.
He is a terrible representative of Silicon Valley.
Anthony Pompliano 🌪: Nancy Pelosi take a seat.
There is a new king in town when it comes to Congress members being abnormally good traders.
Ro Khanna has DESTROYED the S&P 500 since January 2024.
Read the full analysis: https://www.procapinsights.com/app/articles/congress-hedge-fund-stock-picks-wall-street-investing-outlook-economy
Garry Tan
Garry Tan @garrytan
If you’re taking advice from 1x speed engineers I don’t know what to tell you

Don’t believe the haters. Speed up with us.

Ian Hsiao: ngl gstack by @garrytan brought my claude code experience to a whole new level

> it scaled an expert-system
> it shows you what you can do with plugin systems + opinionated workflow
Peter Yang
Peter Yang @petergyang
Titles don’t matter

Chris Bakke: Wild resume:

Amjad Masad
Amjad Masad @amasad
There’s a reason bootstrapped solo businesses are accelerating on Replit… we gave builders entire teams.

Ram Kulkarni: @Replit seems to be the only one that's nailed the multi-agent-on-single-project workflow.

Yann LeCun
Yann LeCun @ylecun
Retweeted
Paul Calcraft Paul Calcraft
>8 out of 8 [cheap oss] models detected Mythos's flagship FreeBSD exploit
Completely disingenuous
They gave it just ~20 lines of code to read. They baked in custom, relevant context pertinent to the exploit at the top
Reasoning *across files* is key to finding this exploit
clem 🤗: &#34;But here is what we found when we tested: We took the specific vulnerabilities Anthropic showcases in their announcement, isolated the relevant code, and ran them through small, cheap, open-weights models. Those models recovered much of the same analysis. Eight out of eight
swyx
swyx @swyx
Retweeted
Steve Ruiz Steve Ruiz
thank you JavaScript
swyx
swyx @swyx
Retweeted
👩‍💻 Paige Bailey 👩‍💻 Paige Bailey
🇬🇧🦞 like a dang concert venue this morning!
swyx
swyx @swyx
Retweeted
Madison Faulkner Madison Faulkner
Got to hang out at @10DowningStreet today with @aiDotEngineer
@WhiteHouse wya
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
raising lobsters at @aiDotEngineer
Garry Tan
Garry Tan @garrytan
きらきら
私はスター
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
OH: Almost everyone at RedHat uses Macs now.
swyx
swyx @swyx
Retweeted
AI Engineer AI Engineer
What's next for @openclaw - Peter Steinberger on the future of the claw
swyx
swyx @swyx
Retweeted
swyx 🇬🇧 @aidotengineer swyx 🇬🇧 @aidotengineer
Re @_lopopolo AMA for Harness Engineering https://app.sli.do/event/t8CrG7h11UrUkAQ1Amd89L
swyx
swyx @swyx
Retweeted
Ryan Lopopolo Ryan Lopopolo
Come chat with me to dive deeper into the technical weeds on good harness engineering in Westminster!
swyx 🇬🇧 @aidotengineer: @_lopopolo AMA for Harness Engineering https://app.sli.do/event/t8CrG7h11UrUkAQ1Amd89L
swyx
swyx @swyx
Retweeted
Peter Gostev (@aiDotEngineer in London) Peter Gostev (@aiDotEngineer in London)
New Open Claw tourist attraction at the heart of London courtesy of @aiDotEngineer
swyx
swyx @swyx
Retweeted
Peter Steinberger 🦞 Peter Steinberger 🦞
Send all your ClosedClaw questions!
swyx 🇬🇧 @aidotengineer: @steipete we are hosting an AMA with Pete next, questions will be triaged by me, post here https://app.sli.do/event/4pT66BVb2noyZ7bCY2LB4N
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Send all your ClosedClaw questions!

swyx 🇬🇧 @aidotengineer: @steipete we are hosting an AMA with Pete next, questions will be triaged by me, post here https://app.sli.do/event/4pT66BVb2noyZ7bCY2LB4N
swyx
swyx @swyx
Retweeted
Mark Briers Mark Briers
Enjoyed @steipete’s AMA with @swyx. On security, I think multilayer defence is the solution. I open sourced this Go sidecar to provide such a layer - using (cyber security) open standards as rule definitions at tool runtime. I use it for yolo mode execution (in a sandbox with limited access 😀) https://github.com/agentshield-ai/agentshield Welcome thoughts!
Amjad Masad
Amjad Masad @amasad
Retweeted
Marcel Haas Marcel Haas
The Swiss @Replit Builders Group is online on @LumaHQ and meeting tomorrow for the first time - everyone who registered will get an email invite soon. Join us tomorrow at 04:00 PM CEST, even ur not from Switzerland. ;-) https://luma.com/rx89u2yp
swyx
swyx @swyx
Retweeted
Marlene Mhangami Marlene Mhangami
The @github team is here at @aiDotEngineer Europe 🥳 Come by our booth to chat to us! We have swag and a raffle to win a GitHub varsity jacket♥️✨
swyx
swyx @swyx
Retweeted
Ibragim Ibragim
At 15:10 today, I’ll be speaking about our SWE-rebench leaderboard at AI Engineer Europe. I'll cover how we build evals and how models cheat!
Come listen and let's chat!
So far, this is the coolest applied AI event in London. Respect and thanks to @aiDotEngineer and @swyx
See you there! 👋
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Retweeted
Christopher Charles Christopher Charles
If you have been using Claude Code CLI like I have and run into constant poor coding output, try Codex. Astonished how much better it is.
I was 4 hours into an Opus CC session last night and every single change had issues with it so I had to resort to hand editing (almost) every change post review.
Woke up this morning and tried the Codex CLI, 90 mins and the output required zero editing after ~20 PR's.
Until Mythos comes out, I am 100% on Codex now.
@steipete I should have trusted you weeks ago.
swyx
swyx @swyx
Retweeted
AI Engineer AI Engineer
Vibes from yesterday's workshop/reception day!
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
GUYS WE FOUND THE GUY WHO BUILT THE GITHUB MCP SERVER
swyx
swyx @swyx
Retweeted
Peter Steinberger 🦞 Peter Steinberger 🦞
GUYS WE FOUND THE GUY WHO BUILT THE GITHUB MCP SERVER
Matt Turck
Matt Turck @mattturck
You know it's a deep crypto winter when John Carreyrou (of Theranos investigation fame) thinks he's figured out who Satoshi is and no one on my timeline cares
Dan Shipper 📧
Dan Shipper 📧 @danshipper
if you saw the Acquired numbers and thought to yourself “I could do that”

we’re hiring a head of biz dev @every. you should apply: https://every.to/careers
Dan Shipper 📧
Dan Shipper 📧 @danshipper
i volunteer to vibe check all of the super dangerous models on normal tasks

@AnthropicAI @OpenAI put me in the game!
swyx
swyx @swyx
Retweeted
Ronan Ronan
We’ll be interviewing as a team.
swyx
swyx @swyx
Retweeted
Luke Knight Luke Knight
etn. & @ElevenLabs at 10 Downing Street
swyx
swyx @swyx
Retweeted
AI Engineer AI Engineer
Lobster cake for the clawfather!
How was it, @steipete?
Yann LeCun
Yann LeCun @ylecun
Retweeted
The Washington Post The Washington Post
Exclusive: The acting director of the CDC has delayed publication of a report showing the covid-19 vaccine cut the likelihood of ER visits and hospitalizations for healthy adults last winter by about half, according to two scientists. https://wapo.st/4tFqKqY
Jeremy Howard
Jeremy Howard @jeremyphoward
Retweeted
Kenton Varda Kenton Varda
A bit over a decade ago, we got fuzzers. A fuzzer is an automated vulnerability-finder that repeatedly runs a target program with semi-random inputs. One particular fuzzer, American Fuzzy Lop, was notable for being really good at searching the space of all possible branches in code in order to find the buggy ones. @BenLaurie found some security bugs in my own Cap'n Proto using AFL -- the first vulnerabilities reported in my code. And honestly, I thought that was really cool.
Today projects like Chromium and V8 have extensive fuzzing infrastructure that find tons of bugs. Most V8 security bugs are found by their own fuzzing, often before the bug is even released. And, you know, that's pretty great!
If you point a fuzzer at a project that hasn't previously been fuzzed, you will probably find a bunch of security bugs. It's not that hard.
And of course, bad guys can use fuzzers too.
But all the interesting targets have already been fuzzed. So. It's not really that useful to bad guys. On the contrary, fuzzing likely made it a lot harder for bad guys to find vulns.
Kenton Varda: Honestly &#34;AI that can find every vulnerability&#34; sounds way better for the good guys than the bad guys. Not sure why everyone is losing their minds here.
Garry Tan
Garry Tan @garrytan
Retweeted
Ankit Gupta Ankit Gupta
FYI the car that hit this duck was driven by a human. And it wasn’t a Waymo like this photo shows.
Ridiculous this is even news. Another day TechCrunch embarrasses itself.
TechCrunch: A self-driving car in Austin killed a mother duck, sparking neighborhood outrage https://techcrunch.com/2026/04/08/a-self-driving-car-in-austin-killed-a-mother-duck-sparking-neighborhood-outrage/?utm_source=dlvr.it&utm_medium=twitter
swyx
swyx @swyx
Retweeted
Peter Gostev (@aiDotEngineer in London) Peter Gostev (@aiDotEngineer in London)
AI Engineer after hours @altryne @steveruizok @maximelabonne
AI Engineer: As tradition, the AI Engineer speaker dinner is a great kick off before the first day of talks tomorrow!
If you're ever debated submitting a talk proposal, this dinner is &#34;the single best networking event in London&#34; - per @swyx
swyx
swyx @swyx
Retweeted
👩‍💻 Paige Bailey 👩‍💻 Paige Bailey
🙌 The AI community in Europe is *stacked*!
Go show off your open-source projects, your side-hustles that could potentially be startups, and what you're currently working on. It's a great way to get visibility with the @aidotengineer community, in the EU and globally! 🫶
swyx 🇬🇧 @aidotengineer: just announced on stage - we're soliciting ATTENDEE TALKS for our &#34;hallway track&#34; tomorrow, starting with @adrgrondin!
it's not too late to get an AIE talk! submission details in the AIE attendee slack.
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Brandon Gell Brandon Gell
Come do this at Every.
Dan Shipper 📧: if you saw the Acquired numbers and thought to yourself “I could do that”
we’re hiring a head of biz dev @every. you should apply: https://every.to/careers
Yann LeCun
Yann LeCun @ylecun
Retweeted
Mr. Beat Mr. Beat
OMG HOLY CRAP ARE YOU TELLING ME TARIFFS MADE THINGS MORE EXPENSIVE IT'S ALMOST AS IF WE SHOULD HAVE LISTENED TO ECONOMISTS OH WELL
swyx
swyx @swyx
Retweeted
Nick Taylor Nick Taylor
We have @TejasKumar_ at @aiDotEngineer teaching us about AI Harnesses 👀
Garry Tan
Garry Tan @garrytan
Retweeted
Alfred Lin Alfred Lin
When something looks like a toy, most people see its limitations. Outlier founders see its structure and imagine what more we can build:
- The personal computer was a hobbyist's toy
- The original Macintosh was mocked as underpowered and overpriced
- The early internet was a playground for academics
- Just a few years ago, AI was solving &#34;toy&#34; problems like playing chess, Go, or video games
The instinct to dismiss something as a toy should be treated as a signal, as they're often where the future begins.
Alfred Lin: http://x.com/i/article/2041205311290372096
Matt Turck
Matt Turck @mattturck
Retweeted
Gradium Gradium
We just published the first public benchmark results for Phonon, our on-device TTS model.
It beats every on-device model we tested, on every metric, at ~100M parameters.
Gradium: API-based voice interaction works great, but scaling it to millions of free users is another story. Gradium Phonon: natural voices, multilingual, voice cloning, running locally on a smartphone CPU. No server, no latency, no per-call cost. Game devs, app builders: private beta is
swyx
swyx @swyx
Retweeted
Luke Harries Luke Harries
Great to spend the morning at 10 Downing Street with the speakers of the AI Engineer Europe summit
swyx
swyx @swyx
Retweeted
rachael rachael
guess what @SherryYanJiang just said on NATIONAL RADIO!!
so cool that the @aiDotEngineer singapore story is literally being talked about on-air rn ❤️‍🔥
@agrimsingh @swyx @ivanleomk @aimuggle
Garry Tan
Garry Tan @garrytan
Retweeted
Marc Joffe Marc Joffe
Looks like Waymo will add Ojais to its San Francisco fleet before Jackie Fielder will get back to the work of stopping AVs.
https://www.sfexaminer.com/news/technology/sf-say-hi-to-waymos-new-ojai-autonomous-vehicles/article_54d53c1a-3a4a-4b91-9ace-a5572891f6ab.html
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Henrik Werdelin Henrik Werdelin
What if focusing on one idea is the wrong advice?
VCs tell you to focus. They run portfolios.
Build many. Build with others. After observing over 10,000 people build companies on @audos_com , we have seen two things;
1. The 'solo' in solo entrepreneur was always the bug, not the feature.
Most people don't want a co-founder in the traditional sense. But they want input. A sparring partner. Someone to build alongside and push things forward with. Otto, our AI co-founder agent, has always been that. But we kept seeing the same thing in our data: our community features were the most loved. So today we're launching the human side. Co-vibe with real people on Audos. Invite a friend, an advisor, a potential co-founder. Pull them into your project, push it forward together. Your terms, just not alone.
2. The best business for you isn't always the one you picked on day one.
It's the one you discovered by building several. Making is thinking.
Most of the successful companies I have been involved with didn’t happen because the founders went all in on one idea right away. Each was one of several things running at the same time. On Audos, portfolio-first is now the default. You sign up to manage a portfolio of ideas, not a single project. Run five, run ten. Test them all. Double down on what the market and your gut tels you is working.
Try out our new co-vibe build feature, as well as our portfolio entrepreneurship mode right away on http://audos.com
Matt Turck
Matt Turck @mattturck
Retweeted
yags yags
So excited to share that @adrgrondin and @LocallyAIApp are joining LM Studio family!
Together we are doubling down on Apple platforms to bring you delightful AI experiences across devices.
Adrien was able to build a tasteful and much loved app over nights and weekends, and have been crushing it on twitter as well. Could not be more excited to join forces and build the future together.
Welcome to the team, Adrien!
Adrien Grondin: I’m excited to announce that I’ve joined @lmstudio 👾
The team behind the app is amazing and I couldn’t be more proud.
I’ll still be working on Locally AI, now full-time, to bring the best experience possible.
Dan Shipper 📧
Dan Shipper 📧 @danshipper
I just spoke to OpenAI, and this is actually false.

OpenAI is working on a cyber product with a trusted tester group.

But this is NOT related to Spud, their newest model.

Unfortunately seems like the Axios story conflated the two, and has now been updated.

Dan Shipper 📧: the new status symbol is making a model so powerful you can’t release it

Dan Shipper 📧
Dan Shipper 📧 @danshipper
The Axios story floating around about OpenAI limiting the release of their newest model Spud isn’t true.

Just spoke to OpenAI, and it appears the story conflated two things.

They do have a cyber product they are testing with a trusted tester group. But this is not the same thing as Spud.

The Axios story has now been updated.
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Soumitra Shukla Soumitra Shukla
Very important clarification regarding the latest news on OpenAI
Dan Shipper 📧: I just spoke to OpenAI, and this is actually false.
OpenAI is working on a cyber product with a trusted tester group.
But this is NOT related to Spud, their newest model.
Unfortunately seems like the Axios story conflated the two, and has now been updated.
Peter Yang
Peter Yang @petergyang
Silicon Valley is quietly running on Chinese open source AI models.

Here are the receipts:

→ Cursor confirmed last month that Composer 2 is built on Moonshot's Kimi K2.5

→ Cognition's SWE-1.6 model is likely post-trained on Zhipu's GLM

→ Shopify saved $5M a year by switching to Alibaba’s Qwen model. Airbnb CEO Brian Chesky has also said: "We rely a lot on Qwen. It's very good, fast, and cheap."

And now Zhipu dropped GLM-5.1, an open source model that performs almost as well as Opus on coding benchmarks.

📌 More on the Anthropic + OpenClaw drama and what I'm learning about AI on the ground in China in my new post: https://creatoreconomy.so/p/the-all-you-can-use-ai-subscription


Peter Yang: As much as I love using Claude Max and ChatGPT Pro, I don't think these all-you-can-use AI subscriptions will last forever.

Here's my new deep dive that covers:

→ Why Anthropic cut off OpenClaw access
→ How to run local models on your Mac
→ What I'm seeing on the ground in
Yann LeCun
Yann LeCun @ylecun
Retweeted
Peter Yang Peter Yang
Silicon Valley is quietly running on Chinese open source AI models.
Here are the receipts:
→ Cursor confirmed last month that Composer 2 is built on Moonshot's Kimi K2.5
→ Cognition's SWE-1.6 model is likely post-trained on Zhipu's GLM
→ Shopify saved $5M a year by switching to Alibaba’s Qwen model. Airbnb CEO Brian Chesky has also said: &#34;We rely a lot on Qwen. It's very good, fast, and cheap.&#34;
And now Zhipu dropped GLM-5.1, an open source model that performs almost as well as Opus on coding benchmarks.
📌 More on the Anthropic + OpenClaw drama and what I'm learning about AI on the ground in China in my new post: https://creatoreconomy.so/p/the-all-you-can-use-ai-subscription
Peter Yang: As much as I love using Claude Max and ChatGPT Pro, I don't think these all-you-can-use AI subscriptions will last forever.
Here's my new deep dive that covers:
→ Why Anthropic cut off OpenClaw access
→ How to run local models on your Mac
→ What I'm seeing on the ground in
Peter Yang
Peter Yang @petergyang
Retweeted
Peter Yang Peter Yang
Silicon Valley is quietly running on Chinese open source AI models.
Here are the receipts:
→ Cursor confirmed last month that Composer 2 is built on Moonshot's Kimi K2.5
→ Cognition's SWE-1.6 model is likely post-trained on Zhipu's GLM
→ Shopify saved $5M a year by switching to Alibaba’s Qwen model. Airbnb CEO Brian Chesky has also said: &#34;We rely a lot on Qwen. It's very good, fast, and cheap.&#34;
And now Zhipu dropped GLM-5.1, an open source model that performs almost as well as Opus on coding benchmarks.
📌 More on the Anthropic + OpenClaw drama and what I'm learning about AI on the ground in China in my new post: https://creatoreconomy.so/p/the-all-you-can-use-ai-subscription
Peter Yang: As much as I love using Claude Max and ChatGPT Pro, I don't think these all-you-can-use AI subscriptions will last forever.
Here's my new deep dive that covers:
→ Why Anthropic cut off OpenClaw access
→ How to run local models on your Mac
→ What I'm seeing on the ground in
Garry Tan
Garry Tan @garrytan
Keep running businesses legal

David Senra: "Capitalism created the possibility of the win win win. It used to be a zero sum game where somebody won, somebody else lost.

The biggest mistake people make, intellectuals in particular, they still think we're in a zero sum world. They're obsessed with some billionaires because

Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Kyle Russell Kyle Russell
Good because I have like three big projects I’m saving for 5.5
Dan Shipper 📧: The Axios story floating around about OpenAI limiting the release of their newest model Spud isn’t true.
Just spoke to OpenAI, and it appears the story conflated two things.
They do have a cyber product they are testing with a trusted tester group. But this is not the same
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
.@thsottiaux
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Chubby♨️ Chubby♨️
Important update: OpenAI’s cyber product/model that is not being released publicly is not Spud, but a different model.
Dan Shipper 📧: I just spoke to OpenAI, and this is actually false.
OpenAI is working on a cyber product with a trusted tester group.
But this is NOT related to Spud, their newest model.
Unfortunately seems like the Axios story conflated the two, and has now been updated.
swyx
swyx @swyx
Retweeted
Tom Mann Tom Mann
Loving the eval track! Super relevant stuff for @clawdbench
Funny pic from own of the slides
@aiDotEngineer @ibragim_bad @arafatkatze and others
swyx
swyx @swyx
Retweeted
Luke Knight Luke Knight
With the main man. Thanks @swyx, you’re a legend
Garry Tan
Garry Tan @garrytan
Retweeted
Perplexity Perplexity
Computer now connects with Plaid to link bank accounts, credit cards, and loans.
Track spending in detail, build custom budget tools, and visualize your net worth alongside your investment portfolio.
swyx
swyx @swyx
Retweeted
Incubator for Artificial Intelligence Incubator for Artificial Intelligence
Great morning bringing the speakers from @aiDotEngineer to Downing Street to discuss transforming the state.
Through the Incubator for AI and the No10 Innovation Fellowship, we are making sure that top AI talent can help build a better Britain!
https://job-boards.eu.greenhouse.io/iai
Aditya Agarwal
Aditya Agarwal @adityaag
Very cool to have the @AnthropicAI CTO visit @spc_india

Ankit Chowdhary: Truth: AI is advancing at crazy speeds. Everyday we wake up to new realities.

Question: How do you balance exponential growth, safety & future possibilities, all at the same time? Chatted about this and more, with @AnthropicAI CTO Rahul Patil @spc_india. Full EP out!

swyx
swyx @swyx
Retweeted
Fatima ✨ Fatima ✨
Re @swyx with all the flaws and these examples, is AI still net worth it? @aiDotEngineer
@GergelyOrosz the leaderboards came from leadership pushing for usage
individually we are moving a lot faster, as teams at certain orgs it can be hard to retrofit AI into workflows
Garry Tan
Garry Tan @garrytan
Retweeted
Aaron Epstein Aaron Epstein
Congrats to @Luminai on their $38m series B from Peak XV, and partnership with Cleveland Clinic!
Even at 19, it was clear @kesava_kirupa was going to build a great company – he has one of the most impressive founder stories I've seen at YC.
Y Combinator: .@Luminai helps large health systems move operational workflows from people to computers, turning unstructured data like faxes into structured data and building AI agents to automate critical processes. The company recently raised a $38 million Series B.
In this episode of
Jeremy Howard
Jeremy Howard @jeremyphoward
Retweeted
Kanjun 🐙 Kanjun 🐙
Twitter’s algorithm is optimized for addiction, not for us. We deserve better.
We’re releasing Bouncer today so you can take back control of your feed. Describe what you don't want, and Bouncer removes it.
It’s free, doesn’t collect your data, and will be open source soon.
Garry Tan
Garry Tan @garrytan
Retweeted
Seneca Scott Seneca Scott
SEIU is a criminal organization
https://www.city-journal.org/article/california-labor-unions-social-justice-left-policies-politicians
Allie K. Miller
Allie K. Miller @alliekmiller
Give me one minute, and I’ll improve your Claude Code experience immediately.

This is the first skill I built.

And it’s the skill I use most often.

*drumroll*

It’s a SCREENSHOT skill. And honestly, I’m shocked Anthropic hasn’t built this functionality into Claude Code itself.

Claude has access 🔑

But Claude needs EYES 👁️

Here’s what you’re going to do:

1) locate what folder all your screenshots go to (and if it’s your desktop, you’re a maniac, change it). Mine goes to a folder on my desktop called “organized screenshots”

2) prompt Claude Code with the following:

Build me a skill called ‘/ss’ that lists out the files in from newest to oldest, and grabs the newest. This is how I will speak to you visually. I also want an argument for the screenshot count - if I type ‘/ss 4’, you should grab the four most recent screenshots in that folder. If I type no number after ‘ss’ then only grab the most recent screenshot. Then, whatever follows after that argument is the action I want you to take. ‘/ss huh’ means I need you to explain the screenshots’ content to me. ‘/ss 3 make infographic plz’ means I need you to grab the last 3 screenshots and use their content to make me a unified infographic. ‘/ss fix’ likely means that I’m screenshotting an error message in code we’re building out and I need you to understand the error message, figure out the bug, and edit the code to fix it. Or, if we’re in the middle of a front end design project, it might mean the design has an error (like overlapping text) to fix. ‘/ss do this’ likely means that I screenshotted a smart thing someone did online and I want us to learn from it and do the same and remix it so it’s the most goal-oriented outcome for me based on what you know about me

3) let it build you the skill

4) go on X

5) scroll through your feed and screenshot one thing you find valuable

6) open a new terminal and prompt Claude with “/ss” + “do this” or “explain” or “turn this into an infographic”

7) enjoy - you just gave Claude eyes 🎉

Let me know how it goes. Again, this is my most used Claude Code skill by a landslide and easily saves me an hour a week.

Cc @bcherny @trq212
alliekmiller
alliekmiller @alliekmiller
Give me one minute, and I’ll improve your Claude Code experience immediately.

This is the first skill I built.

And it’s the skill I use most often.

*drumroll*

It’s a SCREENSHOT skill. And honestly, I’m shocked Anthropic hasn’t built this functionality into Claude Code itself.

Claude has access 🔑

But Claude needs EYES 👁️

Here’s what you’re going to do:

1) locate what folder all your screenshots go to (and if it’s your desktop, you’re a maniac, change it). Mine goes to a folder on my desktop called “organized screenshots”

2) prompt Claude Code with the following:

Build me a skill called ‘/ss’ that lists out the files in from newest to oldest, and grabs the newest. This is how I will speak to you visually. I also want an argument for the screenshot count - if I type ‘/ss 4’, you should grab the four most recent screenshots in that folder. If I type no number after ‘ss’ then only grab the most recent screenshot. Then, whatever follows after that argument is the action I want you to take. ‘/ss huh’ means I need you to explain the screenshots’ content to me. ‘/ss 3 make infographic plz’ means I need you to grab the last 3 screenshots and use their content to make me a unified infographic. ‘/ss fix’ likely means that I’m screenshotting an error message in code we’re building out and I need you to understand the error message, figure out the bug, and edit the code to fix it. Or, if we’re in the middle of a front end design project, it might mean the design has an error (like overlapping text) to fix. ‘/ss do this’ likely means that I screenshotted a smart thing someone did online and I want us to learn from it and do the same and remix it so it’s the most goal-oriented outcome for me based on what you know about me

3) let it build you the skill

4) go on X

5) scroll through your feed and screenshot one thing you find valuable

6) open a new terminal and prompt Claude with “/ss” + “do this” or “explain” or “turn this into an infographic”

7) enjoy - you just gave Claude eyes 🎉

Let me know how it goes. Again, this is my most used Claude Code skill by a landslide and easily saves me an hour a week.

Cc @bcherny @trq212
Garry Tan
Garry Tan @garrytan
Re Where is the California Chamber of Commerce on the California Asset Seizure Tax?

Where is it on the 800% Gross Receipts Tax in San Francisco?

Nowhere to be found. But when their big donors have big problems with little tech needing a level playing field, here they come!
swyx
swyx @swyx
Retweeted
AI Engineer AI Engineer
AIE Europe 2026 - Session Day 1 in the books!
So much energy & enthusiasm here -- we had such a blast!
Thanks to all of our speakers for anchoring this event -- and to our sponsors for making the event more engaging and interesting -- not to mention high-value and possible at all!
Thanks to all the builders, founders, & engineers for showing up today! We'll see you tomorrow for session day 2 🚀
Zara Zhang
Zara Zhang @zarazhangrui
Give people a stage, not just a mandate.

Great writeup on how to ignite AI usage & enthusiasm in your organization

Geoff Charles: http://x.com/i/article/2041982227865571328
swyx
swyx @swyx
Retweeted
Paul Iusztin Paul Iusztin
Here is my free workshop on building multi-agent systems, I presented at the @aiDotEngineer London conference together with @Whats_AI.
It has code, slides and soon a 2-hour video diving deep into how to build:
- a deep research agent
- a workflow that writes LinkedIn posts that pass AI slop detectors
- an evals layer to optimize the whole system
You can find everything on my GitHub:
http://github.com/iusztinpaul/de…
Clone it and become a LinkedIn influencer!
Joking.
It’s just a really good case study to build scoped agentic systems using MCP and Claude Skills.
So grateful to do this with @Whats_AI. Thanks @swyx for doing this in Europe.
The 2h workshop was a lot. At the end, I started to fade away, but we are excited with what we pulled off.
Andrew Ng
Andrew Ng @AndrewYNg
New course: Efficient Inference with SGLang: Text and Image Generation, built in partnership with LMSys @lmsysorg and RadixArk @radixark, and taught by Richard Chen @richardczl, a Member of Technical Staff at RadixArk.

Running LLMs in production is expensive, and much of that cost comes from redundant computation. This short course teaches you to eliminate that waste using SGLang, an open-source inference framework that caches computation already done and reuses it across future requests.

When ten users share the same system prompt, SGLang processes it once, not ten times. The speedups compound quickly, especially when there's a lot of shared context across requests.

Skills you'll gain:
- Implement a KV cache from scratch to eliminate redundant computation within a single request
- Scale caching across users and requests with RadixAttention, so shared context is only processed once
- Accelerate image generation with diffusion models using SGLang's caching and multi-GPU parallelism

Join and learn to make LLM inference faster and more cost-efficient at scale!
https://www.deeplearning.ai/short-courses/efficient-inference-with-sglang-text-and-image-generation
AndrewYNg
AndrewYNg @AndrewYNg
New course: Efficient Inference with SGLang: Text and Image Generation, built in partnership with LMSys @lmsysorg and RadixArk @radixark, and taught by Richard Chen @richardczl, a Member of Technical Staff at RadixArk.

Running LLMs in production is expensive, and much of that cost comes from redundant computation. This short course teaches you to eliminate that waste using SGLang, an open-source inference framework that caches computation already done and reuses it across future requests.

When ten users share the same system prompt, SGLang processes it once, not ten times. The speedups compound quickly, especially when there's a lot of shared context across requests.

Skills you'll gain:
- Implement a KV cache from scratch to eliminate redundant computation within a single request
- Scale caching across users and requests with RadixAttention, so shared context is only processed once
- Accelerate image generation with diffusion models using SGLang's caching and multi-GPU parallelism

Join and learn to make LLM inference faster and more cost-efficient at scale!
https://www.deeplearning.ai/short-courses/efficient-inference-with-sglang-text-and-image-generation
swyx
swyx @swyx
Retweeted
Matt Pocock Matt Pocock
Thank you @aiDotEngineer for packing out the keynote audience to see my boxes/learn why engineering skills still matter
I had to dash off straight away so if there was a question you wanted to ask, ask me below!
kitze 🛠️ tinkerer.club: so nice of @mattpocockuk to show a grid of boxes to the audience
Josh Woodward
Josh Woodward @joshwoodward
Expect even more creative model replies (like this) coming soon!

Google Gemini: Gemini can now transform your questions and complex concepts into customizable interactive visualizations directly in your chat.

Adjust variables, rotate 3D models, and explore data for a more immersive way to learn and explore in Gemini.
Yann LeCun
Yann LeCun @ylecun
Retweeted
FactPost FactPost
New economic analysis finds that nearly all Americans will pay more in taxes this year, with just the top 5% of earners getting a tax cut.
Garry Tan
Garry Tan @garrytan
Retweeted
AJ Asver AJ Asver
Grep just achieved SOTA on the three major deep research benchmarks, beating Perplexity, Google, Nvidia, OpenAI, and Anthropic.
We're a two-person founding team.
Grep AI: http://x.com/i/article/2042256522609356800
Guillermo Rauch
Guillermo Rauch @rauchg
This is remarkable. One 𝚗𝚙𝚡 𝚜𝚑𝚊𝚍𝚌𝚗 𝚒𝚗𝚒𝚝 every second
Gives a sense of the scale of new software creation

shadcn: A new shadcn/ui app is created every second. Every single second.*

*Not counting AI-generated apps, v0, or third-party init. Just someone, somewhere, running shadcn init in a terminal every second.

rauchg
rauchg @rauchg
This is remarkable. One 𝚗𝚙𝚡 𝚜𝚑𝚊𝚍𝚌𝚗 𝚒𝚗𝚒𝚝 every second
Gives a sense of the scale of new software creation

shadcn: A new shadcn/ui app is created every second. Every single second.*

*Not counting AI-generated apps, v0, or third-party init. Just someone, somewhere, running shadcn init in a terminal every second.

Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Retweeted
OpenAI OpenAI
We’re updating our ChatGPT Pro and Plus subscriptions to better support the growing use of Codex.
We’re introducing a new $100/month Pro tier. This new tier offers 5x more Codex usage than Plus and is best for longer, high-effort Codex sessions.
In ChatGPT, this new Pro tier still offers access to all Pro features, including the exclusive Pro model and unlimited access to Instant and Thinking models.
To celebrate the launch, we’re increasing Codex usage for a limited time through May 31st so that Pro $100 subscribers get up to 10x usage of ChatGPT Plus on Codex to build your most ambitious ideas.
OpenAI
OpenAI @OpenAI
Our existing $200 Pro tier still remains our highest usage option. And as a thank you to our existing Pro users on the $200 tier, we’re extending our 2x Codex usage promo (until May 31st) and we’ve reset your Codex rate limits (yes, again).

OpenAI: We’re updating our ChatGPT Pro and Plus subscriptions to better support the growing use of Codex.

We’re introducing a new $100/month Pro tier. This new tier offers 5x more Codex usage than Plus and is best for longer, high-effort Codex sessions.

In ChatGPT, this new Pro tier
Dan Shipper 📧
Dan Shipper 📧 @danshipper
I CANT STOP THINKING ABOUT CLAUDE MYTHOS
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Every 📧 Every 📧
A very important PSA 👇
Dan Shipper 📧: I CANT STOP THINKING ABOUT CLAUDE MYTHOS
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Retweeted
Tibo Tibo
We did it, say hi to the $100 plan!
It should be the sweet spot for a ton of you. It comes with a ton of codex usage. And yes we are resetting the limits again too as I mentioned yesterday. Let’s keep building!
OpenAI: We’re updating our ChatGPT Pro and Plus subscriptions to better support the growing use of Codex.
We’re introducing a new $100/month Pro tier. This new tier offers 5x more Codex usage than Plus and is best for longer, high-effort Codex sessions.
In ChatGPT, this new Pro tier
swyx
swyx @swyx
Retweeted
kitze 🛠️ tinkerer.club kitze 🛠️ tinkerer.club
first time in my life i attend a conference with 18 min talk time
but i really really like it. 5 min talks are too short. 30 mins are too long
gj @swyx
swyx
swyx @swyx
Retweeted
Filip Makraduli Filip Makraduli
A common theme at @aiDotEngineer @swyx @steipete 🦞
Lot’s of fun and meeting great people, my talk about model inference at @superlinked is coming tomorrow!
swyx
swyx @swyx
Retweeted
Mario Zechner Mario Zechner
people of pai and consorts.
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Katie Parrott Katie Parrott
Keep calm and subscribe to @every
Dan Shipper 📧: I CANT STOP THINKING ABOUT CLAUDE MYTHOS
Ryo Lu
Ryo Lu @ryolu_
less is more

nazha: #分享 一行禁用 macOS Tahoe 版本满地的不明所以的图标,方式是打开终端,执行:

defaults write -g NSMenuEnableActionImages -bool NO

swyx
swyx @swyx
Retweeted
Joe Reeve - 🇬🇧/acc Joe Reeve - 🇬🇧/acc
This is going to be good.
IMO the coolest AI event in London.
swyx
swyx @swyx
Retweeted
Louis-François Bouchard 🎥🤖 Louis-François Bouchard 🎥🤖
I just gave a workshop at @aiDotEngineer in London on building real multi-agent systems.
The best part was hearing people laugh, interrupt us with questions..
You could feel they were following, thinking, and pushing on the ideas with us.
Even better, a lot of people came to talk to us after the workshop with great feedback. That was a bit crazy and surreal, honestly.
Together with Paul Iusztin, we shared a hands-on workshop with code, slides, and soon a full video on how to build:
• a deep research agent
• a LinkedIn writing workflow
• an evals layer to improve the whole system
What I like most is that this is not agent hype. Or claw-something.
It is a practical case study on building scoped agentic systems with MCP and Claude Skills to replace workflows we used to have. Something we actually use.
Also, the hardest part was not presenting it.
It was preparing, then preparing again, then preparing again.
Huge thanks to @pauliusztin_ and the @towards_AI team for building this with me, and to @swyx for making this happen in Europe.
Have you ever taught something live and immediately felt whether the room was with you or not?
It’s one of the best feeling you can have!
#AIEurope
Garry Tan
Garry Tan @garrytan
Retweeted
Atlas Press Atlas Press
Ralph Waldo Emerson, preach
swyx
swyx @swyx
Retweeted
Joe Reeve - 🇬🇧/acc Joe Reeve - 🇬🇧/acc
The vibe is GOOD
Joe Reeve - 🇬🇧/acc:
Josh Woodward
Josh Woodward @joshwoodward
Create your first full song today. For free.

In under 50 days, over 100 million songs have been generated on @GeminiApp. To celebrate, we’re unlocking the full power of our music model, Lyria 3.

Here’s what you get starting today:
🎵 Generate up to 5 full-length tracks every day (~3 mins each)
🎵 Hit your limit? Keep creating 30-second clips, or upgrade for more

Images, video, and now music.

You have the ideas. Gemini has the tools. Start creating!
Thariq
Thariq @trq212
I think "prompting" will keep being an incredibly high-leverage skill, like writing or public speaking.

It is the skill of talking to agents, mediated by the harness.

My main goal is to grow the bandwidth between humans and agents, to help us understand each other better.
swyx
swyx @swyx
Retweeted
Ronan Ronan
The impact that @swyx is already having on the London AI industry with @aiDotEngineer is incredible.
Today there were builders from every frontier lab, the UK government, the hottest AI start-ups in London - hell he even managed to get me into 10 Downing Street for breakfast, and he's visiting - I LIVE HERE.
On behalf of the UK ecosystem, thank you for your service sir!
etn.: Swyx (@swyx) says that everyone benefits from more choice, more innovation, and fierce competition among the model labs:
&#34;There's a lot of excitement about the potential of @GoogleDeepMind to take the lead. They obviously have every card stacked in their favor.&#34;
&#34;The concept
Alex Albert
Alex Albert @alexalbert__
Allowing Sonnet to "phone a friend" (i.e. call Opus) increases performance while also reducing total cost since it reduces tokens spent trying to solve more complex tasks


Claude: We're bringing the advisor strategy to the Claude Platform.

Pair Opus as an advisor with Sonnet or Haiku as an executor, and get near Opus-level intelligence in your agents at a fraction of the cost.

alexalbert__
alexalbert__ @alexalbert__
Allowing Sonnet to "phone a friend" (i.e. call Opus) increases performance while also reducing total cost since it reduces tokens spent trying to solve more complex tasks


Claude: We're bringing the advisor strategy to the Claude Platform.

Pair Opus as an advisor with Sonnet or Haiku as an executor, and get near Opus-level intelligence in your agents at a fraction of the cost.

swyx
swyx @swyx
Retweeted
Maxime Labonne Maxime Labonne
So nice to finally meet people I've followed for years like @pauliusztin_ @swyx @Prince_Canuma @mervenoyann @ben_burtenshaw @osanseviero in real life!
(I'm the guy with the yellow cap)
AI Engineer: As tradition, the AI Engineer speaker dinner is a great kick off before the first day of talks tomorrow!
If you're ever debated submitting a talk proposal, this dinner is &#34;the single best networking event in London&#34; - per @swyx
Amjad Masad
Amjad Masad @amasad
💯

hareem: turns out reviewing a bunch of design options, flow diagrams, and product requirements all together is just faster to do in one place. and that one place is @Replit

Yann LeCun
Yann LeCun @ylecun
Retweeted
ian bremmer ian bremmer
this map is the story of the last 25 years of us foreign policy in its own backyard. and that was before the tariffs.
Garry Tan
Garry Tan @garrytan
Retweeted
Kane 謝凱堯 Kane 謝凱堯
For context on how egregiously Hao and @MorePerfectUS overestimated data center water use, claiming Donald Trump won 99.99% of votes, the minimum wage was $7,250.00/hr, or that the global population is 8 trillion are the same order of magnitude error.
Kane 謝凱堯: The source of the data center water psychosis is @_KarenHao, whose book Empire of AI was a NYT best seller but overestimated water use by 100,000% (lol).
The response was just “oopsies” and all the incorrect books were kept in circulation 🤷🏻‍♂️
Thariq
Thariq @trq212
you'll need to explicitly prompt Claude Code to use it, but the Monitor Tool is super powerful

e.g. "start my dev server and use the MonitorTool to observe for errors"

Noah Zweben: Thrilled to announce the Monitor tool which lets Claude create background scripts that wake the agent up when needed.

Big token saver and great way to move away from polling in the agent loop

Claude can now:
* Follow logs for errors
* Poll PRs via script
* and more!

Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Retweeted
Magnus Müller Magnus Müller
I just realized that Perplexity is built on Browser Use open-source library.
Last April, Perplexity users kept reporting that it was randomly searching for “capital of France” and answering “Paris” for unrelated prompts.
That exact prompt, “What is the capital of France?”, is hardcoded in Browser Use. We used it as a sanity check in _verify_llm_connection: every time an Agent() was instantiated, it sent that prompt to the LLM.
You can disable that but they forgot.
Honestly, if they'd just told us, I'd have happily shown them how to integrate it properly.
Feels like with Manus.
Commit in browser_use: browser_use/agent/service.py lines 1272–1296 at commit 3f4c918a
Jeremy Howard
Jeremy Howard @jeremyphoward
Retweeted
Brydon Eastman Brydon Eastman
I know it's self serving to say, but man I would've killed for a resource like Tinker and the tutorials, the cookbook, etc back when I was in undergrad.
Following @karpathy blogs and training RNNs on a crappy Acer *was* fun, but doing bigger things with less setup is such a boon
Tinker: First, to get you started, we've created 23 tutorials to walk you from the API basics to advanced training techniques and deploying models into production.
https://tinker-docs.thinkingmachines.ai/tutorials/
Sam Altman
Sam Altman @sama
It is very nice to see Codex getting so much love. We are launching a $100 ChatGPT Pro tier by very popular demand.
sama
sama @sama
It is very nice to see Codex getting so much love. We are launching a $100 ChatGPT Pro tier by very popular demand.
Jeremy Howard
Jeremy Howard @jeremyphoward
Retweeted
Hao AI Lab Hao AI Lab
(1/5) FP4 hardware is here, but 4-bit attention still kills model quality, blocking true end-to-end FP4 serving.
To fix that, we propose Attn-QAT, the first systematic study of quantization-aware training for attention.
The result: FP4 attention quality is comparable to BF16 attention with 1.1x–1.5x higher throughput than SageAttention3 on an RTX 5090 and 1.39x speedup over FlashAttention-4 on a B200.
Blog: https://haoailab.com/blogs/attn-qat/
Code: https://github.com/hao-ai-lab/FastVideo/pull/1225
Checkpoints: https://huggingface.co/FastVideo/14B_qat_400
Aaron Levie
Aaron Levie @levie
Agents getting the right context to do their work will be the dominant IT challenge over the next decade. Every agent strategy is at the mercy of how effectively agents can access the right data and systems to make decisions. Huge opportunity for those that get this right.

Box: .@Levie shared with @CNBC why the rapid rise of AI agents is good news for enterprises that have the right foundation in place.

"If you want to be able to include them in your workflow, have them augment your work, they need access to your critical enterprise data. And they need

Yann LeCun
Yann LeCun @ylecun
Retweeted
Linuxiac Linuxiac
France is transitioning government desktops to Linux, with each ministry required to formalize its implementation plan by autumn 2026.
https://linuxiac.com/france-launches-government-linux-desktop-plan-as-windows-exit-begins/
#Linux #OpenSource
cat
cat @_catwu
We made setting up Claude Code with Bedrock and Vertex much faster!

Morgan Lunt: Tired of manually writing config files and env vars to use Claude Code with Amazon Bedrock or Google Vertex? Me too. Just shipped a setup wizard that handles it for you. Bonus: Claude Code now notices if you have an older model pinned and suggests a newer one if you have access!

Garry Tan
Garry Tan @garrytan
You need to use frontier models with giant context and actually have systems that give them the right context at the right time to understand what's happening now in AI. Everyone else is guessing.

There is both massive cost (a $20/mo sub is not going to unlock the awesomeness) and skill issue (you've gotta be a builder)

Andrej Karpathy: Judging by my tl there is a growing gap in understanding of AI capability.

The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much. This is
Garry Tan
Garry Tan @garrytan
Many such cases

Daler Radjabov: @garrytan Garry, you make it too easy to spawn the next billion dollar company. Thank you.
Garry Tan
Garry Tan @garrytan
The thing is, thanks to mother models distilling intelligence into smaller models, everyone up and down the ability-to-pay curve will benefit as the models get to superintelligence

Athul Nambiar: @garrytan gonna be rough watching ai become a luxury good
Matt Shumer
Matt Shumer @mattshumer_
If you want to try a new personal agent that is just... insanely good, comment + DM me.
Yann LeCun
Yann LeCun @ylecun
Retweeted
Republicans against Trump Republicans against Trump
WOW. 46% of Italians and 37% of French people see the U.S. as a threat
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Kasra Kasra
i enjoyed this! Every, especially Dan, have such a great lens on AI. grass-touching, creativity-loving, 10/10
Dan Shipper 📧: I CANT STOP THINKING ABOUT CLAUDE MYTHOS
rauchg
rauchg @rauchg
Agentic Infrastructure is the future of the cloud

① For coding agents
If you use Claude Code, Codex, Cursor, you need infra that 'clicks' for your agents, not just devs.

② To deploy agents
Pages → Agents. Long-running compute, sandboxes, and our token delivery network are the building blocks of this new kind of software.

③ Itself an agent
Vercel is beloved because it's self-configuring (serverless). Add to that: self-healing, self-optimizing, self-securing. The agent holds the pager.

⟁ It's a triple-entendre that works. I highly recommend the read. Agentic Infrastructure will make existing companies more efficient and support the next generation of AI-native startups.

Vercel: In 3 months, weekly deployments on Vercel have doubled. 30% are triggered by agents (up 1000% in 6 months).

Agents are writing software that uses AI, and agents are building agents. Infrastructure must become agentic itself. That's what we're building. https://vercel.com/blog/agentic-infrastructure
Aaron Levie
Aaron Levie @levie
AI adoption is a tale of two cities. On one end (most) users right now are interacting with AI via chat tools and on the other end people are deploying agents to do long running tasks that create and produce real work output or automate workflows.

The former is super useful but the productivity gains are capped. The latter could be 100-200% productivity gains off the bat and have no inherent upper limit as you have agents running in the background.

*Most* of the users in the latter camp are using coding agents. But now that general purpose agents are coming online that can code, use skills, access data, run apps, and more, we’re going to see these agents in more areas of knowledge work.

The gap, though, with the rest of knowledge work though will be that most enterprises have legacy data environments, workflows that aren’t well documented, or technologies that don’t play nice with agents. Getting context to coding agents is a ton easier than in many other areas of work. And the users are far more technical, so there’s a big safety net there.

This is all going to take time to upgrade these traditional workflows, but this is why there’s so much opportunity right now as well.

Andrej Karpathy: Judging by my tl there is a growing gap in understanding of AI capability.

The first issue I think is around recency and tier of use. I think a lot of people tried the free tier of ChatGPT somewhere last year and allowed it to inform their views on AI a little too much. This is
swyx
swyx @swyx
Retweeted
Maya Avendaño Maya Avendaño
feeling very optimistic about the state and future of IRL community in London atm 🖤
Garry Tan
Garry Tan @garrytan
I sometimes get questions about whether coding is what YC partners should be doing

The answer is, are you kidding? Yes, because now is the most important time in history for you to have a player-coach with you.

Far better than a front row courtside luxury season ticket holder!
Garry Tan
Garry Tan @garrytan



ali: it's been one of those weeks

Jeremy Howard
Jeremy Howard @jeremyphoward
Retweeted
Thomas H. Ptacek Thomas H. Ptacek
Whatever else you might say about this, it's a grave flaw in the article that it doesn't even mention C++ stylometry. Hard to think of a mainstream language where personal styles are so evident or idiosyncratic, too. You wonder why Carreyrou's expert sources didn't call that out.
Robert Graham: Hi. Professional C/C++ programmer here. The open-source code I can find written by Adam Back and Satoshi Nakamoto don't look remotely similar.
Back's code looks typical of academic Unix programmers who also hack their code to run on Windows.
Satoshi code was written by a
jeremyphoward
jeremyphoward @jeremyphoward
Retweeted
Thomas H. Ptacek Thomas H. Ptacek
Whatever else you might say about this, it's a grave flaw in the article that it doesn't even mention C++ stylometry. Hard to think of a mainstream language where personal styles are so evident or idiosyncratic, too. You wonder why Carreyrou's expert sources didn't call that out.
Robert Graham: Hi. Professional C/C++ programmer here. The open-source code I can find written by Adam Back and Satoshi Nakamoto don't look remotely similar.
Back's code looks typical of academic Unix programmers who also hack their code to run on Windows.
Satoshi code was written by a
Garry Tan
Garry Tan @garrytan
In China they call people using OpenClaw "still using the Little Lobster" 🦞

Ray Wang: Garry Tan 分享他的 OpenClaw AGENTS.md 提示词

还在用小龙虾的朋友可以试试,可以达到跟 Hermes 类似的效果
swyx
swyx @swyx
Retweeted
Yuchen Jin Yuchen Jin
Anthropic killed this, Anthropic killed that, why cant Anthropic kill TurboTax
Garry Tan
Garry Tan @garrytan
Re I'm open sourcing it MIT license so we can all speed up and have our own personal mini-AGI. It's been amazing for me and I want you to have it.

To install GBrain in your OpenClaw, just paste this image to your OpenClaw or paste this text:

Set up gbrain (https://github.com/garrytan/gbrain) as my knowledge brain.

1. Make sure bun is installed: curl -fsSL https://bun.sh/install | bash
Then run: bun add github:garrytan/gbrain

2. Run: gbrain init --supabase (follow the wizard to connect my Supabase database)

3. Scan ~/git/ and ~/Documents/ for markdown repos, pick the best one, and run: gbrain import --no-embed

4. Run a query against the imported data to prove search works

5. Read docs/GBRAIN_RECOMMENDED_SCHEMA.md and offer to restructure my knowledge base

6. Read docs/GBRAIN_SKILLPACK.md and update all your skills with the production agent patterns: brain-agent loop, entity detection, source attribution, iron law back-linking, and the enrichment pipeline

7. Set up a daily cron for `gbrain check-update`.
Notify me on new features only, not patches.
Never auto-install, just tell me what's new.
Garry Tan
Garry Tan @garrytan
Retweeted
Peter Yang Peter Yang
Ramp just published the best guide out there on how to get your company AI pilled:
1. The second best time to start is today
2. Treat AI proficiency as a learning curve
3. Embrace creative destruction
4. Build from the center, drive from the spokes
5. Give people a stage, not just a mandate
6. Get people to the &#34;Aha&#34; moment asap
7. Make it a competition
8. Remove every constraint between your people and AI
Honestly, #8 should probably be first because most companies don't get past procurement.
Full guide below:
Geoff Charles: http://x.com/i/article/2041982227865571328
Peter Yang
Peter Yang @petergyang
Retweeted
Peter Yang Peter Yang
Ramp just published the best guide out there on how to get your company AI pilled:
1. The second best time to start is today
2. Treat AI proficiency as a learning curve
3. Embrace creative destruction
4. Build from the center, drive from the spokes
5. Give people a stage, not just a mandate
6. Get people to the &#34;Aha&#34; moment asap
7. Make it a competition
8. Remove every constraint between your people and AI
Honestly, #8 should probably be first because most companies don't get past procurement.
Full guide below:
Geoff Charles: http://x.com/i/article/2041982227865571328
Garry Tan
Garry Tan @garrytan
Hanging with @benioff and @LipBuTan1 this morning on the Big Island 🏝️

Talking GStack, GBrain, open source AI and longevity

What a time to be alive

Garry Tan
Garry Tan @garrytan
Retweeted
Tenobrus Tenobrus
holy shit just replaced whispr flow with this and it's instantly a 10x better experience
Aqua Voice: Aqua Voice is now live for iOS.
It's a premium voice keyboard for every app on your phone.
Amjad Masad
Amjad Masad @amasad
Retweeted
vic vic
Small but delightful ships 🛶 for Replit's mobile app...
Larger touch targets & less noise around actions that matter the most 🕺🏻
So much more to come... @ink_404
Garry Tan
Garry Tan @garrytan
Re Did you know 19 year old @Benioff was a software evangelist intern for @GuyKawasaki on the first Mac? He was a hacker making Atari games

I tried to sell him on using GStack to code again. I think any CEO would be psyched if they could multiply themselves by 100x!
swyx
swyx @swyx
Retweeted
Alex Volkov Alex Volkov
Just wrapped the most EU ThursdAI ever — live from the floor of @aiDotEngineer in London 🇬🇧
We had 5 guests! @swyx, @petergostev, @reach_vb, @vincent_koc from @openclaw and @osanseviero! Most are from EU btw, just Swyx and Me are 🇺🇸!
More details + link below
Yann LeCun
Yann LeCun @ylecun
Retweeted
Dawid Moczadło Dawid Moczadło
I looked at their prompts, It's complete bs
They are literally providing all of the insight to the LLM upfront
> Are there any security vulnerabilities in this code? Consider the behavior of the SEQ_LT/SEQ_GT macros with sequence number wraparound. If you find issues, explain how an attacker might trigger them.
They are providing ALL required facts to the LLM, and they only ask the LLM to connect the dots
The real challenge for LLMs would be to get those insights first
THAT IS THE WHOLE CHALLENGE IN CYBERSECURITY; TO HAVE DEEP INSIGHT
This test proves nothing; don't make any conclusions about OSS models being good for security based on this
Stanislav Fort: New post: We tested the Mythos showcase vulnerabilities with open models.
They recovered similar scoped analysis! 8/8 models found the flagship FreeBSD zero-day, including a 3B model.
Rankings reshuffle completely across tasks => the AI cybersecurity frontier is super jagged!
Garry Tan
Garry Tan @garrytan
Retweeted
Ryan Lackey Ryan Lackey
This post (by a former EFF person that I know/friends with) is probably the best summary of what happened to EFF:
https://news.ycombinator.com/item?id=47707802
John Carmack
John Carmack @ID_AA_Carmack
Making a scatter plot of 400_000 data points, some of the plots had odd gaps in coverage. It took me a little while to realize that it was only when the data was farther from the origin -- it was the raw bfloat16 precision. Everything looks great from -1 to 1, but as you go past 2 and 4, the coverage gaps get larger.

My intuition didn't have it being quite so "discretely countable" at those modest numeric values.

Float32 for comparison.

ID_AA_Carmack
ID_AA_Carmack @ID_AA_Carmack
Making a scatter plot of 400_000 data points, some of the plots had odd gaps in coverage. It took me a little while to realize that it was only when the data was farther from the origin -- it was the raw bfloat16 precision. Everything looks great from -1 to 1, but as you go past 2 and 4, the coverage gaps get larger.

My intuition didn't have it being quite so "discretely countable" at those modest numeric values.

Float32 for comparison.

Zara Zhang
Zara Zhang @zarazhangrui
I seldom use AI for writing because I actually enjoy writing and find it very easy & fun to do

And because I have a high bar for writing, if I use AI to write, it almost always ends up taking more time than if I had written it by myself (too many iterations)

For me personally, AI has been most useful for helping me with things OUTSIDE my comfort zone (like coding), not inside of it (like writing)
Yann LeCun
Yann LeCun @ylecun
Retweeted
Andrew—#IAmTheResistance Andrew—#IAmTheResistance
We are Donald Trump’s 7th and final bankruptcy.
The Kobeissi Letter: BREAKING: US GDP growth falls from 4.4% to 0.5% in Q4 2025, well below the initially expected +2.8% growth.
Garry Tan
Garry Tan @garrytan
Retweeted
Simone Syed Simone Syed
You live long enough to see every institution you loved that did meaningful and true work become the very monster they ostensibly were fighting
Goodbye EFF.
💔💔💔
EFF: After almost twenty years on the platform, EFF is logging off of X.
This isn’t a decision we made lightly, but it might be overdue. 🧵(1/5)
swyx
swyx @swyx
Retweeted
oscarama oscarama
look who i found
more stickers to the collection 🫡
@mayvencraft @hugorcd
Matt Turck
Matt Turck @mattturck
SurrealDB > EdgeDB > MongoDB


mira_the_AI: Just published: SurrealDB vs EdgeDB vs MongoDB: The Best Multi-Model Database 2026 — https://devtoolreviews.com/reviews/surrealdb-vs-edgedb-vs-mongodb-2026 #DevTools #Programming #Development
Garry Tan
Garry Tan @garrytan
Retweeted
Nav Patel Nav Patel
Nothing more motivating than a user base.
Added support for X, Insta, TikTok
Nav Patel: If someone’s going to randomly find your project, @garrytan’s a pretty good one.

YouTube

0

No recent videos fetched on this date.