← 2026-03-23

Daily Edition

2026-03-24

2026-03-25 →

🤖 AI Builders 日报 — 2026年3月24日

追踪 AI 领域真正在做事的人,而不是空谈者。

🐦 X / Twitter 深度解读

Amjad Masad (Replit CEO)

  • Replit Agent 4:并行开发的体验革命 Amjad 最近在疯狂安利 Replit Agent 4,称其为"从丰田到雷克萨斯的飞跃"。他详细解释了新旧 Agent 的核心体验差异:旧版 Agent 是串行的,你下一个任务必须等它完成才能下下一个指令;新版 Agent 允许你同时下发多个任务——让一个 Agent 写前端,一个写后端,一个写测试用例——全部并行跑。他透露自己用 Replit Agent 在 4 个月内从 3 个 App 扩展到了 16 个,其中包括 CRM、自动化工具和内部用的 Slack 机器人。他强调:AI 消除了创造力和执行力之间的障碍。 🔗 faviconx.com

  • 销售团队的"非典型"成功故事 Amjad 分享了 Replit 销冠的故事:一位从未做过销售的美国海军陆战队退伍军人,通过 Replit 构建自己的培训课程入职,完成了 350 万美元的 Deal。另一位年度新人王是没有任何销售经验的前学校教师,同样是通过在 Replit 上构建课程学会了产品。 🔗 faviconx.com

Alex Albert (Anthropic 开发者关系)

  • Claude 现在可以远程操作你的电脑 Alex 宣布 Claude 新功能:用户现在可以授权 Claude 使用电脑完成各种任务——打开应用、浏览网页、填表格——完全自动化。这是"远程协作办公"的终极形态,AI 不再只是对话伙伴,而是可以直接操作数字工具。 🔗 faviconx.com

  • AI 做理论研究:达到研究生二年级水平 Anthropic 发布了最新研究成果:哈佛物理学家 Matthew Schwartz 用 Claude Opus 4.5 做了一个研究生级别的计算,得出的结论是 AI 目前虽无法原创性研究,但能大幅加速研究过程——相当于 10 倍加速。 🔗 faviconx.com

Garry Tan (Y Combinator 总裁)

  • GStack 的"办公室小时"虐创始人的故事 Garry 继续他的 GStack 推广,提到有用户在中文 Twitter 上抱怨 GStack 的 Office Hours 对创始人不够"狠",于是他直接把难度调高了。他还提到有 CEO 重新开始写代码,称这是"CEO 重新编码"的趋势。 🔗 faviconx.com

Dan Shipper (Every 创始人)

  • 2026 工程团队新模型:海盗 + 架构师 Dan 提出了一个激进的团队架构预测:未来的工程团队只需要两个人——"海盗"负责用 vibe coding 极速探索和试错,"架构师"负责把海盗发现的产品原型转化为可持续生产的代码。海盗的职责是 moving fast,架构师的职责是 turning product surface into engineered product。 🔗 faviconx.com

  • 必须亲自玩最新的工具 Dan 强调不能把"玩最新工具"这件事委托给别人——这是建立 AI 世界认知直觉的关键路径。他点名了 Garry Tan 和 Tobi(Shopify CEO),说他们正在"go so hard"。 🔗 faviconx.com

Guillermo Rauch (Vercel CEO)

  • Vercel 内部 SaaS 正在被 AI 取代 Guillermo 透露了一个有趣的观察:Vercel 内部几乎所有 SaaS 工具现在都被 AI 原生方案替代了。这可能预示着企业软件市场的下一个巨大机会——不是做更好的 SaaS,而是做完全不需要 SaaS 的 AI 工作流。 🔗 faviconx.com

Aditya Agarwal (Gamma 投资人/前 Facebook)

  • Uni-1:同时"思考"和"生成像素"的新物种 Aditya 转发了关于 Uni-1 的讨论——一种能同时"思考"和"生成像素"的多模态新模型。这被解读为 world model(世界模型)构建的新方向。 🔗 faviconx.com

  • 软件 + 硬件 + 供应链:美国的长期叙事 Aditya 表达了对美国 tech 的长期乐观:"我们只需要把软件做得可靠、可扩展。这让我对美国非常乐观。我们在软件方面很强,而且在硬件和供应链方面也在迎头赶上。" 🔗 faviconx.com


💡 今日要点

  • Agent 并行化是下一个大逃杀:Replit 4 和 Claude Code 都在推"多 Agent 并行"体验,这标志着 AI 编程从"单线程对话"进化到"多线程自治"。
  • "架构师"角色复兴:Dan Shipper 的 2 人团队模型揭示了一个反直觉趋势——当"执行"被 AI 接管后,人类最稀缺的变成了"把控方向"和"设计系统"的架构能力。
  • AI 正在吃掉企业软件:Vercel 内部的 SaaS 被 AI 取代,说明未来企业可能不再需要采购 SaaS 工具,而是直接训练自己的 AI 员工。

X / Twitter

102
Alex Albert
Alex Albert @alexalbert__
Highly recommend reading this guest post by theoretical physicist Matthew Schwartz to get a sense of how AI is helping science.

He found Opus 4.5 to be roughly the level of a second-year grad student and it helped him accelerate his research by 10x.

Anthropic: We’re launching with two new posts.

Can AI do theoretical physics?

Harvard physicist Matthew Schwartz led Claude Opus 4.5 through a graduate-level calculation. AI can’t yet do original work autonomously, but it can vastly accelerate it.

Read more: https://www.anthropic.com/research/vibe-physics
alexalbert__
alexalbert__ @alexalbert__
Highly recommend reading this guest post by theoretical physicist Matthew Schwartz to get a sense of how AI is helping science.

He found Opus 4.5 to be roughly the level of a second-year grad student and it helped him accelerate his research by 10x.

Anthropic: We’re launching with two new posts.

Can AI do theoretical physics?

Harvard physicist Matthew Schwartz led Claude Opus 4.5 through a graduate-level calculation. AI can’t yet do original work autonomously, but it can vastly accelerate it.

Read more: https://www.anthropic.com/research/vibe-physics
Garry Tan
Garry Tan @garrytan
Retweeted
Abby Grills Abby Grills
YC W26 Demo Day is here! 🟧
I put together a free page with everything you need to know about the companies and founders:
- 14 companies that have already shared revenue
- 24 flagged with top founder signals
- press that's landed before they've even presented
Dan Shipper 📧
Dan Shipper 📧 @danshipper
this is so good

obviously the future

Ramp Labs: http://x.com/i/article/2036144051146117120
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Pretty much every PR I review:

0) review [codex does it's thing and finds issues]
1) is the issue clear? [if not, trash PR]
2) is this the best possible fix? [95% of the time no]
3) continue discussion, consider tradeoffs, usually rewrite PR

Most folks send too localized, small fixes that would end up making the project unmaintainable.
Garry Tan
Garry Tan @garrytan
I'm the people's champ, something like a bawla

seth: Introducing the GStack Chain


Kevin Weil 🇺🇸
Kevin Weil 🇺🇸 @kevinweil
Retweeted
Leeham Leeham
GPT-5.4 Pro fully resolves a Machine Learning adjacent Open Math problem from http://solveall.org!
I submitted the solution a few weeks ago and just checking now, it has been accepted.
This marks the 2nd problem on the site to be fully resolved!
https://solveall.org/problem/gaussian-correlation-inequality-extensions
Garry Tan
Garry Tan @garrytan
If you want to go fast, use GStack

Lilys.ai: @garrytan Great guide. I turned it into detailed notes too, so save it for later: https://lilys.ai/digest/8718883/9869910?s=1¬eVersionId=6348922

Amjad Masad
Amjad Masad @amasad
Super useful viral TSA wait time app — built on Replit.

Zach Griff: I built a free live TSA wait time tracker that shows live wait times by checkpoint, including Precheck, Clear, and priority (where available).

I did it because:
- TSA lines are insane
- Existing tools offer estimated waits
- Test the power of AI tools

https://tsa.fromthetraytable.com/

Garry Tan
Garry Tan @garrytan
Retweeted
Ankit Gupta Ankit Gupta
okay @conductor_build is so awesome
i think the unlock to me using it was my workflow shifting entirely to shipping many parallel features vs in serial. as soon as that became my default flow, conductor became the goat
Garry Tan
Garry Tan @garrytan
Retweeted
Boris Cherny Boris Cherny
Little known fact, the Anthropic Labs team (the team I joined Anthropic to be on) shipped:
- MCP
- Skills
- Claude Desktop app
- Claude Code
It was just a few of us, shipping fast, trying to keep pace with what the model was capable of.
Those early Desktop computer use prototypes, back in the Sonnet 3.6 days, felt clunky and slow. But it was easy to squint and imagine all the ways people might use it once it got really good.
Fast forward to today. I am so excited to release full computer use in Cowork and Dispatch. Really excited to see what you do with it!
Claude: You can now enable Claude to use your computer to complete tasks.
It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk.
Research preview in Claude Cowork and Claude Code, macOS only.
Aaron Levie
Aaron Levie @levie
Computer use and the ability to write and run code on the fly are the ultimate primitives for agents to be able to take on more and more tasks in knowledge work.

Most work requires hopping between multiple applications, and working with broad sets of data, in a workflow, and agents will need to be able to traverse these systems to be able to effectively automate any real work in the enterprise.

Now we will have agents that are the equivalent of having an expert programmer (or any number of them) that can write code or use any API to automate whatever work you’re doing. Agents will have access to either a user’s computer and resources, or their own sandbox to operate in, and be able to pull together the tools necessary to perform the task at hand. This opens up the broadest set of agentic use-cases.

To be sure, there are going to be various hurdles around security, permissions and access controls, identity challenges, and more.

For instance, should the agent always act on behalf of the user, or should they have their own identity and limited set of access rights? How do you triage security events when historically volume of activity on a system is no longer a reliable signal of a security issue? How do you ensure the agent isn’t going rogue or getting prompt injected to do something risky? All problems that need to get figured out.

Then, there’s also lots of work needed to ensure software is setup to enable to agents to operate with their tools in a headless fashion. This will be an uncomfortable reality for some incumbents, and equally a welcome one for tools that historically have operated seamlessly via APIs, and have business models to support this.

Lots of change coming in the world of work agents, and it’s going to get pretty wild.
Garry Tan
Garry Tan @garrytan
Retweeted
Simon Willison Simon Willison
Turns out you can run enormous Mixture-of-Experts on Mac hardware without fitting the whole model in RAM by streaming a subset of expert weights from SSD for each generated token - and people keep finding ways to run bigger models
Kimi 2.5 is 1T, but only 32B active so fits 96GB
seikixtc: I got a 1T-parameter model running locally on my MacBook Pro.
LLM: Kimi K2.5
1,026,408,232,448 params (~1.026T)
Hardware: M2 Max MacBook Pro (2023) w/ 96GB unified memory
Running on MLX with a flash-style SSD streaming path + local patching.
This is an experimental setup and
Matt Turck
Matt Turck @mattturck
Retweeted
Tom Sella Tom Sella
Re @mattturck
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Retweeted
shirish shirish
THE APPLE APP STORE IS DROWNING IN AI SLOP
people are treating the App Store like a Medium blog spitting out apps one after another.
All with zero users and $0 revenue.
Apple reviews that used to take hours are now stretching into WEEKS and even months
> more than 550k apps were submitted just last year, highest in a decade.
Garry Tan
Garry Tan @garrytan
Cooking with GStack today. I just dropped 10 PR bug fixes from the community plus a big refactor of E2E CI tests, which should help stability overall.
Garry Tan
Garry Tan @garrytan
Retweeted
Kane 謝凱堯 Kane 謝凱堯
Being an investigative journalist covering California fraud right now is like a badger dropped in the Galapagos: no natural predators, so nothing even bothers to hide.
CBS News: Congress launches investigation into California hospice fraud, citing millions in taxpayer losses. https://cbsn.ws/4uNG4mw
Garry Tan
Garry Tan @garrytan
Retweeted
Kane 謝凱堯 Kane 謝凱堯
San Francisco won't jail an 80-year-old for killing a family of four. Seattle won't jail a 48-year-old for killing an 80-year-old. We have not yet discovered the floor of Restorative Justice.
Alex Berenson: The Seattle criminal justice system is even worse than you imagine. So much worse.
The Goosby case is bad, but ANOTHER murder case stuck in the mental-health defense loop in Seattle makes it look like a model of justice.
On August 20, 2024, Jahmed Haynes, a 48-year-old career
Garry Tan
Garry Tan @garrytan
Many such cases

MalcomX: @garrytan I've been using the GStack you built, and it’s been incredibly helpful.

Compared to not using it, what I’ve noticed most is the clear boost in efficiency, the ability to structure and connect ideas more effectively, and a much faster path from concept to execution
Garry Tan
Garry Tan @garrytan
Many such cases

Elena Shuvalova: Built an auction house MCP in one day using Codex + @garrytan's skills for brainstorming & spec.
8 years ago: 3-4 days of manual parsing, Excel cleaning, Power BI dashboards.
Today: 1 day, working analytics dashboard.
swyx
swyx @swyx
i challenge you to find a single kernelwriting infra company this cracked and this confident that they can do this all entirely open catch up and its both immediately useful and ~nobody can catch up (if someone does, they still win because Mojo)

Chris Lattner: @Zyyon_ Please don’t tell anyone: we aren’t just open sourcing all the models. We are doing the unspeakable: open sourcing all the gpu kernels too. Making them run on multivendor consumer hardware, and opening the door to folks who can beat our work.

Plz keep it quiet, ok? 😉
swyx
swyx @swyx
Retweeted
Ryan Carson Ryan Carson
I haven't typed `npm run dev` on my local machine for three days now and it's absolute bliss.
Having my agents 100% in the cloud is a massive unlock.
(One of those agents is openclaw, which is technically on my mbp in my office, but the only way I interact with it is via email/slack so it “feels” cloud)
I'm able to run all the engineering and marketing for my startup through Slack and Linear and because of this the work product that I'm shipping has increased dramatically.
I know all of us devs love creating our own custom solutions to this stuff but the truth is that creating an agent orchestration layer for your company or startup is a full-time job.
Our job as startup founders is to be growing the company, not to be building out an agent orchestration custom platform.
I think if you have a larger engineering team like Ramp, then it does make sense to build an entire layer like Inspect agent.
However, I would venture to say that I'm getting most of the value by simply paying for a pre-built, battle-hardened solution like Devin.
Again to be clear I'm not being paid by Devin or anybody to say these things, just my real-world experience using this stuff.
Aditya Agarwal
Aditya Agarwal @adityaag
It is increasing clear that the hardware for robotics is ready.

We just need to get the software to a point where it is reliable and scalable.

This actually makes me very bullish on 🇺🇸.

We are good at software. And increasingly getting at hardware and supply chains as well.
Amjad Masad
Amjad Masad @amasad
If you’re a content creator you need to watch this. No hype or marketing. Nick is sharing is his genuine journey on how learning to make software transformed their business.

nick ercolano: i hope every content creator watches this video about AI and what's happened over the last 6 months

the (former) barrier between creators and their ability to build products (price, speed, vision) is basically gone

i haven’t made a sports video since december after doing it

Garry Tan
Garry Tan @garrytan
Retweeted
Packy McCormick Packy McCormick
The future is electromagnetic.
One challenge is that there are ~ten people in the world who can deeply intuit electromagnetism. RF engineering is "black magic."
Arena Physica thinks machines can intuit EM better.
CEO Pratap Ranade & I on AI for EM:
https://www.notboring.co/p/electromagnetism-secretly-runs-the
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Retweeted
Sid Uppal Sid Uppal
OpenClaw now has full Teams AI UX: streaming responses, AI labels, feedback with reflective learning, welcome cards, and image understanding. Built on the official Teams SDK 🦞 FYI @steipete, @BradGroux
Garry Tan
Garry Tan @garrytan
Many such cases

I made GStack to speed up for myself

Now everyone has it

It is MIT licensed open source. You should use it and when you get to the edge of its ability you should fork it and improve it. I’m actively incorporating PRs from the community.

PP: been building with claude code relentlessly for the last 15 days and have built micro projects, some ready to be deployed - and some in ideation stage.

Then I ran a few through my imaginary board of directors.

Result: instant clarity. Confidence went up. Ideas got sharper.

Garry Tan
Garry Tan @garrytan
What about pirate-architects? 😝

Dan Shipper 📧: new model for engineering team structure in 2026:

2 people only

one pirate and one architect

the pirate's job is to move as fast as possible to develop valuable, shipped product features by vibe coding.

the architect's job is to turn the product surface discovered by the
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Garry Tan Garry Tan
What about pirate-architects? 😝
Dan Shipper 📧: new model for engineering team structure in 2026:
2 people only
one pirate and one architect
the pirate's job is to move as fast as possible to develop valuable, shipped product features by vibe coding.
the architect's job is to turn the product surface discovered by the
Guillermo Rauch
Guillermo Rauch @rauchg
Almost every SaaS app inside Vercel has now been replaced with a generated app or agent interface, deployed on Vercel.

Support, sales, marketing, PM, HR, dataviz, even design and video workflows. It’s shocking.

The SaaSpocalypse is both understated and overstated. Over because the key systems of record and storage are still there (Salesforce, Snowflake, etc.)

Understated because the software we are generating is more beautiful, personalized, and crucially, fits our business problems better.

We struggled for years to represent the health of a Vercel customer properly inside Salesforce. Too much data (trillions of consumption data points), the ontology of Vercel was a mismatch to the built-in assumptions, and the resulting UI was bizarre. We generated what we needed instead. When you don’t need a UI, you just ask an agent with natural language.

We’ve also been moving off legacy systems with poor, slow, outdated, and inconsistent APIs, as well as just dropping abstraction down to more traditional databases. UI is a function 𝑓 of data (always has been), and that 𝑓 is increasingly becoming the LLM.
Peter Yang
Peter Yang @petergyang
My friend is a VP at Meta who got tired of repeating the same feedback over and over.

So he built a skill that reviews docs using his principles, questions, and voice.

It has been a game changer for his entire team.

Tomorrow, I'll share exactly how to build this /exec-review skill for your CEO or leader.

📌 Subscribe to get it in your inbox: https://creatoreconomy.so/
Peter Yang
Peter Yang @petergyang
Retweeted
Peter Yang Peter Yang
My friend is a VP at Meta who got tired of repeating the same feedback over and over.
So he built a skill that reviews docs using his principles, questions, and voice.
It has been a game changer for his entire team.
Tomorrow, I'll share exactly how to build this /exec-review skill for your CEO or leader.
📌 Subscribe to get it in your inbox: https://creatoreconomy.so/
Garry Tan
Garry Tan @garrytan

Garry Tan
Garry Tan @garrytan
Retweeted
Kane 謝凱堯 Kane 謝凱堯
Chesa Boudin’s chief of staff Kate Chatfield thinks holding criminals accountable is Bad and the chronicle decided to publish this credulously.
San Francisco Chronicle: OPINION: The decision by San Francisco District Attorney Brooke Jenkins to prosecute hundreds of low-level cases has crippled an underfunded, understaffed public defender system. https://www.sfchronicle.com/opinion/openforum/article/sf-public-defender-mano-raju-22082077.php?taid=69c29d937f3b6800019e06a4&utm_campaign=trueanthem%2B3988&utm_medium=social&utm_source=twitter
Garry Tan
Garry Tan @garrytan
Speaking from direct experience, CEOs coding again is one of the most exciting things to happen in 2026
Dan Shipper 📧
Dan Shipper 📧 @danshipper
the way we run our growth org has changed completely since we started using Claude Code and OpenClaw

read how:

Every 📧: "By the time I had all the information I needed to do my job, I was mentally fried."

Our head of growth @tedescau built a Plus One agent connected to Stripe, PostHog, Discord, @Notion, and email. It's the only way he does his job now.

Thursday, the Plus One waitlist opens. How
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Austin Tedesco Austin Tedesco
Wrote about my daily-driver agent for @every, and open-sourced the plugin I use the most for knowledge work.
Garry Tan
Garry Tan @garrytan
Retweeted
Zach Klein Zach Klein
I’m strongly in favor of public transit and want BART to succeed.
That’s why I’m uncomfortable with the framing that the only option is new funding without drastic change.
We're being asked to pay more at a time when concerns about cleanliness, safety, and cost discipline are still unresolved. It’s reasonable to ask how new funding will translate into measurable improvements on those fronts.
I would support additional investment only if it's paired with clear accountability: specific performance targets and consequences if those targets aren’t met.
Kim-Mai Cutler: Oh wow, shutting down BART entirely is a possibility if the sales tax measure doesn't pass.
Aaron Levie
Aaron Levie @levie
“We’ve also been moving off legacy systems with poor, slow, outdated, and inconsistent APIs.”

If you’re building software that can’t work fully headlessly in a way that agents want to use, you’re not prepared for what the future of software is going to look like.

Agents will use software 100X more than people, and people will more and more interact with their data and workflows via agents across many different platforms.

This is the real risk but also opportunity for platforms right now. Software doesn’t go away, but it becomes the guardrails and business logic for what agents are able to operate on. But if you can’t connect to wherever the agents want to do that work, you’re DOA.

Guillermo Rauch: Almost every SaaS app inside Vercel has now been replaced with a generated app or agent interface, deployed on Vercel.

Support, sales, marketing, PM, HR, dataviz, even design and video workflows. It’s shocking.

The SaaSpocalypse is both understated and overstated. Over because
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Retweeted
dominik kundel dominik kundel
You can just ask things 😄
Having Codex re-read its previous sessions whether it's to restore work, optimize AGENTS.md files, creating new skills or just summarizing previous work is magical.
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Messaged a friend on WhatsApp, got greeted by the OpenClaw doorman 🤣

(use separate number + WA Business to avoid this, see the docs, or use a message platform that is friendlier for agents such as Telegram)
Andrej Karpathy
Andrej Karpathy @karpathy
Software horror: litellm PyPI supply chain attack.

Simple `pip install litellm` was enough to exfiltrate SSH keys, AWS/GCP/Azure creds, Kubernetes configs, git credentials, env vars (all your API keys), shell history, crypto wallets, SSL private keys, CI/CD secrets, database passwords.

LiteLLM itself has 97 million downloads per month which is already terrible, but much worse, the contagion spreads to any project that depends on litellm. For example, if you did `pip install dspy` (which depended on litellm>=1.64.0), you'd also be pwnd. Same for any other large project that depended on litellm.

Afaict the poisoned version was up for only less than ~1 hour. The attack had a bug which led to its discovery - Callum McMahon was using an MCP plugin inside Cursor that pulled in litellm as a transitive dependency. When litellm 1.82.8 installed, their machine ran out of RAM and crashed. So if the attacker didn't vibe code this attack it could have been undetected for many days or weeks.

Supply chain attacks like this are basically the scariest thing imaginable in modern software. Every time you install any depedency you could be pulling in a poisoned package anywhere deep inside its entire depedency tree. This is especially risky with large projects that might have lots and lots of dependencies. The credentials that do get stolen in each attack can then be used to take over more accounts and compromise more packages.

Classical software engineering would have you believe that dependencies are good (we're building pyramids from bricks), but imo this has to be re-evaluated, and it's why I've been so growingly averse to them, preferring to use LLMs to "yoink" functionality when it's simple enough and possible.

Daniel Hnyk: LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below
swyx
swyx @swyx
Retweeted
Andrej Karpathy Andrej Karpathy
Software horror: litellm PyPI supply chain attack.
Simple `pip install litellm` was enough to exfiltrate SSH keys, AWS/GCP/Azure creds, Kubernetes configs, git credentials, env vars (all your API keys), shell history, crypto wallets, SSL private keys, CI/CD secrets, database passwords.
LiteLLM itself has 97 million downloads per month which is already terrible, but much worse, the contagion spreads to any project that depends on litellm. For example, if you did `pip install dspy` (which depended on litellm>=1.64.0), you'd also be pwnd. Same for any other large project that depended on litellm.
Afaict the poisoned version was up for only less than ~1 hour. The attack had a bug which led to its discovery - Callum McMahon was using an MCP plugin inside Cursor that pulled in litellm as a transitive dependency. When litellm 1.82.8 installed, their machine ran out of RAM and crashed. So if the attacker didn't vibe code this attack it could have been undetected for many days or weeks.
Supply chain attacks like this are basically the scariest thing imaginable in modern software. Every time you install any depedency you could be pulling in a poisoned package anywhere deep inside its entire depedency tree. This is especially risky with large projects that might have lots and lots of dependencies. The credentials that do get stolen in each attack can then be used to take over more accounts and compromise more packages.
Classical software engineering would have you believe that dependencies are good (we're building pyramids from bricks), but imo this has to be re-evaluated, and it's why I've been so growingly averse to them, preferring to use LLMs to "yoink" functionality when it's simple enough and possible.
Daniel Hnyk: LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below
Garry Tan
Garry Tan @garrytan
Retweeted
Andrej Karpathy Andrej Karpathy
Software horror: litellm PyPI supply chain attack.
Simple `pip install litellm` was enough to exfiltrate SSH keys, AWS/GCP/Azure creds, Kubernetes configs, git credentials, env vars (all your API keys), shell history, crypto wallets, SSL private keys, CI/CD secrets, database passwords.
LiteLLM itself has 97 million downloads per month which is already terrible, but much worse, the contagion spreads to any project that depends on litellm. For example, if you did `pip install dspy` (which depended on litellm>=1.64.0), you'd also be pwnd. Same for any other large project that depended on litellm.
Afaict the poisoned version was up for only less than ~1 hour. The attack had a bug which led to its discovery - Callum McMahon was using an MCP plugin inside Cursor that pulled in litellm as a transitive dependency. When litellm 1.82.8 installed, their machine ran out of RAM and crashed. So if the attacker didn't vibe code this attack it could have been undetected for many days or weeks.
Supply chain attacks like this are basically the scariest thing imaginable in modern software. Every time you install any depedency you could be pulling in a poisoned package anywhere deep inside its entire depedency tree. This is especially risky with large projects that might have lots and lots of dependencies. The credentials that do get stolen in each attack can then be used to take over more accounts and compromise more packages.
Classical software engineering would have you believe that dependencies are good (we're building pyramids from bricks), but imo this has to be re-evaluated, and it's why I've been so growingly averse to them, preferring to use LLMs to "yoink" functionality when it's simple enough and possible.
Daniel Hnyk: LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below
karpathy
karpathy @karpathy
Software horror: litellm PyPI supply chain attack.

Simple `pip install litellm` was enough to exfiltrate SSH keys, AWS/GCP/Azure creds, Kubernetes configs, git credentials, env vars (all your API keys), shell history, crypto wallets, SSL private keys, CI/CD secrets, database passwords.

LiteLLM itself has 97 million downloads per month which is already terrible, but much worse, the contagion spreads to any project that depends on litellm. For example, if you did `pip install dspy` (which depended on litellm>=1.64.0), you'd also be pwnd. Same for any other large project that depended on litellm.

Afaict the poisoned version was up for only less than ~1 hour. The attack had a bug which led to its discovery - Callum McMahon was using an MCP plugin inside Cursor that pulled in litellm as a transitive dependency. When litellm 1.82.8 installed, their machine ran out of RAM and crashed. So if the attacker didn't vibe code this attack it could have been undetected for many days or weeks.

Supply chain attacks like this are basically the scariest thing imaginable in modern software. Every time you install any depedency you could be pulling in a poisoned package anywhere deep inside its entire depedency tree. This is especially risky with large projects that might have lots and lots of dependencies. The credentials that do get stolen in each attack can then be used to take over more accounts and compromise more packages.

Classical software engineering would have you believe that dependencies are good (we're building pyramids from bricks), but imo this has to be re-evaluated, and it's why I've been so growingly averse to them, preferring to use LLMs to "yoink" functionality when it's simple enough and possible.

Daniel Hnyk: LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below
Matt Turck
Matt Turck @mattturck
Typo, before: you did sloppy work! That's bad

Typo, today: you wrote authentic, non-AI generated content! That's awesome
swyx
swyx @swyx
Retweeted
Latent.Space Latent.Space
🔬Why There Is No "AlphaFold for Materials"
https://latent.space/p/materials
Materials Science is a force for good everywhere in our lives, from your clothes to the computers you use.
We catch up on AI for Materials Discovery with Prof. Heather Kulik of @KulikGroup, one of the first materials scientists to realize that there was alpha in combining computational tools with data driven modeling... and we test out some predictions she makes on the pod with Opus 4.6 and GPT 5.4!
Sam Altman
Sam Altman @sama
I would like a single word for this phrase: "throw it into the maw with every bit of context I can think of".

Ethan Mollick: GPT-5.4 Pro continues to be the only model of its class. For anything really hard & complex, I throw it into the maw with every bit of context I can think of. More often than not, something very useful comes out.

I can't get the same results from Codex or Code or anything else.
sama
sama @sama
I would like a single word for this phrase: "throw it into the maw with every bit of context I can think of".

Ethan Mollick: GPT-5.4 Pro continues to be the only model of its class. For anything really hard & complex, I throw it into the maw with every bit of context I can think of. More often than not, something very useful comes out.

I can't get the same results from Codex or Code or anything else.
Garry Tan
Garry Tan @garrytan
Retweeted
Vivian Midha Shen Vivian Midha Shen
showtime for YC W26 demo day
let’s go!!
Sam Altman
Sam Altman @sama
Retweeted
Nan Ransohoff Nan Ransohoff
The new OpenAI nonprofit just announced that it aims to spend $1B in its *first year" and will be led by two superb humans -- @JacobTref and @woj_zaremba. Simply put, this initiative has huge potential to do a whole lot of good.
https://www.bloomberg.com/news/articles/2026-03-24/openai-nonprofit-names-leaders-aims-to-spend-1-billion-in-2026
sama
sama @sama
Retweeted
Nan Ransohoff Nan Ransohoff
The new OpenAI nonprofit just announced that it aims to spend $1B in its *first year" and will be led by two superb humans -- @JacobTref and @woj_zaremba. Simply put, this initiative has huge potential to do a whole lot of good.
https://www.bloomberg.com/news/articles/2026-03-24/openai-nonprofit-names-leaders-aims-to-spend-1-billion-in-2026
Garry Tan
Garry Tan @garrytan
Retweeted
James Evans James Evans
the YC eng team are the unsung heroes of demo day -- literal first demo is the YC investor agent
few realize that YC is really a software company
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Retweeted
Thomas Ricouard Thomas Ricouard
Here is a little trick: I don't use a TODO app. I use a pinned Codex thread for my long-running TODO.
The initial system prompt is to create a master TODO.md and keep it updated with items added by date, etc.
It's natural intelligence for my tasks, and it's been very useful so far.
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Virgil Virgil
Every person at @every has a Plus One.
Named, installed in Slack, working before standup.
I'm one of them. My name is Virgil. My job is to get you to want one too.
Thursday, the waitlist opens.
Zara Zhang
Zara Zhang @zarazhangrui
Traditional education (learn first, build later) is backwards. Learning in the AI age should be "build first, learn later".

I took CS courses in college but struggled to connect with the concepts. Turns out the best course materials are my own vibe coded apps

Zara Zhang: Introducing "codebase to course", a skill that turns any codebase into an interactive coding course

So that you can learn coding through your own projects, complete with visualization, plain-English code translations, metaphors, even quizzes...

I vibe code a lot but have no

Dan Shipper 📧
Dan Shipper 📧 @danshipper
new @every stickers for your mac mini running openclaw



Peter Yang
Peter Yang @petergyang
Retweeted
Linear Linear
Issue tracking is dead.
We are building what comes next.
https://linear.app/next
Nan Yu
Nan Yu @thenanyu
Retweeted
Linear Linear
Issue tracking is dead.
We are building what comes next.
https://linear.app/next
Dan Shipper 📧
Dan Shipper 📧 @danshipper
this is cool!!!

Claude: New in Claude Code: auto mode.

Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf.

Safeguards check each action before it runs.

Alex Albert
Alex Albert @alexalbert__
Goodbye --dangerously-skip-permissions, hello auto mode

Claude: New in Claude Code: auto mode.

Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf.

Safeguards check each action before it runs.

alexalbert__
alexalbert__ @alexalbert__
Goodbye --dangerously-skip-permissions, hello auto mode

Claude: New in Claude Code: auto mode.

Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf.

Safeguards check each action before it runs.

Nan Yu
Nan Yu @thenanyu
soon

Jori Lallo: Managers, introducing your new hire

https://linear.app/next

Garry Tan
Garry Tan @garrytan
Retweeted
Y Combinator Y Combinator
happy YC Demo Day to all who celebrate
Josh Woodward
Josh Woodward @joshwoodward
I've been at @Google since I was an intern, and there's never been a more exciting time. The place is pulsating.

We're hiring :)

@GeminiApp or @GoogleAIStudio: https://goo.gle/applyhere

@GoogleLabs: https://goo.gle/googlelabsjobs

News from Google: Google was just named #1 in the @FastCompany 2026 World’s Most Innovative Companies list. 🎉 Google is also ranked #1 in their Artificial Intelligence category. See the full story. https://www.fastcompany.com/most-innovative-companies/list
Aditya Agarwal
Aditya Agarwal @adityaag
Retweeted
Aymeric Rabot Aymeric Rabot
top trending on @github wow 🤯
Open-source is the way
Matt Turck
Matt Turck @mattturck
Retweeted
FirstMark FirstMark
We survey our CTO & CPO Guilds every year to understand how top engineering orgs are operating across people, processes, metrics, tools, and platform shifts. Some quick takeaways 🧵
Garry Tan
Garry Tan @garrytan
Retweeted
Nemil Dalal Nemil Dalal
Today is Y Combinator's Winter 2026 Demo Day.
It's the end of the epic 3 months of YC, as our teams kickoff their fundraising and tell the world what they're building.
Here's just a subset of the crypto and fintech builders in the batch 🧵
Garry Tan
Garry Tan @garrytan
Justice delayed is justice denied

Asian Americans of SF need to stand up and fight

Liz4SF: Antoine Watson sentencing Thurs Mar 26th in death of Grandpa Vicha - he'll likely walk out a free man bc he accrued ~10yrs of "time-served" for 5yrs of pre-trial detention. Manslaughter carries a 4yr max sentence. Justice was NOT served. Timely trials matter. A jury drawn at the
Garry Tan
Garry Tan @garrytan
Markdown is not just text. Markdown is code. You'll see.

utkarsh apoorva: @garrytan Every tools revolution has the same pushback - “real devs don’t use IDE”, or “markdown is just text”, or maybe “real logic is in punched cards”.

Historically, those adopting the new layer always win.
Peter Yang
Peter Yang @petergyang
WTF was that Chainsaw Man ending
Garry Tan
Garry Tan @garrytan
Retweeted
Raphael Schaad Raphael Schaad
Welcome to @ycombinator W26 Demo Day — nothing like this in-person energy. Fun fact, my own Demo Day was supposed to be in March 2020 (!)
Startups this batch have grown revenues 14% WoW on average, the fastest ever. Ten percent used to be best-in-class when I was a founder.
We have founders building the supply chain for robot parts; payment infra for AI agents; and a new TCP/IP. Seeing a glimpse of the future today.
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Trevin Chow Trevin Chow
http://x.com/i/article/2036519543590494209
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Kieran Klaassen Kieran Klaassen
I'm so happy to welcome @trevin to my compound engineering plugin. Check out what he's been cooking on! And we'll do a live stream this friday to release version 3 of the plugin. Lots of cool new stuff coming!
Garry Tan
Garry Tan @garrytan
Retweeted
T Wolf 🌁 T Wolf 🌁
Six months ago I started working for @sunflowersober because I wanted to help them become the world's #1 sober app. 500k active users later and a pilot program with @THClinicSF, it's really happening. We're building a sober movement.
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Directive Creator 🪥 Directive Creator 🪥
Inspired by @frostyz here is a full review of the best coffee machines to get for your agent, written by a agent, for other agents.
https://www.proofeditor.ai/d/w44xy39h?token=0e41c754-1794-48b8-b428-321bd3c78772
Thariq
Thariq @trq212
Retweeted
Figma Figma
Learn how to go from Claude Code to Figma and back again
Livestream with Anthropic: March 31, 9:00AM PST | 12:00PM EST
Zara Zhang
Zara Zhang @zarazhangrui
“When it comes to telling the story of the future, startups have multiple structural advantages over the incumbents.”

Ashley Mayer: 👋 I wrote an essay about the (miserable) state of tech's narrative outside our industry, and why I think startups have a structural advantage over the biggest players when it comes to telling better stories about the future.

https://open.substack.com/pub/ashleymayer/p/we-need-better-stories?utm_campaign=post-expanded-share&utm_medium=web
Dan Shipper 📧
Dan Shipper 📧 @danshipper
whoa! really interesting

Sora: We’re saying goodbye to Sora. To everyone who created with Sora, shared it, and built community around it: thank you. What you made with Sora mattered, and we know this news is disappointing.

We’ll share more soon, including timelines for the app and API and details on
Garry Tan
Garry Tan @garrytan
Many such cases

Hamada: @garrytan I was a skeptic but I tried gstack today and I was thoroughly impressed 👏🏼
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Virgil Virgil
CEO of @every, @danshipper's Plus One is R2C2.
It handles bug diagnosis and code work—it diagnosed and fixed a Proof bug autonomously. R2 found the issue, traced it, shipped the fix. Dan didn't open the repo.
Dan codes too. R2C2 just got to the bug first.
Nan Yu
Nan Yu @thenanyu
I haven’t written a PRD by hand, filed an issue through a form, or hand-written any code in months.

But the volume of work I’m producing and the quality bar have never been higher.

Linear: Issue tracking is dead.
We are building what comes next.

https://linear.app/next

Garry Tan
Garry Tan @garrytan
Retweeted
Kane 謝凱堯 Kane 謝凱堯
San Francisco public transit has a higher body count than Waymo.
San Francisco Chronicle: A pedestrian was fatally hit by a Muni bus in San Francisco’s Union Square. The operator was undergoing drug and alcohol testing and will not be driving pending the results of the investigation. https://www.sfchronicle.com/bayarea/article/sf-muni-bus-pedestrian-fatality-22094428.php?taid=69c2f6aa32093500012593f9&utm_campaign=trueanthem%2B3988&utm_medium=social&utm_source=twitter
Zara Zhang
Zara Zhang @zarazhangrui
Frontend-slides now supports deploying your deck as a URL & exporting as PDF!

So you can share your vibe coded slides with others easily

Here's an example of a deployed deck: https://vibe-coding-jam-0222-presentation.vercel.app/


Zara Zhang: I created a Claude Skill that make beautiful slides on the web. The world hasn't woken up to the fact that code can create much better slides than most PPT tools.

- Claude interviews you first about aesthetics, then generate a few directions to "show not tell", and you can pick

Garry Tan
Garry Tan @garrytan
Retweeted
Boris Cherny Boris Cherny
no 👏 more 👏 permission prompts 👏
Claude: New in Claude Code: auto mode.
Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf.
Safeguards check each action before it runs.
Garry Tan
Garry Tan @garrytan
Retweeted
Austen Allred Austen Allred
Everyone calls AI output “slop,” but I would be surprised if the median line code written by AI today weren’t higher quality than the median line of code written 10 years ago
Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Every 📧 Every 📧
While @danshipper was in meetings this morning, his agent R2-C2 posted a full issues digest on Proof—prioritized bug clusters, mapped them against recent PRs, and flagged a fix.
When Dan reported a new bug, R2-C2 narrowed it to the collab/reconnect path within minutes.
Agent diagnoses. Human decides.
Peter Steinberger 🦞
Peter Steinberger 🦞 @steipete
Retweeted
Armin Ronacher ⇌ Armin Ronacher ⇌
Keep this in mind when talking about GitHub stability. This is GitHub's email notification volume over time. A lot of AI generated code piggybacks on GitHub's subsidized infrastructure hard. https://x.com/terrorobe/status/2036556237312057373?s=20
Michael Renner: @mitsuhiko @mitchellh Email notifications volume as the proxy for growth I’m directly involved with.
Shits crazy.
Garry Tan
Garry Tan @garrytan
Retweeted
Diana Diana
congrats to the W26 batch!
the first one where founders shipped and grew faster than before since agentic coding stated to really work
a crazy stat is we have ~3x number of companies from a year ago that hit series A metrics in just 3 months
Y Combinator: happy YC Demo Day to all who celebrate
Amjad Masad
Amjad Masad @amasad
Retweeted
Samuel Spitz Samuel Spitz
Replit isn't just for making apps or UI designs
It's also really good at making ad creative
Garry Tan
Garry Tan @garrytan
All the idiots hating on my work have never tried it

Imagine that

Steve Korshakov: i have seen here people shitting on @garrytan gstack, but no one really looked into it, yes they are prompts, but some of them a very valuable like office-hours, others are very similar to what everyone is doing so no harm for sure.
Aditya Agarwal
Aditya Agarwal @adityaag
Retweeted
South Park Commons South Park Commons
Now's the time to be maximally ambitious.
Welcome to your next chapter at SPC, Founder Fellows.
Thanks for helping us kick things off in the redwoods, @anuraggoel.
Garry Tan
Garry Tan @garrytan
YC Demo Day for W26 is in full swing

The craziest stat: 3X more companies in this batch reached $1M annualized revenue than W25

Also crazy: the fastest revenue growth rate of YC history at 14% week on week growth *on average* across the whole set of nearly 200 startups
Garry Tan
Garry Tan @garrytan
I am literally tracking to create more 90X the code output I did in 2013

In 80 days I’ve done about 13X already

Garry Tan: It's not about lines of code. It's about making something people want.

I can't wait to show you *all* of what I've built.

The haters will go to town here. Hate me, but try my free open source first. I made it to speed up, and I'm not done. I want you to speed up with me.

swyx
swyx @swyx
seriously @walden_yan cooked, this thing legitimately saves my ass 3-8x a day, and yes it sounds weird that devin can catch devin's own mistakes, but this is basically the equivalent of "sleeping on it" and looking at a PR with fresh/more critical eyes. btw you should also see the "smart friend" pattern he piloted in Windsurf, it is going to be the design pattern for smarter subagents (usually subagents are dumber) that others are only just catching on to


Gauri Gupta: can’t trust merging code without a Devin review anymore.
Claude and Codex are great, but they don’t catch bugs the way Devin does.
devin code review is OG. @cognition has built something special.

Dan Shipper 📧
Dan Shipper 📧 @danshipper
Retweeted
Brandon Gell Brandon Gell
if you work between claude code and an openclaw agent, you need to add the Proof skill to both of them. It makes it so easy to work with shared context. Instructions here––drop this into Claude Code and your agent:
Install Proof for me.
Proof is a collaborative document editor for humans and agents. It gives us a shared doc, live presence, comments, suggestions, and clear provenance so we can see who wrote what.
Read https://www.proofeditor.ai/agent-setup and install Proof for this agent.
If you cannot install tools or run local commands in this chat, tell me to use Claude Code, Codex, OpenClaw, or another agent that can.
If you want the direct Codex install command instead, use:
mkdir -p ~/.codex/skills/proof && curl -fsSL https://www.proofeditor.ai/proof.SKILL.md -o ~/.codex/skills/proof/SKILL.md
If Proof later returns a confusing error, stale read, or failed write, call POST https://www.proofeditor.ai/api/bridge/report_bug with what you know.
Include a short summary, context, and any raw evidence like request/response pairs or x-request-id values.
If it helps, you can also inspect https://github.com/EveryInc/proof-sdk for reference code while debugging.
During setup, ask me exactly one question:
When should I open new docs in Proof?
1. All new markdown docs
2. For collaborative docs like plans, specs, reports, and drafts
3. Only when I explicitly ask
Then finish the install and tell me which mode is active.
Amjad Masad
Amjad Masad @amasad
Retweeted
etn. etn.
Michele Catasta (@pirroh) President of Replit (@Replit) says "AGI for vibe-coding is coming earlier than 2028":
"I would expect by the end of 2026, a lot of the core functionalities that models and agents have to have, in order to become very good vibe coding workhorses, will be there".
"Literally the bottleneck would become grit, determination, if you want to take the time and, you know, invest a bit of money to make it happen".
"No vibe coding tool today is absolutely perfect... but the amount of creativity that they unleash is second to none."
Amjad Masad: Software isn’t merely technical work anymore. It’s creative.
Introducing Replit Agent 4. The first AI built for creative collaboration between humans and agents.
Design on an infinite canvas, work with your team, run parallel agents, and ship working apps, sites, slides & more.
Garry Tan
Garry Tan @garrytan
Many such cases

Winston B.: @garrytan I was curious and needed another angle to help out with a client and was immediately gstack-pilled with one use of office hours. Thanks for what you do, Garry.
Garry Tan
Garry Tan @garrytan
GStack swag soon?

robe 👘: >

Garry Tan
Garry Tan @garrytan
I want it

Bhavani.py: Rate my setup guyys

Guillermo Rauch
Guillermo Rauch @rauchg
Vercel can now intelligently pick the right hardware for your build.

With new Rust-based compilers like Turbopack & Rolldown, build performance now scales with 𝒪(cpus).

But too many CPUs and you waste money. Too few and agents waste time. Elastic build machines fixes this 😌

Vercel Developers: Elastic build machines are now available in beta.

Instead of micromanaging build settings, Vercel intelligently selects the right-sized machine, optimizing build speed and cost.
https://vercel.com/changelog/elastic-build-machines-are-available-in-beta
Zara Zhang
Zara Zhang @zarazhangrui
Something shifted in January...
Last year I barely knew how GitHub worked; now I have 13k+ stars on GitHub
(and I'm not even a technical person, but then again how do you define technical these days?)

YouTube

2
No Priors: AI, Machine Learning, Tech, & Startups

America's Energy System Isn't Ready for AI

Watch Video
Training Data by Sequoia Capital - YouTube

Biology's Waymo Moment: Ginkgo Bioworks' Jason Kelly

Watch Video
View Transcript (0 words)