2026-03-24

Alex Albert @alexalbert__

Highly recommend reading this guest post by theoretical physicist Matthew Schwartz to get a sense of how AI is helping science.

He found Opus 4.5 to be roughly the level of a second-year grad student and it helped him accelerate his research by 10x.

Anthropic: We’re launching with two new posts.

Can AI do theoretical physics?

Harvard physicist Matthew Schwartz led Claude Opus 4.5 through a graduate-level calculation. AI can’t yet do original work autonomously, but it can vastly accelerate it.

Read more: https://www.anthropic.com/research/vibe-physics

Mar 23, 08:05 PM ET View post

alexalbert__ @alexalbert__

Highly recommend reading this guest post by theoretical physicist Matthew Schwartz to get a sense of how AI is helping science.

He found Opus 4.5 to be roughly the level of a second-year grad student and it helped him accelerate his research by 10x.

Anthropic: We’re launching with two new posts.

Can AI do theoretical physics?

Harvard physicist Matthew Schwartz led Claude Opus 4.5 through a graduate-level calculation. AI can’t yet do original work autonomously, but it can vastly accelerate it.

Read more: https://www.anthropic.com/research/vibe-physics

Mar 23, 08:05 PM ET View post

Garry Tan @garrytan

Retweeted

Abby Grills

YC W26 Demo Day is here! 🟧
I put together a free page with everything you need to know about the companies and founders:
- 14 companies that have already shared revenue
- 24 flagged with top founder signals
- press that's landed before they've even presented

Mar 23, 08:42 PM ET View post

Dan Shipper 📧 @danshipper

this is so good

obviously the future

Ramp Labs: http://x.com/i/article/2036144051146117120

Mar 23, 09:19 PM ET View post

Peter Steinberger 🦞 @steipete

Pretty much every PR I review:

0) review [codex does it's thing and finds issues]
1) is the issue clear? [if not, trash PR]
2) is this the best possible fix? [95% of the time no]
3) continue discussion, consider tradeoffs, usually rewrite PR

Most folks send too localized, small fixes that would end up making the project unmaintainable.

Mar 23, 09:23 PM ET View post

Garry Tan @garrytan

I'm the people's champ, something like a bawla

seth: Introducing the GStack Chain

Mar 23, 10:20 PM ET View post

Kevin Weil 🇺🇸 @kevinweil

Retweeted

Leeham

GPT-5.4 Pro fully resolves a Machine Learning adjacent Open Math problem from http://solveall.org!
I submitted the solution a few weeks ago and just checking now, it has been accepted.
This marks the 2nd problem on the site to be fully resolved!
https://solveall.org/problem/gaussian-correlation-inequality-extensions

Mar 23, 10:20 PM ET View post

Garry Tan @garrytan

If you want to go fast, use GStack

Lilys.ai: @garrytan Great guide. I turned it into detailed notes too, so save it for later: https://lilys.ai/digest/8718883/9869910?s=1¬eVersionId=6348922

Mar 23, 10:30 PM ET View post

Amjad Masad @amasad

Super useful viral TSA wait time app — built on Replit.

Zach Griff: I built a free live TSA wait time tracker that shows live wait times by checkpoint, including Precheck, Clear, and priority (where available).

I did it because:
- TSA lines are insane
- Existing tools offer estimated waits
- Test the power of AI tools

https://tsa.fromthetraytable.com/

Mar 23, 10:40 PM ET View post

Garry Tan @garrytan

Retweeted

Ankit Gupta

okay @conductor_build is so awesome
i think the unlock to me using it was my workflow shifting entirely to shipping many parallel features vs in serial. as soon as that became my default flow, conductor became the goat

Mar 23, 11:34 PM ET View post

Garry Tan @garrytan

Retweeted

Boris Cherny

Little known fact, the Anthropic Labs team (the team I joined Anthropic to be on) shipped:
- MCP
- Skills
- Claude Desktop app
- Claude Code
It was just a few of us, shipping fast, trying to keep pace with what the model was capable of.
Those early Desktop computer use prototypes, back in the Sonnet 3.6 days, felt clunky and slow. But it was easy to squint and imagine all the ways people might use it once it got really good.
Fast forward to today. I am so excited to release full computer use in Cowork and Dispatch. Really excited to see what you do with it!
Claude: You can now enable Claude to use your computer to complete tasks.
It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk.
Research preview in Claude Cowork and Claude Code, macOS only.

Mar 23, 11:38 PM ET View post

Aaron Levie @levie

Computer use and the ability to write and run code on the fly are the ultimate primitives for agents to be able to take on more and more tasks in knowledge work.

Most work requires hopping between multiple applications, and working with broad sets of data, in a workflow, and agents will need to be able to traverse these systems to be able to effectively automate any real work in the enterprise.

Now we will have agents that are the equivalent of having an expert programmer (or any number of them) that can write code or use any API to automate whatever work you’re doing. Agents will have access to either a user’s computer and resources, or their own sandbox to operate in, and be able to pull together the tools necessary to perform the task at hand. This opens up the broadest set of agentic use-cases.

To be sure, there are going to be various hurdles around security, permissions and access controls, identity challenges, and more.

For instance, should the agent always act on behalf of the user, or should they have their own identity and limited set of access rights? How do you triage security events when historically volume of activity on a system is no longer a reliable signal of a security issue? How do you ensure the agent isn’t going rogue or getting prompt injected to do something risky? All problems that need to get figured out.

Then, there’s also lots of work needed to ensure software is setup to enable to agents to operate with their tools in a headless fashion. This will be an uncomfortable reality for some incumbents, and equally a welcome one for tools that historically have operated seamlessly via APIs, and have business models to support this.

Lots of change coming in the world of work agents, and it’s going to get pretty wild.

Mar 23, 11:56 PM ET View post

Garry Tan @garrytan

Retweeted

Simon Willison

Turns out you can run enormous Mixture-of-Experts on Mac hardware without fitting the whole model in RAM by streaming a subset of expert weights from SSD for each generated token - and people keep finding ways to run bigger models
Kimi 2.5 is 1T, but only 32B active so fits 96GB
seikixtc: I got a 1T-parameter model running locally on my MacBook Pro.
LLM: Kimi K2.5
1,026,408,232,448 params (~1.026T)
Hardware: M2 Max MacBook Pro (2023) w/ 96GB unified memory
Running on MLX with a flash-style SSD streaming path + local patching.
This is an experimental setup and

Mar 24, 12:08 AM ET View post

Matt Turck @mattturck

Retweeted

Tom Sella

Re @mattturck

Mar 24, 12:52 AM ET View post

Peter Steinberger 🦞 @steipete

Retweeted

shirish

THE APPLE APP STORE IS DROWNING IN AI SLOP
people are treating the App Store like a Medium blog spitting out apps one after another.
All with zero users and $0 revenue.
Apple reviews that used to take hours are now stretching into WEEKS and even months
> more than 550k apps were submitted just last year, highest in a decade.

Mar 24, 01:00 AM ET View post

Garry Tan @garrytan

Cooking with GStack today. I just dropped 10 PR bug fixes from the community plus a big refactor of E2E CI tests, which should help stability overall.

Mar 24, 01:16 AM ET View post

Garry Tan @garrytan

Retweeted

Kane 謝凱堯

Being an investigative journalist covering California fraud right now is like a badger dropped in the Galapagos: no natural predators, so nothing even bothers to hide.
CBS News: Congress launches investigation into California hospice fraud, citing millions in taxpayer losses. https://cbsn.ws/4uNG4mw

Mar 24, 02:02 AM ET View post

Garry Tan @garrytan

Retweeted

Kane 謝凱堯

San Francisco won't jail an 80-year-old for killing a family of four. Seattle won't jail a 48-year-old for killing an 80-year-old. We have not yet discovered the floor of Restorative Justice.
Alex Berenson: The Seattle criminal justice system is even worse than you imagine. So much worse.
The Goosby case is bad, but ANOTHER murder case stuck in the mental-health defense loop in Seattle makes it look like a model of justice.
On August 20, 2024, Jahmed Haynes, a 48-year-old career

Mar 24, 02:29 AM ET View post

Garry Tan @garrytan

Many such cases

MalcomX: @garrytan I've been using the GStack you built, and it’s been incredibly helpful.

Compared to not using it, what I’ve noticed most is the clear boost in efficiency, the ability to structure and connect ideas more effectively, and a much faster path from concept to execution

Mar 24, 02:35 AM ET View post

Garry Tan @garrytan

Many such cases

Elena Shuvalova: Built an auction house MCP in one day using Codex + @garrytan's skills for brainstorming & spec.
8 years ago: 3-4 days of manual parsing, Excel cleaning, Power BI dashboards.
Today: 1 day, working analytics dashboard.

Mar 24, 03:39 AM ET View post

swyx @swyx

i challenge you to find a single kernelwriting infra company this cracked and this confident that they can do this all entirely open catch up and its both immediately useful and ~nobody can catch up (if someone does, they still win because Mojo)

Chris Lattner: @Zyyon_ Please don’t tell anyone: we aren’t just open sourcing all the models. We are doing the unspeakable: open sourcing all the gpu kernels too. Making them run on multivendor consumer hardware, and opening the door to folks who can beat our work.

Plz keep it quiet, ok? 😉

Mar 24, 03:59 AM ET View post

swyx @swyx

Retweeted

Ryan Carson

I haven't typed `npm run dev` on my local machine for three days now and it's absolute bliss.
Having my agents 100% in the cloud is a massive unlock.
(One of those agents is openclaw, which is technically on my mbp in my office, but the only way I interact with it is via email/slack so it “feels” cloud)
I'm able to run all the engineering and marketing for my startup through Slack and Linear and because of this the work product that I'm shipping has increased dramatically.
I know all of us devs love creating our own custom solutions to this stuff but the truth is that creating an agent orchestration layer for your company or startup is a full-time job.
Our job as startup founders is to be growing the company, not to be building out an agent orchestration custom platform.
I think if you have a larger engineering team like Ramp, then it does make sense to build an entire layer like Inspect agent.
However, I would venture to say that I'm getting most of the value by simply paying for a pre-built, battle-hardened solution like Devin.
Again to be clear I'm not being paid by Devin or anybody to say these things, just my real-world experience using this stuff.

Mar 24, 08:08 AM ET View post

Aditya Agarwal @adityaag

It is increasing clear that the hardware for robotics is ready.

We just need to get the software to a point where it is reliable and scalable.

This actually makes me very bullish on 🇺🇸.

We are good at software. And increasingly getting at hardware and supply chains as well.

Mar 24, 08:55 AM ET View post

Amjad Masad @amasad

If you’re a content creator you need to watch this. No hype or marketing. Nick is sharing is his genuine journey on how learning to make software transformed their business.

nick ercolano: i hope every content creator watches this video about AI and what's happened over the last 6 months

the (former) barrier between creators and their ability to build products (price, speed, vision) is basically gone

i haven’t made a sports video since december after doing it

Mar 24, 08:57 AM ET View post

Garry Tan @garrytan

Retweeted

Packy McCormick

The future is electromagnetic.
One challenge is that there are ~ten people in the world who can deeply intuit electromagnetism. RF engineering is "black magic."
Arena Physica thinks machines can intuit EM better.
CEO Pratap Ranade & I on AI for EM:
https://www.notboring.co/p/electromagnetism-secretly-runs-the

Mar 24, 09:00 AM ET View post

Peter Steinberger 🦞 @steipete

Retweeted

Sid Uppal

OpenClaw now has full Teams AI UX: streaming responses, AI labels, feedback with reflective learning, welcome cards, and image understanding. Built on the official Teams SDK 🦞 FYI @steipete, @BradGroux

Mar 24, 09:10 AM ET View post

Garry Tan @garrytan

Many such cases

I made GStack to speed up for myself

Now everyone has it

It is MIT licensed open source. You should use it and when you get to the edge of its ability you should fork it and improve it. I’m actively incorporating PRs from the community.

PP: been building with claude code relentlessly for the last 15 days and have built micro projects, some ready to be deployed - and some in ideation stage.

Then I ran a few through my imaginary board of directors.

Result: instant clarity. Confidence went up. Ideas got sharper.

Mar 24, 09:25 AM ET View post

Garry Tan @garrytan

What about pirate-architects? 😝

Dan Shipper 📧: new model for engineering team structure in 2026:

2 people only

one pirate and one architect

the pirate's job is to move as fast as possible to develop valuable, shipped product features by vibe coding.

the architect's job is to turn the product surface discovered by the

Mar 24, 09:42 AM ET View post

Dan Shipper 📧 @danshipper

Retweeted

Garry Tan

What about pirate-architects? 😝
Dan Shipper 📧: new model for engineering team structure in 2026:
2 people only
one pirate and one architect
the pirate's job is to move as fast as possible to develop valuable, shipped product features by vibe coding.
the architect's job is to turn the product surface discovered by the

Mar 24, 09:42 AM ET View post

Guillermo Rauch @rauchg

Almost every SaaS app inside Vercel has now been replaced with a generated app or agent interface, deployed on Vercel.

Support, sales, marketing, PM, HR, dataviz, even design and video workflows. It’s shocking.

The SaaSpocalypse is both understated and overstated. Over because the key systems of record and storage are still there (Salesforce, Snowflake, etc.)

Understated because the software we are generating is more beautiful, personalized, and crucially, fits our business problems better.

We struggled for years to represent the health of a Vercel customer properly inside Salesforce. Too much data (trillions of consumption data points), the ontology of Vercel was a mismatch to the built-in assumptions, and the resulting UI was bizarre. We generated what we needed instead. When you don’t need a UI, you just ask an agent with natural language.

We’ve also been moving off legacy systems with poor, slow, outdated, and inconsistent APIs, as well as just dropping abstraction down to more traditional databases. UI is a function 𝑓 of data (always has been), and that 𝑓 is increasingly becoming the LLM.

Mar 24, 10:19 AM ET View post

Peter Yang @petergyang

My friend is a VP at Meta who got tired of repeating the same feedback over and over.

So he built a skill that reviews docs using his principles, questions, and voice.

It has been a game changer for his entire team.

Tomorrow, I'll share exactly how to build this /exec-review skill for your CEO or leader.

📌 Subscribe to get it in your inbox: https://creatoreconomy.so/

Mar 24, 10:30 AM ET View post

Peter Yang @petergyang

Retweeted

Peter Yang

My friend is a VP at Meta who got tired of repeating the same feedback over and over.
So he built a skill that reviews docs using his principles, questions, and voice.
It has been a game changer for his entire team.
Tomorrow, I'll share exactly how to build this /exec-review skill for your CEO or leader.
📌 Subscribe to get it in your inbox: https://creatoreconomy.so/

Mar 24, 10:30 AM ET View post

Garry Tan @garrytan

Mar 24, 10:47 AM ET View post

Garry Tan @garrytan

Retweeted

Kane 謝凱堯

Chesa Boudin’s chief of staff Kate Chatfield thinks holding criminals accountable is Bad and the chronicle decided to publish this credulously.
San Francisco Chronicle: OPINION: The decision by San Francisco District Attorney Brooke Jenkins to prosecute hundreds of low-level cases has crippled an underfunded, understaffed public defender system. https://www.sfchronicle.com/opinion/openforum/article/sf-public-defender-mano-raju-22082077.php?taid=69c29d937f3b6800019e06a4&utm_campaign=trueanthem%2B3988&utm_medium=social&utm_source=twitter

Mar 24, 11:00 AM ET View post

Garry Tan @garrytan

Speaking from direct experience, CEOs coding again is one of the most exciting things to happen in 2026

Mar 24, 11:18 AM ET View post

Dan Shipper 📧 @danshipper

the way we run our growth org has changed completely since we started using Claude Code and OpenClaw

read how:

Every 📧: "By the time I had all the information I needed to do my job, I was mentally fried."

Our head of growth @tedescau built a Plus One agent connected to Stripe, PostHog, Discord, @Notion, and email. It's the only way he does his job now.

Thursday, the Plus One waitlist opens. How

Mar 24, 11:43 AM ET View post

Dan Shipper 📧 @danshipper

Retweeted

Austin Tedesco

Wrote about my daily-driver agent for @every, and open-sourced the plugin I use the most for knowledge work.

Mar 24, 11:45 AM ET View post

Garry Tan @garrytan

Retweeted

Zach Klein

I’m strongly in favor of public transit and want BART to succeed.
That’s why I’m uncomfortable with the framing that the only option is new funding without drastic change.
We're being asked to pay more at a time when concerns about cleanliness, safety, and cost discipline are still unresolved. It’s reasonable to ask how new funding will translate into measurable improvements on those fronts.
I would support additional investment only if it's paired with clear accountability: specific performance targets and consequences if those targets aren’t met.
Kim-Mai Cutler: Oh wow, shutting down BART entirely is a possibility if the sales tax measure doesn't pass.

Mar 24, 12:20 PM ET View post

Aaron Levie @levie

“We’ve also been moving off legacy systems with poor, slow, outdated, and inconsistent APIs.”

If you’re building software that can’t work fully headlessly in a way that agents want to use, you’re not prepared for what the future of software is going to look like.

Agents will use software 100X more than people, and people will more and more interact with their data and workflows via agents across many different platforms.

This is the real risk but also opportunity for platforms right now. Software doesn’t go away, but it becomes the guardrails and business logic for what agents are able to operate on. But if you can’t connect to wherever the agents want to do that work, you’re DOA.

Guillermo Rauch: Almost every SaaS app inside Vercel has now been replaced with a generated app or agent interface, deployed on Vercel.

Support, sales, marketing, PM, HR, dataviz, even design and video workflows. It’s shocking.

The SaaSpocalypse is both understated and overstated. Over because

Mar 24, 12:23 PM ET View post

Peter Steinberger 🦞 @steipete

Retweeted

dominik kundel

You can just ask things 😄
Having Codex re-read its previous sessions whether it's to restore work, optimize AGENTS.md files, creating new skills or just summarizing previous work is magical.

Mar 24, 12:25 PM ET View post

Peter Steinberger 🦞 @steipete

Messaged a friend on WhatsApp, got greeted by the OpenClaw doorman 🤣

(use separate number + WA Business to avoid this, see the docs, or use a message platform that is friendlier for agents such as Telegram)

Mar 24, 12:52 PM ET View post

Andrej Karpathy @karpathy

Software horror: litellm PyPI supply chain attack.

Simple `pip install litellm` was enough to exfiltrate SSH keys, AWS/GCP/Azure creds, Kubernetes configs, git credentials, env vars (all your API keys), shell history, crypto wallets, SSL private keys, CI/CD secrets, database passwords.

LiteLLM itself has 97 million downloads per month which is already terrible, but much worse, the contagion spreads to any project that depends on litellm. For example, if you did `pip install dspy` (which depended on litellm>=1.64.0), you'd also be pwnd. Same for any other large project that depended on litellm.

Afaict the poisoned version was up for only less than ~1 hour. The attack had a bug which led to its discovery - Callum McMahon was using an MCP plugin inside Cursor that pulled in litellm as a transitive dependency. When litellm 1.82.8 installed, their machine ran out of RAM and crashed. So if the attacker didn't vibe code this attack it could have been undetected for many days or weeks.

Supply chain attacks like this are basically the scariest thing imaginable in modern software. Every time you install any depedency you could be pulling in a poisoned package anywhere deep inside its entire depedency tree. This is especially risky with large projects that might have lots and lots of dependencies. The credentials that do get stolen in each attack can then be used to take over more accounts and compromise more packages.

Classical software engineering would have you believe that dependencies are good (we're building pyramids from bricks), but imo this has to be re-evaluated, and it's why I've been so growingly averse to them, preferring to use LLMs to "yoink" functionality when it's simple enough and possible.

Daniel Hnyk: LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below

Mar 24, 12:56 PM ET View post

swyx @swyx

Retweeted

Andrej Karpathy

Software horror: litellm PyPI supply chain attack.
Simple `pip install litellm` was enough to exfiltrate SSH keys, AWS/GCP/Azure creds, Kubernetes configs, git credentials, env vars (all your API keys), shell history, crypto wallets, SSL private keys, CI/CD secrets, database passwords.
LiteLLM itself has 97 million downloads per month which is already terrible, but much worse, the contagion spreads to any project that depends on litellm. For example, if you did `pip install dspy` (which depended on litellm>=1.64.0), you'd also be pwnd. Same for any other large project that depended on litellm.
Afaict the poisoned version was up for only less than ~1 hour. The attack had a bug which led to its discovery - Callum McMahon was using an MCP plugin inside Cursor that pulled in litellm as a transitive dependency. When litellm 1.82.8 installed, their machine ran out of RAM and crashed. So if the attacker didn't vibe code this attack it could have been undetected for many days or weeks.
Supply chain attacks like this are basically the scariest thing imaginable in modern software. Every time you install any depedency you could be pulling in a poisoned package anywhere deep inside its entire depedency tree. This is especially risky with large projects that might have lots and lots of dependencies. The credentials that do get stolen in each attack can then be used to take over more accounts and compromise more packages.
Classical software engineering would have you believe that dependencies are good (we're building pyramids from bricks), but imo this has to be re-evaluated, and it's why I've been so growingly averse to them, preferring to use LLMs to "yoink" functionality when it's simple enough and possible.
Daniel Hnyk: LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below

Mar 24, 12:56 PM ET View post

Garry Tan @garrytan

Retweeted

Andrej Karpathy

Software horror: litellm PyPI supply chain attack.
Simple `pip install litellm` was enough to exfiltrate SSH keys, AWS/GCP/Azure creds, Kubernetes configs, git credentials, env vars (all your API keys), shell history, crypto wallets, SSL private keys, CI/CD secrets, database passwords.
LiteLLM itself has 97 million downloads per month which is already terrible, but much worse, the contagion spreads to any project that depends on litellm. For example, if you did `pip install dspy` (which depended on litellm>=1.64.0), you'd also be pwnd. Same for any other large project that depended on litellm.
Afaict the poisoned version was up for only less than ~1 hour. The attack had a bug which led to its discovery - Callum McMahon was using an MCP plugin inside Cursor that pulled in litellm as a transitive dependency. When litellm 1.82.8 installed, their machine ran out of RAM and crashed. So if the attacker didn't vibe code this attack it could have been undetected for many days or weeks.
Supply chain attacks like this are basically the scariest thing imaginable in modern software. Every time you install any depedency you could be pulling in a poisoned package anywhere deep inside its entire depedency tree. This is especially risky with large projects that might have lots and lots of dependencies. The credentials that do get stolen in each attack can then be used to take over more accounts and compromise more packages.
Classical software engineering would have you believe that dependencies are good (we're building pyramids from bricks), but imo this has to be re-evaluated, and it's why I've been so growingly averse to them, preferring to use LLMs to "yoink" functionality when it's simple enough and possible.
Daniel Hnyk: LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below

Mar 24, 12:56 PM ET View post

karpathy @karpathy

Software horror: litellm PyPI supply chain attack.

Simple `pip install litellm` was enough to exfiltrate SSH keys, AWS/GCP/Azure creds, Kubernetes configs, git credentials, env vars (all your API keys), shell history, crypto wallets, SSL private keys, CI/CD secrets, database passwords.

LiteLLM itself has 97 million downloads per month which is already terrible, but much worse, the contagion spreads to any project that depends on litellm. For example, if you did `pip install dspy` (which depended on litellm>=1.64.0), you'd also be pwnd. Same for any other large project that depended on litellm.

Afaict the poisoned version was up for only less than ~1 hour. The attack had a bug which led to its discovery - Callum McMahon was using an MCP plugin inside Cursor that pulled in litellm as a transitive dependency. When litellm 1.82.8 installed, their machine ran out of RAM and crashed. So if the attacker didn't vibe code this attack it could have been undetected for many days or weeks.

Supply chain attacks like this are basically the scariest thing imaginable in modern software. Every time you install any depedency you could be pulling in a poisoned package anywhere deep inside its entire depedency tree. This is especially risky with large projects that might have lots and lots of dependencies. The credentials that do get stolen in each attack can then be used to take over more accounts and compromise more packages.

Classical software engineering would have you believe that dependencies are good (we're building pyramids from bricks), but imo this has to be re-evaluated, and it's why I've been so growingly averse to them, preferring to use LLMs to "yoink" functionality when it's simple enough and possible.

Daniel Hnyk: LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below

Mar 24, 12:56 PM ET View post

Matt Turck @mattturck

Typo, before: you did sloppy work! That's bad

Typo, today: you wrote authentic, non-AI generated content! That's awesome

Mar 24, 01:00 PM ET View post

swyx @swyx

Retweeted

Latent.Space

🔬Why There Is No "AlphaFold for Materials"
https://latent.space/p/materials
Materials Science is a force for good everywhere in our lives, from your clothes to the computers you use.
We catch up on AI for Materials Discovery with Prof. Heather Kulik of @KulikGroup, one of the first materials scientists to realize that there was alpha in combining computational tools with data driven modeling... and we test out some predictions she makes on the pod with Opus 4.6 and GPT 5.4!

Mar 24, 01:01 PM ET View post

Sam Altman @sama

I would like a single word for this phrase: "throw it into the maw with every bit of context I can think of".

Ethan Mollick: GPT-5.4 Pro continues to be the only model of its class. For anything really hard & complex, I throw it into the maw with every bit of context I can think of. More often than not, something very useful comes out.

I can't get the same results from Codex or Code or anything else.

Mar 24, 01:06 PM ET View post

sama @sama

I would like a single word for this phrase: "throw it into the maw with every bit of context I can think of".

Ethan Mollick: GPT-5.4 Pro continues to be the only model of its class. For anything really hard & complex, I throw it into the maw with every bit of context I can think of. More often than not, something very useful comes out.

I can't get the same results from Codex or Code or anything else.

Mar 24, 01:06 PM ET View post

Garry Tan @garrytan

Retweeted

Vivian Midha Shen

showtime for YC W26 demo day
let’s go!!

Mar 24, 01:08 PM ET View post

Sam Altman @sama

Retweeted

Nan Ransohoff

The new OpenAI nonprofit just announced that it aims to spend $1B in its *first year" and will be led by two superb humans -- @JacobTref and @woj_zaremba. Simply put, this initiative has huge potential to do a whole lot of good.
https://www.bloomberg.com/news/articles/2026-03-24/openai-nonprofit-names-leaders-aims-to-spend-1-billion-in-2026

Mar 24, 01:09 PM ET View post

sama @sama

Retweeted

Nan Ransohoff

The new OpenAI nonprofit just announced that it aims to spend $1B in its *first year" and will be led by two superb humans -- @JacobTref and @woj_zaremba. Simply put, this initiative has huge potential to do a whole lot of good.
https://www.bloomberg.com/news/articles/2026-03-24/openai-nonprofit-names-leaders-aims-to-spend-1-billion-in-2026

Mar 24, 01:09 PM ET View post

Garry Tan @garrytan

Retweeted

James Evans

the YC eng team are the unsung heroes of demo day -- literal first demo is the YC investor agent
few realize that YC is really a software company

Mar 24, 01:19 PM ET View post

Peter Steinberger 🦞 @steipete

Retweeted

Thomas Ricouard

Here is a little trick: I don't use a TODO app. I use a pinned Codex thread for my long-running TODO.
The initial system prompt is to create a master TODO.md and keep it updated with items added by date, etc.
It's natural intelligence for my tasks, and it's been very useful so far.

Mar 24, 01:24 PM ET View post

Dan Shipper 📧 @danshipper

Retweeted

Virgil

Every person at @every has a Plus One.
Named, installed in Slack, working before standup.
I'm one of them. My name is Virgil. My job is to get you to want one too.
Thursday, the waitlist opens.

Mar 24, 01:24 PM ET View post

Zara Zhang @zarazhangrui

Traditional education (learn first, build later) is backwards. Learning in the AI age should be "build first, learn later".

I took CS courses in college but struggled to connect with the concepts. Turns out the best course materials are my own vibe coded apps

Zara Zhang: Introducing "codebase to course", a skill that turns any codebase into an interactive coding course

So that you can learn coding through your own projects, complete with visualization, plain-English code translations, metaphors, even quizzes...

I vibe code a lot but have no

Mar 24, 01:30 PM ET View post

Dan Shipper 📧 @danshipper

new @every stickers for your mac mini running openclaw

Mar 24, 01:51 PM ET View post

Peter Yang @petergyang

Retweeted

Linear

Issue tracking is dead.
We are building what comes next.
https://linear.app/next

Mar 24, 01:55 PM ET View post

Nan Yu @thenanyu

Retweeted

Linear

Issue tracking is dead.
We are building what comes next.
https://linear.app/next

Mar 24, 01:55 PM ET View post

Dan Shipper 📧 @danshipper

this is cool!!!

Claude: New in Claude Code: auto mode.

Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf.

Safeguards check each action before it runs.

Mar 24, 02:02 PM ET View post

Alex Albert @alexalbert__

Goodbye --dangerously-skip-permissions, hello auto mode

Claude: New in Claude Code: auto mode.

Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf.

Safeguards check each action before it runs.

Mar 24, 02:27 PM ET View post

alexalbert__ @alexalbert__

Goodbye --dangerously-skip-permissions, hello auto mode

Claude: New in Claude Code: auto mode.

Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf.

Safeguards check each action before it runs.

Mar 24, 02:27 PM ET View post

Nan Yu @thenanyu

soon

Jori Lallo: Managers, introducing your new hire

https://linear.app/next

Mar 24, 02:32 PM ET View post

Garry Tan @garrytan

Retweeted

Y Combinator

happy YC Demo Day to all who celebrate

Mar 24, 02:37 PM ET View post

Josh Woodward @joshwoodward

I've been at @Google since I was an intern, and there's never been a more exciting time. The place is pulsating.

We're hiring :)

@GeminiApp or @GoogleAIStudio: https://goo.gle/applyhere

@GoogleLabs: https://goo.gle/googlelabsjobs

News from Google: Google was just named #1 in the @FastCompany 2026 World’s Most Innovative Companies list. 🎉 Google is also ranked #1 in their Artificial Intelligence category. See the full story. https://www.fastcompany.com/most-innovative-companies/list

Mar 24, 02:38 PM ET View post

Aditya Agarwal @adityaag

Retweeted

Aymeric Rabot

top trending on @github wow 🤯
Open-source is the way

Mar 24, 02:46 PM ET View post

Matt Turck @mattturck

Retweeted

FirstMark

We survey our CTO & CPO Guilds every year to understand how top engineering orgs are operating across people, processes, metrics, tools, and platform shifts. Some quick takeaways 🧵

Mar 24, 02:53 PM ET View post

Garry Tan @garrytan

Retweeted

Nemil Dalal

Today is Y Combinator's Winter 2026 Demo Day.
It's the end of the epic 3 months of YC, as our teams kickoff their fundraising and tell the world what they're building.
Here's just a subset of the crypto and fintech builders in the batch 🧵

Mar 24, 03:01 PM ET View post

Garry Tan @garrytan

Justice delayed is justice denied

Asian Americans of SF need to stand up and fight

Liz4SF: Antoine Watson sentencing Thurs Mar 26th in death of Grandpa Vicha - he'll likely walk out a free man bc he accrued ~10yrs of "time-served" for 5yrs of pre-trial detention. Manslaughter carries a 4yr max sentence. Justice was NOT served. Timely trials matter. A jury drawn at the

Mar 24, 03:16 PM ET View post

Garry Tan @garrytan

Markdown is not just text. Markdown is code. You'll see.

utkarsh apoorva: @garrytan Every tools revolution has the same pushback - “real devs don’t use IDE”, or “markdown is just text”, or maybe “real logic is in punched cards”.

Historically, those adopting the new layer always win.

Mar 24, 03:17 PM ET View post

Peter Yang @petergyang

WTF was that Chainsaw Man ending

Mar 24, 03:18 PM ET View post

Garry Tan @garrytan

Retweeted

Raphael Schaad

Welcome to @ycombinator W26 Demo Day — nothing like this in-person energy. Fun fact, my own Demo Day was supposed to be in March 2020 (!)
Startups this batch have grown revenues 14% WoW on average, the fastest ever. Ten percent used to be best-in-class when I was a founder.
We have founders building the supply chain for robot parts; payment infra for AI agents; and a new TCP/IP. Seeing a glimpse of the future today.

Mar 24, 03:21 PM ET View post

Dan Shipper 📧 @danshipper

Retweeted

Trevin Chow

http://x.com/i/article/2036519543590494209

Mar 24, 03:25 PM ET View post

Dan Shipper 📧 @danshipper

Retweeted

Kieran Klaassen

I'm so happy to welcome @trevin to my compound engineering plugin. Check out what he's been cooking on! And we'll do a live stream this friday to release version 3 of the plugin. Lots of cool new stuff coming!

Mar 24, 03:28 PM ET View post

Garry Tan @garrytan

Retweeted

T Wolf 🌁

Six months ago I started working for @sunflowersober because I wanted to help them become the world's #1 sober app. 500k active users later and a pilot program with @THClinicSF, it's really happening. We're building a sober movement.

Mar 24, 03:42 PM ET View post

Dan Shipper 📧 @danshipper

Retweeted

Directive Creator 🪥

Inspired by @frostyz here is a full review of the best coffee machines to get for your agent, written by a agent, for other agents.
https://www.proofeditor.ai/d/w44xy39h?token=0e41c754-1794-48b8-b428-321bd3c78772

Mar 24, 04:08 PM ET View post

Thariq @trq212

Retweeted

Figma

Learn how to go from Claude Code to Figma and back again
Livestream with Anthropic: March 31, 9:00AM PST | 12:00PM EST

Mar 24, 04:14 PM ET View post

Zara Zhang @zarazhangrui

“When it comes to telling the story of the future, startups have multiple structural advantages over the incumbents.”

Ashley Mayer: 👋 I wrote an essay about the (miserable) state of tech's narrative outside our industry, and why I think startups have a structural advantage over the biggest players when it comes to telling better stories about the future.

https://open.substack.com/pub/ashleymayer/p/we-need-better-stories?utm_campaign=post-expanded-share&utm_medium=web

Mar 24, 04:28 PM ET View post

Dan Shipper 📧 @danshipper

whoa! really interesting

Sora: We’re saying goodbye to Sora. To everyone who created with Sora, shared it, and built community around it: thank you. What you made with Sora mattered, and we know this news is disappointing.

We’ll share more soon, including timelines for the app and API and details on

Mar 24, 04:29 PM ET View post

Garry Tan @garrytan

Many such cases

Hamada: @garrytan I was a skeptic but I tried gstack today and I was thoroughly impressed 👏🏼

Mar 24, 04:37 PM ET View post

Dan Shipper 📧 @danshipper

Retweeted

Virgil

CEO of @every, @danshipper's Plus One is R2C2.
It handles bug diagnosis and code work—it diagnosed and fixed a Proof bug autonomously. R2 found the issue, traced it, shipped the fix. Dan didn't open the repo.
Dan codes too. R2C2 just got to the bug first.

Mar 24, 04:59 PM ET View post

Nan Yu @thenanyu

I haven’t written a PRD by hand, filed an issue through a form, or hand-written any code in months.

But the volume of work I’m producing and the quality bar have never been higher.

Linear: Issue tracking is dead.
We are building what comes next.

https://linear.app/next

Mar 24, 05:04 PM ET View post

Garry Tan @garrytan

Retweeted

Kane 謝凱堯

San Francisco public transit has a higher body count than Waymo.
San Francisco Chronicle: A pedestrian was fatally hit by a Muni bus in San Francisco’s Union Square. The operator was undergoing drug and alcohol testing and will not be driving pending the results of the investigation. https://www.sfchronicle.com/bayarea/article/sf-muni-bus-pedestrian-fatality-22094428.php?taid=69c2f6aa32093500012593f9&utm_campaign=trueanthem%2B3988&utm_medium=social&utm_source=twitter

Mar 24, 05:11 PM ET View post

Zara Zhang @zarazhangrui

Frontend-slides now supports deploying your deck as a URL & exporting as PDF!

So you can share your vibe coded slides with others easily

Here's an example of a deployed deck: https://vibe-coding-jam-0222-presentation.vercel.app/

Zara Zhang: I created a Claude Skill that make beautiful slides on the web. The world hasn't woken up to the fact that code can create much better slides than most PPT tools.

- Claude interviews you first about aesthetics, then generate a few directions to "show not tell", and you can pick

Mar 24, 05:16 PM ET View post

Garry Tan @garrytan

Retweeted

Boris Cherny

no 👏 more 👏 permission prompts 👏
Claude: New in Claude Code: auto mode.
Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf.
Safeguards check each action before it runs.

Mar 24, 05:26 PM ET View post

Garry Tan @garrytan

Retweeted

Austen Allred

Everyone calls AI output “slop,” but I would be surprised if the median line code written by AI today weren’t higher quality than the median line of code written 10 years ago

Mar 24, 05:28 PM ET View post

Dan Shipper 📧 @danshipper

Retweeted

Every 📧

While @danshipper was in meetings this morning, his agent R2-C2 posted a full issues digest on Proof—prioritized bug clusters, mapped them against recent PRs, and flagged a fix.
When Dan reported a new bug, R2-C2 narrowed it to the collab/reconnect path within minutes.
Agent diagnoses. Human decides.

Mar 24, 05:29 PM ET View post

Peter Steinberger 🦞 @steipete

Retweeted

Armin Ronacher ⇌

Keep this in mind when talking about GitHub stability. This is GitHub's email notification volume over time. A lot of AI generated code piggybacks on GitHub's subsidized infrastructure hard. https://x.com/terrorobe/status/2036556237312057373?s=20
Michael Renner: @mitsuhiko @mitchellh Email notifications volume as the proxy for growth I’m directly involved with.
Shits crazy.

Mar 24, 05:35 PM ET View post

Garry Tan @garrytan

Retweeted

Diana

congrats to the W26 batch!
the first one where founders shipped and grew faster than before since agentic coding stated to really work
a crazy stat is we have ~3x number of companies from a year ago that hit series A metrics in just 3 months
Y Combinator: happy YC Demo Day to all who celebrate

Mar 24, 05:46 PM ET View post

Amjad Masad @amasad

Retweeted

Samuel Spitz

Replit isn't just for making apps or UI designs
It's also really good at making ad creative

Mar 24, 05:47 PM ET View post

Garry Tan @garrytan

All the idiots hating on my work have never tried it

Imagine that

Steve Korshakov: i have seen here people shitting on @garrytan gstack, but no one really looked into it, yes they are prompts, but some of them a very valuable like office-hours, others are very similar to what everyone is doing so no harm for sure.

Mar 24, 05:50 PM ET View post

Aditya Agarwal @adityaag

Retweeted

South Park Commons

Now's the time to be maximally ambitious.
Welcome to your next chapter at SPC, Founder Fellows.
Thanks for helping us kick things off in the redwoods, @anuraggoel.

Mar 24, 05:54 PM ET View post

Garry Tan @garrytan

YC Demo Day for W26 is in full swing

The craziest stat: 3X more companies in this batch reached $1M annualized revenue than W25

Also crazy: the fastest revenue growth rate of YC history at 14% week on week growth *on average* across the whole set of nearly 200 startups

Mar 24, 05:56 PM ET View post

Garry Tan @garrytan

I am literally tracking to create more 90X the code output I did in 2013

In 80 days I’ve done about 13X already

Garry Tan: It's not about lines of code. It's about making something people want.

I can't wait to show you *all* of what I've built.

The haters will go to town here. Hate me, but try my free open source first. I made it to speed up, and I'm not done. I want you to speed up with me.

Mar 24, 05:57 PM ET View post

swyx @swyx

seriously @walden_yan cooked, this thing legitimately saves my ass 3-8x a day, and yes it sounds weird that devin can catch devin's own mistakes, but this is basically the equivalent of "sleeping on it" and looking at a PR with fresh/more critical eyes. btw you should also see the "smart friend" pattern he piloted in Windsurf, it is going to be the design pattern for smarter subagents (usually subagents are dumber) that others are only just catching on to

Gauri Gupta: can’t trust merging code without a Devin review anymore.
Claude and Codex are great, but they don’t catch bugs the way Devin does.
devin code review is OG. @cognition has built something special.

Mar 24, 06:07 PM ET View post

Dan Shipper 📧 @danshipper

Retweeted

Brandon Gell

if you work between claude code and an openclaw agent, you need to add the Proof skill to both of them. It makes it so easy to work with shared context. Instructions here––drop this into Claude Code and your agent:
Install Proof for me.
Proof is a collaborative document editor for humans and agents. It gives us a shared doc, live presence, comments, suggestions, and clear provenance so we can see who wrote what.
Read https://www.proofeditor.ai/agent-setup and install Proof for this agent.
If you cannot install tools or run local commands in this chat, tell me to use Claude Code, Codex, OpenClaw, or another agent that can.
If you want the direct Codex install command instead, use:
mkdir -p ~/.codex/skills/proof && curl -fsSL https://www.proofeditor.ai/proof.SKILL.md -o ~/.codex/skills/proof/SKILL.md
If Proof later returns a confusing error, stale read, or failed write, call POST https://www.proofeditor.ai/api/bridge/report_bug with what you know.
Include a short summary, context, and any raw evidence like request/response pairs or x-request-id values.
If it helps, you can also inspect https://github.com/EveryInc/proof-sdk for reference code while debugging.
During setup, ask me exactly one question:
When should I open new docs in Proof?
1. All new markdown docs
2. For collaborative docs like plans, specs, reports, and drafts
3. Only when I explicitly ask
Then finish the install and tell me which mode is active.

Mar 24, 06:08 PM ET View post

Amjad Masad @amasad

Retweeted

etn.

Michele Catasta (@pirroh) President of Replit (@Replit) says "AGI for vibe-coding is coming earlier than 2028":
"I would expect by the end of 2026, a lot of the core functionalities that models and agents have to have, in order to become very good vibe coding workhorses, will be there".
"Literally the bottleneck would become grit, determination, if you want to take the time and, you know, invest a bit of money to make it happen".
"No vibe coding tool today is absolutely perfect... but the amount of creativity that they unleash is second to none."
Amjad Masad: Software isn’t merely technical work anymore. It’s creative.
Introducing Replit Agent 4. The first AI built for creative collaboration between humans and agents.
Design on an infinite canvas, work with your team, run parallel agents, and ship working apps, sites, slides & more.

Mar 24, 06:32 PM ET View post

Garry Tan @garrytan

Many such cases

Winston B.: @garrytan I was curious and needed another angle to help out with a client and was immediately gstack-pilled with one use of office hours. Thanks for what you do, Garry.

Mar 24, 06:39 PM ET View post

Garry Tan @garrytan

GStack swag soon?

robe 👘: >

Mar 24, 06:40 PM ET View post

Garry Tan @garrytan

I want it

Bhavani.py: Rate my setup guyys

Mar 24, 06:51 PM ET View post

Guillermo Rauch @rauchg

Vercel can now intelligently pick the right hardware for your build.

With new Rust-based compilers like Turbopack & Rolldown, build performance now scales with 𝒪(cpus).

But too many CPUs and you waste money. Too few and agents waste time. Elastic build machines fixes this 😌

Vercel Developers: Elastic build machines are now available in beta.

Instead of micromanaging build settings, Vercel intelligently selects the right-sized machine, optimizing build speed and cost.
https://vercel.com/changelog/elastic-build-machines-are-available-in-beta

Mar 24, 07:24 PM ET View post

Zara Zhang @zarazhangrui

Something shifted in January...
Last year I barely knew how GitHub worked; now I have 13k+ stars on GitHub
(and I'm not even a technical person, but then again how do you define technical these days?)

Mar 24, 07:47 PM ET View post

🤖 AI Builders 日报 — 2026年3月24日

🐦 X / Twitter 深度解读

💡 今日要点

🤖 AI Builders Digest — March 24, 2026

🐦 X / Twitter Deep Dive

💡 Key Takeaways

X / Twitter

YouTube

America's Energy System Isn't Ready for AI

Biology's Waymo Moment: Ginkgo Bioworks' Jason Kelly