GStack is not just my batmobile, I made it so it could be yours too
Siddharth: again just a reminder @garrytan is batman. and gstack is the batmobile
NO RAGRETS
MyName: @garrytan “Dress like a lobster in all things.”— 1 Tan 4:20
finally: @simonlast + @sarahmsachs on Latent Space!
Notion has rebuilt Notion AI 5 times. This is the first time Simon has told the entire story.
I've been trying to do this interview for ~3 years. We run @latentspacepod on Notion since inception, as does every other top tech company. Notion is one of the top ~3? knowledge work tools in the world, crossing 100M users in 2024 and now shipping the AI productivity suite that @ivanhzhao wants to be "steel and steam for organizations" — the backbone of a new Industrial Revolution of Infinite Minds that will change the world.
Latent.Space: 🆕 The Full Story of Notion AI
https://latent.space/p/notion
We're so excited to chat with @simonlast and @sarahmsachs about Notion's "Token Town" - the crack team of AI Engineers and Model Behavior Engineers entrusted with building AI for Silicon Valley's most beloved knowledge work
Paul Solt
OpenAI shipped GPT-5.4-Cyber. A model built to find and fix software exploits.
More capable than Mythos… and available today.
1. Binary scanning. Agents can find exploits in compiled apps… no source code required. That’s a new attack surface.
2. Prompt Refusals are lower. Verified defenders get a more permissive model than the public version.
3. Access is tiered by identity. Individuals verify at http://chatgpt.com/cyber. Enterprises go through a rep.
4. Codex Security has fixed 3,000+ critical vulnerabilities automatically.
5. They’re scaling to thousands of verified defenders.
The binary scanning unlock is scary. Stuff like this hasn’t been mainstream before.
Agents finding exploits without ever seeing your source code.
OpenAI: We’re expanding Trusted Access for Cyber with additional tiers for authenticated cybersecurity defenders.
Customers in the highest tiers can request access to GPT-5.4-Cyber, a version of GPT-5.4 fine-tuned for cybersecurity use cases, enabling more advanced defensive workflows.
Neil deGrasse Tyson
If current cuts to the Federal Science Budget proposed by the White House are approved by Congress, that will far-and-away be the largest cut to science since the United States began funding it.
Republicans & Democrats alike know there’s no surer path to Making America Not-Great
Re i didnt realize this lazy no effort post did so well… lol… chalk another W up to @josephofiowa’s Maps Theory of Twitter
GPT-5.4 Pro for making beautiful contributions to mathematics:
Leeham: GPT-5.4 Pro solves Erdős Problem #1196!
Very pleased with this result; definitely my favourite thus far! This problem has been thought about for some time which makes this reasonably impressive and meaningful (see Lichtman's comments below).
Formalisation is underway!
Gianl1974
Hey Republicans;
He pardoned 1,600 violent criminals. You said nothing.
He bulldozed the East Wing. You’ve said nothing.
He’s interfered with the release of the Epstein files. You’ve said nothing.
He took over the Kennedy Center, even renaming it after himself. You’ve said nothing.
He accepted a $400 million airplane as a personal gift. You’ve said nothing.
He’s threatened Canada, Cuba, Denmark, Greenland, Venezuela, Colombia, Brazil. You’ve said nothing.
He’s tariffed just about everyone but Russia, causing inflation and instability worldwide. You’ve said nothing.
He attacked a nation during mediated negotiations. You said nothing.
His ill-conceived war killed 175 little girls in its first days. You’ve said nothing.
He’s alienated and insulted more countries than I can keep track of. You’ve said nothing.
His ICE Army is terrorizing and murdering U.S. citizens. You've said nothing.
He has committed murder on the high seas. You've said nothing.
He's co-opted the Justice Department and directed it to prosecute his political enemies. You've said nothing.
You've not only said nothing to all of these egregious acts, and many more, but you have also enabled them.
And it’s only been a year.
Hey, Republican Congressmen, you took an oath, remember?
Not to him. To the Constitution.
It’s time to do your fucking jobs!
I built so many one-off conversion apps — now it’s just a skill.
Samuel Spitz: Introducing Replit File Converter
Skip the sketchy ad-riddled file conversion websites.
Now, you can just use Replit instead.
try the TurboTax app in ChatGPT:
Intuit: ⏰ Beat the 4/15 tax deadline. TurboTax in @ChatGPTapp just got an upgrade in time for tax day - get a personalized tax checklist and upload docs to help you maximize your refund when you file with @TurboTax. 💸🤖 https://bit.ly/3OBPWzA
btw the famous slack chart is slack propaganda and everyone who cites it is legally obligated to also link to @sophiebits
Nikunj Kothari: Every time I see a tweet saying “I can vibe code this in a weekend” - I think of the slack notification system..
It takes time, persistence and effort to get the details right.
Sure, a lot of simple workflows will get vibe coded away. And maybe you can put this in Claude Code
Eleanor Berger
The OpenAI $100 pro plan is frustrating. I got it with the intention of mostly using it for agentic non-coding stuff (I use my GitHub Copilot Pro+ for coding), but because the quota is so generous I don't manage to utilise much of it. And that makes me feel like a loser. 😢
Monologue
Coming next week!
Naveen Naidu: One of the best apps I saw on Apple Watch (I’m biased)
Lex
KYC on a subscription service is a death sentence for consumer products. Most view it as a severe overreach.
Anthropic making unexplainable decisions rn lmao
Would you hand over your gov id to use Claude code?
玩个锤子: 注意了
现在开 claude max 有概率需要 kyc 咯
进一步封锁。。。
Onur Solmaz
You need to understand one fact about OpenClaw
People are biased and incentivized to spread disinformation about OpenClaw. That is because OpenClaw IS NOT PUMPING ANYONE’S BAGS, unlike most other projects
Literally every other for-profit agent product is incentivized to trash OpenClaw, BECAUSE OpenClaw is a neutral third party across the industry and geopolitical scene. They MAKE MONEY when OpenClaw loses
OpenClaw does not worry about making money for some investors. Its founder @steipete is a successful exited founder. He is motivated by having fun and democratizing AI, literally. That is why he is suddenly so loved by everyone. He cares about PEOPLE, not MONEY
“OpenClaw is bloated”
-> Since beginning of March, OpenClaw is thinning its core and putting functionality in plugins behind a plugin SDK. Having numerous plugins to choose from does not mean bloat. This was already copied by others and is still a work in progress
“OpenClaw is not secure”
-> OpenClaw has the most eyeballs and immediately addresses any security advisories as soon as they come. It is the most secure agent, by sheer pressure
“OpenClaw is bought by OpenAI”
-> Then why is my bank account so empty bro??? All maintainers are literally unpaid and working DOUBLE beside their dayjobs to ship features to you. Do you think VC money can buy that kind of commitment?
Once you understand these facts, you’ll like OpenClaw even more. Because OpenClaw is your AI, People’s AI
And you can join us too. OpenClaw is the easiest-to-join project in AI right now. You just need to start using it, and start making good contributions. If you are competent, you can become a maintainer, and join the rest of the team making history!
Matt Pocock
Been waiting a month for Anthropic to answer a simple usage question about Claude Code subscriptions
Have I been ghosted
Matt Pocock: Can I get some questions answered by someone at Anthropic?
1. Can you use an OAuth token generated from a subscription to power the Claude Agent SDK strictly for using Claude Code in a local dev loop?
All I want is a more reliable API for parallelizing multiple Claude Code's.
Nimrod Gutman
Codex installed new firmware on my super old HP Color LaserJet MFP M277dw (after scanning stopped working)🥹
I love this new world.
once again, I’m amazed by scammers.
Shadow: Fake Krill accounts are going around, one of them was trying to get people to install ScreenConnect
Don't fall for it, the only real Krill is the one in our server and it will never DM you :)
old startup thinking: make a tool to solve a problem
new startup thinking: make a personality that can solve problems
when AI causes problems in any part of a process, the solution is always AI at the other end of the process
many such cases
Shaw (spirit/acc): The quality of your vibecoded slop is horrible. I've seen it. Absolute dogshit.
Fortunately, there is a fix.
Use this prompt:
I want to clean up my codebase and improve code quality. This is a complex task, so we'll need 8 subagents. Make a sub agent for each of the following:
Mario Zechner
Re hilarious. @threepointone Forky McForkyface strikes again. on an organizational level.
I recently came back from a 2-week trip to China and it was eye-opening to see how the world's second largest economy operates.
I think every product builder should visit at least once to understand:
→ Chinese AI work culture
→ Electric vehicles, $2 delivery, and more
→ How people in China live and work
📌 Here are 15 observations from my visit: https://creatoreconomy.so/p/15-observations-on-work-and-life
The software factory is the product: http://open-agents.dev
Elon Musk: @skorusARK The factory is the product
Vincent D. Warmerdam
It really took me a while to "get it" when it comes to nbdev. But I gotta hand it to @jeremyphoward this way of working makes too much sense once you're used to it.
As of today, I am working on tools that make this kind of work possible in @marimo_io.
https://youtu.be/ZLg27UmAJbw
If you look at GPT 5.4-Cyber and it's ability for closed source reverse engineering, I have bad news for you.
I do very much feel the pain though, there's hundreds of teams that try to poke holes into @openclaw. Our response has been of rapid iteration and code hardening. Which did introduce occasiaonal regression (and yes you all been yelling at me), but I see as the only way forward.
I would be very careful of other open source projects/harnesses that ignore this work and do not publish their advisories. https://github.com/openclaw/openclaw/security/advisories
Bailey Pumfleet: Open source is dead.
That’s not a statement we ever thought we’d make.
@calcom was built on open source. It shaped our product, our community, and our growth. But the world has changed faster than our principles could keep up.
AI has fundamentally altered the security
Enrico Shippole
We @TeraflopAI have worked together with @johngfriedman and @daftengine to open-sourced all major filings from SEC EDGAR completely for free on @huggingface. It is now more important than ever to push for open dataset releases.
TeraflopAI: Given the increasingly closed-source nature of the U.S. AI ecosystem, it is now more important than ever to push for the proliferation of open model and dataset releases. Datamule (@johngfriedman), @TeraflopAI, and @daftengine collaborated to release 43 Billion Tokens of SEC
Feels like there's an outage for Claude every other day - I wonder if this is related to the pace of shipping, just scaling compute, or something else?
More on GPT-5.4 Pro’s latest mathematical contribution:
“The closest analogy I would give would be that the main openings in chess were well-studied, but AI discovers a new opening line that had been overlooked based on human aesthetics and convention.”
Jared Duker Lichtman: In my doctorate, I proved the Erdős Primitive Set Conjecture, showing that the primes themselves are maximal among all primitive sets.
This problem will always be in my heart: I worked on it for 4 years (even when my mentors recommended against it!) and loved every minute of it.
If finding security flaws is fully automated with frontier models à la Mythos, then GitHub should have a metric, like stars, showing how much compute is spent securing/hardening an open-source package. Example:
📦 linus/linux
⭐️ 200k 🦾 $239M
Only way OSS can be trusted.
“Where do you run inference?”
“allbirds”
“The shoes?”
“Yea”
Mike Solana
“sir, can you please address the cost of housing”
“sorry best I can do is wage a civil war on behalf of honduran fentanyl dealers”
Tom Steyer: "Wow" is right. ICE is a criminal organization. As governor, I'll prosecute them like one.
This is what the future of design looks like. Not just this specific tool¹, but the fact that every team in the world is now is empowered to build their own 'design factory'.
Shader Lab was built with Claude Code, @threejs, @nextjs, and @vercel. To the exact needs, vision, and specification of @basementstudio.
Every time we work on a project with them, we get a glimpse of an arsenal of internal tools they've deployed. Some built specifically for a project, some more general purpose.
It's now easier to generate software re-assembling powerful building blocks, than searching and procuring the right SaaS for the job.
¹ though it's a banger
basement.studio: gm, today we're launching Shader Lab, like photoshop but for shaders
• design slick layered shader compositions
• export high-quality assets or shaders
• OSS package to plug & play
↳ https://eng.basement.studio/tools/shader-lab
clem 🤗
Weird how some people always target open-source in AI!
First it was:
“Open-source AI will destroy the world” (spoiler: it didn't and it won't)
Now:
“Open-source is a cybersecurity threat because of AI”
Both narratives are far too simplistic.
The truth is that the exact same risks exist in closed-source systems, often even more so. For example, in practice, APIs can create much bigger data and security vulnerabilities than open systems you can inspect, self-host, and secure yourself.
And as with software more broadly, open-source often ends up more secure because it benefits from far more scrutiny than private internal systems.
The reality is not “open vs closed.”
The reality is that AI is raising cybersecurity stakes across the board, and we need to tackle that seriously together.
everything is coding agent
New course: Spec-Driven Development with Coding Agents, built in partnership with @jetbrains, and taught by @paulweveritt.
Vibe coding is fast, but often produces code that doesn't match what you asked for. This short course teaches you spec-driven development: write a detailed spec defining what to build, and work with your coding agent to implement it. Many of the best developers already build this way.
A spec lets you control large code changes with a few words, preserve context across agent sessions, and stay in control as your project grows in complexity.
Skills you'll gain:
- Write a detailed specification to define your mission, tech stack, and roadmap, giving your agent the context it needs from the start
- Plan, implement, and validate features in iterative loops using a spec as your agent's guide
- Apply the same repeatable workflow to both new and legacy codebases
- Package your workflow into a portable agent skill that works across agents and IDEs
Join and write specs that keep your coding agent on track!
https://www.deeplearning.ai/short-courses/spec-driven-development
Kieran Klaassen
The mistake isn't automating too much; it's not knowing *when* to think.
Building Cora, I found two moments where humans belong in the loop: **Brainstorm** (what to build) and **Polish** (is it actually good?). Everything else – plan, code, review, test, PR – is automated.
`/ce-polish` is the new step I'm adding to Compound Engineering. Check out the branch, run the app, annotate what feels off. Sub-agents fix it while you use it.
Made a mermaid diagram to make it visual – distinct colors for human vs. automated phases. That clarity changed how I think about the whole flow.
Talking through this at the Compound Engineering camp Friday. Join @trevin @danshipper and me👇
Bilal
People don't get it.
I'm able to get Gemma-4-31B running at 15 tokens per second at a nice walking pace and up to 50 tokens when running now. This is the future of local healthy sustainable AI.
Amjad Masad: “Where do you run inference?”
“allbirds”
“The shoes?”
“Yea”
Temporal
Replit runs millions of AI agents. The challenge is keeping them reliable once they’re in the wild.
Thread 🧵
Shopify Engineering
Since we open-sourced pi-autoresearch, @Shopify teams have been running it on everything.
Results so far:
Unit tests: 300x faster
React component mounting: 20% faster
CI build time: 65% reduction
Made pnpm run faster
Autoresearch never stops trying things you'd never have time to try.
Repo: https://github.com/davebcn87/pi-autoresearch
That was the case in December. 4 months and thousands of work hours later, we have a great security concept; you can go all yolo, use a sandbox (Docker or OpenShell), there are allow-lists and per-access exec allow/deny prompts.
There’s hundreds of security researchers that pen-tested it.
Max Wolter: @steipete @openclaw I don't think OpenClaw is a reference. It literally doesn't have a proper security model. Nothing on OpenClaw is secure by design.
Nathan Lambert
I spent some time trying to distill all the complex factors impacting open models -- economics, capabilities, distribution, policy, etc. -- into a clear list of beliefs. Here they are in full.
1. It’s surprising that the top closed models did not show a growing capability margin over open models, based on compute differences for training and research, especially in the second half of 2025 and through today.
Wenlong Huang
I recently gave some talks on PointWorld. In this latest version, I discussed: Why world models? Why 3D? Why it matters amidst scaling data in robotics? Why it’s a missing side of the coin for “The Bitter Lesson”?
(It’s more than just a better backbone for training policies)
https://www.youtube.com/watch?v=0vfgm8LshmY
The AI Talks: The recording video is here: https://youtu.be/0vfgm8LshmY
banger
Kevin Kwok: Allbirds is not pivoting. They have always made the commodity hardware startups run in
Replit ⠕
Where code becomes culture.
Coming soon.
Join the waitlist → http://vibecon.ai
Reiner Pope
I chatted with @ysmulki about MatX, chip design and where silicon designed for LLMs is headed
(8:17) Tightly coupling SRAM and HBM on one chip
(14:03) More MoE FLOPS, smaller KV cache load
(16:08) Numerics: from 32-bit to 4-bit
(19:02) Targeting both training and inference
(22:14) Chip timelines
(27:15) Logic and memory scarcity
(29:42) Compute costs
(32:07) Latency: from 20ms to 1ms as the new table stakes
(40:50) Programming the chip
(43:00) Starting MatX
(47:11) Codesign without seeing the models
(51:57) Interconnect design
(55:44) Performance modeling philosophy
(1:07:02) Prefill vs. decode
(1:13:47) What's next
Tired of winning
James E. Clyburn: Trump has abandoned our farmers with his senseless war in Iran. The impact is worst in the South.
Diesel prices are up 46% since the war began. Nitrogen fertilizer is up 30%.
In the South, 81% of farms—including 84% of small farms—didn’t pre-book fertilizer, meaning they’re now
Nirit Weiss-Blatt, PhD
"Young, anxious followers, looking for purpose, can be radicalized by apocalyptic AI rhetoric […] The real question is how long the people fueling AI panic expect to avoid responsibility for where that radicalization leads, especially for the most vulnerable."
I've commented that "this is the year of subagents", but that is largely an optimization problem.
the inverse problem - having agents that compose and boss agents that manage/query them - is a capabilities one.
as an advisor to cog, proud to have played a small part in designing the new Spaces concept 3 months ago and today's launch is a start of even more to come. congrats to the team!
Windsurf: Introducing Windsurf 2.0.
Manage all your agents from one place and delegate work to the cloud with Devin - so your agents keep shipping even after you close your laptop.
It is generally frowned upon to have LLMs precisely regurgitate part of their training set, but it is an interesting question how you could use LLM training to nearly losslesly compress a huge corpus like the entirety of the Internet Archive.
The Hutter Prize is for perfect compression, but only one GB. There would be different trades at the PB level, and it gets much more interesting when it doesn’t have to be bit-accurate.
Armin Ronacher ⇌
Why do people call Gas Town and Beads malware? I can’t put my finger on it. https://github.com/gastownhall/gastown/issues/3649
Astasia Myers
Best part are retrieval and search details:
>Agent queries differ from human queries & what good results means changes too
>Parallel queries and ranking are both tools to the same outcome
>Top-K precision > positional ranking
>Embedding model choice matters less than you think
Latent.Space: 🆕 The Full Story of Notion AI
https://latent.space/p/notion
We're so excited to chat with @simonlast and @sarahmsachs about Notion's "Token Town" - the crack team of AI Engineers and Model Behavior Engineers entrusted with building AI for Silicon Valley's most beloved knowledge work