THE SWELL #18

SAIL Weekly Digest | May 18 – May 22, 2026 | Issue #18

May 22, 2026

This week, SAIL authors tracked the shifting financial fault lines of the AI boom. A blockbuster Cerebras IPO, the creeping “token anxiety” inside every AI budget, and Anthropic’s march into mainstream business all point to a market finally putting a price on AI scale — while the contest with China played out over talent, prestige, and open models. The throughline: intelligence is getting cheaper to run and more expensive to win.

The Week in AI

Google I/O 2026 — Google launched Gemini 3.5 Flash on May 19, immediately making it the default model in the Gemini app and Search worldwide, alongside Gemini Omni Flash, the Spark agent, and Android XR glasses — while Gemini 3.5 Pro slipped to June.

OpenAI files to go public — In a frantic May 21 stretch, OpenAI filed for an IPO (~$25B ARR against an $852B valuation) while Anthropic circulated prospectus-style numbers showing a projected $10.9B in Q2 revenue, up 130% quarter over quarter, and its first quarterly operating profit.

Anthropic locks in compute — Anthropic agreed to pay SpaceX roughly $1.25B per month through May 2029 — about $45B in total — as its raise of $30B+ at a $900B-plus valuation nears close, a figure that would eclipse OpenAI’s valuation for the first time.

This Week from SAIL Authors

The Trump–Xi Moment & the Policy Game

Catching up? We covered the lead-up last week in ChinaTalk's “Xi-Trump to talk AI Safety, Huh?” and “Macartney to Mar-a-Lago.”

Watching the Super Bowl of US-China Nerds — Kevin Xu frames the Trump–Xi event as the championship game for US-China AI watchers: a lot of noise, with the long game still ahead. — Interconnected. Read more

Prestige on the Cheap — ChinaTalk reads the Trump–Xi summit through Cold War history, asking what real leverage looks like beneath the pageantry. — ChinaTalk. Read more

Doing Big Things in Policy — Jordan Schneider talks with Kumar Garg and Remco Zwetsloot about finding the “white space” in policy and steering scarce talent toward neglected, high-impact work. — ChinaTalk. Read more

The Economics of AI Scale

Cerebras and the IPO pop — Cerebras’s 107% first-day jump is less a hype signal than evidence the market is finally pricing in real demand for AI inference. — Exponential View. Read more

Data to start your week: The cost of tokenmaxxing — Tokens are now one of the fastest-growing, least predictable line items in the AI stack — goodbye fixed costs, hello token anxiety. — Exponential View. Read more

Exponential View #574 — Anthropic pushes into Main Street as Microsoft and OpenAI consciously uncouple, plus the case for AI pluralism against a sycophantic model monoculture. — Exponential View. Read more

Open Models & Efficient Architectures

Latest open artifacts (#21): Open model bonanza — Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, and GLM-5.1 all landed, and a CAISI evaluation weighs how far open models still trail the American frontier. — Interconnects. Read more

Recent Developments in LLM Architectures — A tour of how new open-weight models cut long-context costs through KV sharing, multi-head compression, and compressed attention. — Ahead of AI. Read more

AI Hits the Real World

Why it might not make sense for you to own a self-driving car — A ride in Tensor’s prototype makes the case that full-autonomy hardware may be too expensive to personally own, leaving robotaxis as the likelier future. — Understanding AI. Read more

AI-Native Healthcare: 100M Doctor Visits, 10–20 Hours Saved — Abridge’s Janie Lee and Chai Asawa on scaling clinical-documentation AI across 250 health systems, and why healthcare’s fatal downside risk shapes how you build. — Latent Space. Read more

Notes on AI, labor, and China — An expansion pack to Jasmine Sun’s New York Times essay on AI, work, and Silicon Valley’s growing fear of a “permanent underclass.” — Jasmi.News. Read more

Full Library

Access the complete, searchable archive of SAIL Media in our Sitemap.

Doing Big Things in Policy — Jordan Schneider & Phoebe Chow, ChinaTalk
AI-Native Healthcare: 100M Doctor Visits, 10–20 Hours Saved, Prior Auth in Minutes — Swyx, Latent Space
Exponential View #574: Inside Anthropic’s rocket ship — Azeem Azhar, Exponential View
Why it might not make sense for you to own a self-driving car — Timothy B. Lee, Understanding AI
Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention — Sebastian Raschka, Ahead of AI
Latest open artifacts (#21): Open model bonanza! — Nathan Lambert & Florian Brand, Interconnects
Cerebras and the IPO pop — Azeem Azhar, Exponential View
Notes on AI, labor, and China — Jasmine Sun, Jasmi.News
Data to start your week: The cost of tokenmaxxing — Azeem Azhar & Hannah Petrovic, Exponential View

Additional content on the SAIL Substack — supplementary coverage of the Trump–Xi event:

Watching the Super Bowl of US-China Nerds — Kevin Xu, Interconnected
Prestige on the Cheap — Jordan Schneider, Kevin Xu, Sergey Radchenko & Lily Ottinger, ChinaTalk

SAIL Media
