Four Futures series - mapping the machine-speed economy

Four Futures: Mapping the Machine-Speed Economy

TL;DR The Four Futures series asks: as AI collapses build times and concentrates infrastructure, which economic future are we actually selecting for? Read in this order: framework → signals → century horizons Four scenarios: Broad Abundance, Winner-Take-Most, Techno-Feudalism, Managed Transition Full series index: /series/four-futures/ Start here Four Futures for the Machine-Speed Economy - the map: four plausible outcomes and what to watch for Reading the Signals: Which of the Four Futures Is Actually Emerging? - scoring real-world signals against the framework as of 2026 The Year 2126: What the Next Hundred Years Actually Looks Like - century-scale consequences if the transition goes well or badly The Year 3026: Thinking Seriously About a Thousand Years From Now - what, if anything, holds value across civilisational time Supporting reading The Free Intelligence Era: What Breaks When Thinking Costs Nothing - the abundance-side argument in detail The Automation Paradox: Why More AI Makes Human Judgment More Valuable - what stays human as machines accelerate The Meaning of Work in an Age of Abundance - what work is for when production gets cheap Policy on the AI Exponential - institutional responses and governance lag Expertise and Work in the Age of AI - how trust and accountability reshape human roles Related paths AI Economics and Hardware: A Reading Path - cost and infrastructure constraints underneath every scenario Trust: Conditions for Deploying AI Agents in Production - what has to be true before handing real work to agents Related Reading Human Advancement Is Accelerating - the longer exponential curve behind the machine-speed frame What It Means to Be an Expert in 2030 - expertise futures inside a winner-take-most world Scott Galloway on AI - one outside framing of concentration and platform power

June 30, 2026 · 2 min · James M
Cursor iOS app launching coding agents from a phone

Cursor on iOS: When the Code Editor Becomes a Remote Control

TL;DR On June 29, 2026, Cursor released a native iOS app in public beta, available on all paid plans, for iPhone and iPad You can launch cloud agents from your phone - pick a repo, describe the task by voice or text, use slash commands, choose a frontier model, and let an agent run in an isolated VM Remote Control lets you take an agent already running on your desktop and keep steering it from your phone, with an option to keep the machine awake while you’re away Live Activities put agent status on your lock screen; you get push notifications, can review demos, screenshots and logs, inspect diffs, and merge pull requests without opening a laptop A launch promo gives 75% off Composer 2.5 runs in the mobile app through July 5, 2026 This lands months after SpaceX’s move on Cursor - and reframes the editor as an orchestration surface rather than a place you type code I’ve written about Cursor enough times on this blog that a phone app could have been a footnote. It isn’t. Not because the app itself is revolutionary - it’s a well-made mobile client - but because of what it quietly admits about how the work has changed. For most of software history, the editor was where you sat and typed. Cursor’s iOS app is built on the assumption that you mostly aren’t typing anymore. You’re directing. ...

June 29, 2026 · 8 min · James M
Five archetypes for a post-role team

Five Archetypes for a Post-Role Team

TL;DR Boris Cherny, who built Claude Code at Anthropic, posted a short framing: as engineering, product, design, and data science melt into one role, he sees five archetypes on his team The five are Prototyper, Builder, Sweeper, Grower, and Maintainer - and crucially, none of them map cleanly to a job title The interesting claim is not the list, it is the decoupling: the archetype is a description of what energy you bring to a system, not what your contract says you do I think the framing is genuinely useful as a self-diagnostic, and quietly radical for how teams get staffed and rewarded Where it leaves me unsure: it describes a steady-state team that already exists, and says less about how you grow people into these shapes, or what happens to the people who do not fit any of them A short post on X has been rattling around my head for a few days. Boris Cherny, who built Claude Code at Anthropic, was reflecting on what happens to roles when the old functional boundaries stop meaning much. His observation: when he looks at the Claude Code team, he does not really see engineers, designers, PMs, and data scientists. He sees five archetypes that cut across all of them. ...

June 29, 2026 · 14 min · James M
OpenAI IPO filing and ChatGPT market share falling below 50% for the first time

The $2.22 Problem: OpenAI's IPO and the First Crack in the ChatGPT Monopoly

TL;DR On June 8, 2026, OpenAI filed a confidential S-1 with the SEC, targeting a September 2026 public listing with Goldman Sachs and Morgan Stanley as underwriters The private valuation sits at $852 billion, with analysts projecting a debut above $1 trillion - one of the five largest IPOs in US history The same week, ChatGPT’s market share fell below 50% for the first time - to 46.4%, with Gemini at 27.7% and Claude at 10.3% OpenAI’s Q1 2026 non-GAAP operating margin was negative 122%: it spends $2.22 for every dollar it earns Noam Shazeer - co-author of Attention Is All You Need and the AI talent Google paid $2.7 billion to retain in 2024 - just left Google to join OpenAI Anthropic filed its own S-1 a week earlier, on June 1, targeting October, at a $965 billion valuation - the two biggest AI labs are racing to Wall Street simultaneously The timing is almost too perfect to be coincidence - and yet it is. On June 8, 2026, OpenAI submitted a confidential S-1 registration with the SEC, beginning the legal process toward a public listing. The same week, for the first time since ChatGPT launched in November 2022, OpenAI’s flagship product held less than half of the global AI assistant market. The company is going to Wall Street at the precise moment it is no longer the only name in the room. ...

June 27, 2026 · 10 min · James M
SpaceX acquires Cursor AI code editor

SpaceX's $60 Billion Cursor Acquisition: Why It Matters

TL;DR SpaceX filed a $60 billion all-stock acquisition of Cursor on June 16, 2026 - marking one of the largest AI/developer tools acquisitions ever (confirmed via SEC filing) Cursor’s revenue metrics are impressive: ~$4 billion annualized revenue with $2.6 billion from enterprise customers, suggesting strong product-market fit Strategic pivot: SpaceX is moving beyond rockets and satellites into the software infrastructure layer that powers AI development itself Signal to the market: This acquisition suggests major tech companies are betting heavily on owning the entire stack - from hardware to the tools developers use to build AI systems Enterprise focus: The majority of Cursor’s revenue coming from enterprise (65%) indicates this is a B2B infrastructure play, not just a consumer developer tool Why SpaceX Acquiring Cursor Matters On the surface, it might seem odd that a company known for rockets and space exploration would acquire an AI code editor. But this acquisition reveals something fundamental about how the largest technology companies are thinking about AI development infrastructure. ...

June 16, 2026 · 5 min · James M
Evaluating agents in production with trajectory metrics

Evaluating Agents in Production: Trajectory Metrics, Not Just Final Answers

TL;DR Endpoint evals miss the failure mode that hurts in production - an agent can reach the right answer through a reckless path: wrong tool first, lucky recovery, ignored constraints that did not bite this time Trajectory evaluation scores the run: which tools were called, in what order, with what arguments, and whether each step satisfied policy The minimum viable setup: 50–200 real examples, per-step rubrics, 10+ runs per example, statistical regression tracking, and a held-out set you never tune against Replay harnesses let you re-run a captured trace against a new model or policy without re-hitting production systems This is the measurement layer that connects broken public benchmarks to agent security - you cannot harden what you cannot observe AI Evals Are Broken argued that leaderboard numbers stopped measuring production capability. Securing AI Agents argued that the tool layer must enforce policy the model cannot be trusted to enforce. This post is the bridge: how you measure whether an agent actually behaves before and after you ship. ...

June 14, 2026 · 6 min · James M
World Models - What Comes After the Language-Only Era Banner

World Models: What Comes After the Language-Only Era

TL;DR Language-only models do not contain a reliable simulator of physical reality - they contain a statistical shadow of one, good enough for many tasks and dangerously wrong for others. A world model is a system that learns to predict how an environment evolves and can plan inside that prediction - not just describe it in text. The gap matters for agents that must act in physical space, manipulate objects, or reason about counterfactuals where the answer is not in the training corpus. The 2026 frontier includes generative world simulators, vision-language-action models for robotics, and sim-to-real pipelines - not one breakthrough but a stack assembling in parallel. For builders today: language agents with MCP tools are the right architecture for knowledge work. World models are the path to agents that can competently act in the physical world. Almost everything I have written about AI agents assumes a model whose understanding of the world arrives through text. That assumption has carried the field a long way. Context engineering, tool use via MCP, memory across sessions - all of it sits on top of language models that read, reason, and call APIs. ...

June 13, 2026 · 9 min · James M
Government directive to suspend Fable 5 and Mythos 5 access

Pulled From The Shelf: The Government Order to Suspend Fable 5 and Mythos 5

TL;DR On 12 June 2026 at 5:21pm ET, the US government issued an export control directive ordering Anthropic to suspend all access to Fable 5 and Mythos 5 - globally, for every user, including Anthropic’s own employees The stated reason is national security: the government believes it has identified a method of jailbreaking Fable 5. Anthropic says the evidence was verbal only and describes a narrow, non-universal technique - essentially asking the model to read a codebase and fix software flaws Anthropic reviewed a demonstration and found it surfaced a small number of previously known, minor vulnerabilities that are widely available from other models Anthropic disagrees that a narrow jailbreak justifies recalling a commercial model deployed to hundreds of millions of people, and warns the same standard would “essentially halt all new model deployments for all frontier model providers” All other Anthropic models are unaffected. The company says it believes this is a misunderstanding and is working to restore access Four days. That is how long Mythos-class capability lasted as a publicly available product before the US government ordered it off the shelf. ...

June 13, 2026 · 10 min · James M
AI Evals Are Broken - Why Benchmarks Stopped Measuring Real Capability Banner

AI Evals Are Broken: Why Benchmarks Stopped Measuring Real Capability

When a frontier lab releases a new model in 2026, the press release leads with a row of benchmark scores. The numbers are bigger than they were a year ago, the model is the new state-of-the-art on whichever evaluation the lab chose to highlight, and the headline writes itself. The honest summary is that most of these numbers have stopped measuring what they were designed to measure, and the gap between benchmark performance and real-world capability is now wide enough that the benchmark-led narrative is actively misleading. ...

June 12, 2026 · 14 min · James M
Expertise and work reading path

Expertise and Work in the Age of AI: A Reading Path

TL;DR Start with What Does Expertise Mean When AI Can Pass Any Exam? - the credential crisis Then What It Means to Be Expert in 2030 - where the speculation goes next The through-line: expertise is shifting from reference knowledge to judgement, accountability, and taste Read in order What Does ‘Expertise’ Mean When AI Can Pass Any Exam? What It Means to Be Expert in 2030 The Architect vs The Builder Taste Is the New Scarcity The Automation Paradox Career and pipeline Agent-First Architecture: The Engineer as Curator The Junior Developer Pipeline Problem Will AI Kill Coding Jobs? The Meaning of Work in an Age of Abundance Related Trust series - accountability when agents act on your behalf Securing AI Agents - the liability side of delegated work Related Reading AI Dev Tooling: A Reading Path for 2026 - how the tooling changes the practical skill equation day to day Four Futures: Reading the Signals - the broader economic scenarios these evolving roles exist within The Free Intelligence Era - why intelligence abundance reshapes demand for human expertise The Next Decade of AI - longer-horizon thinking on where expertise and AI diverge or converge

June 12, 2026 · 1 min · James M