• Artificial Intelligence (LLMs, AI agents, and the future of human expertise)
  • Blockchain (Decentralized infrastructure, networks, and ecosystem evolution)
  • Data Engineering (Building data infrastructure that actually scales)
  • Data Science (Graph algorithms, network analysis, and statistical methods)
  • DevOps (Infrastructure, automation, and operational philosophy)
  • General (Culture, science, and the miscellaneous)
  • Retro Computing (The machines and culture that shaped computing)
  • Music Production (Gear, sound design, and creative workflow)
  • Personal Development (Expertise, craft, and the engineering mindset)
  • Security (Threat modeling, cryptography, and systems that resist attack)
  • Software Engineering (System design, languages, and the craft of code)
  • Space (Infrastructure and vision for human expansion beyond Earth)
Cursor iOS app launching coding agents from a phone

Cursor on iOS: When the Code Editor Becomes a Remote Control

TL;DR On June 29, 2026, Cursor released a native iOS app in public beta, available on all paid plans, for iPhone and iPad You can launch cloud agents from your phone - pick a repo, describe the task by voice or text, use slash commands, choose a frontier model, and let an agent run in an isolated VM Remote Control lets you take an agent already running on your desktop and keep steering it from your phone, with an option to keep the machine awake while you’re away Live Activities put agent status on your lock screen; you get push notifications, can review demos, screenshots and logs, inspect diffs, and merge pull requests without opening a laptop A launch promo gives 75% off Composer 2.5 runs in the mobile app through July 5, 2026 This lands months after SpaceX’s move on Cursor - and reframes the editor as an orchestration surface rather than a place you type code I’ve written about Cursor enough times on this blog that a phone app could have been a footnote. It isn’t. Not because the app itself is revolutionary - it’s a well-made mobile client - but because of what it quietly admits about how the work has changed. For most of software history, the editor was where you sat and typed. Cursor’s iOS app is built on the assumption that you mostly aren’t typing anymore. You’re directing. ...

June 29, 2026 · 8 min · James M
Five archetypes for a post-role team

Five Archetypes for a Post-Role Team

TL;DR Boris Cherny, who built Claude Code at Anthropic, posted a short framing: as engineering, product, design, and data science melt into one role, he sees five archetypes on his team The five are Prototyper, Builder, Sweeper, Grower, and Maintainer - and crucially, none of them map cleanly to a job title The interesting claim is not the list, it is the decoupling: the archetype is a description of what energy you bring to a system, not what your contract says you do I think the framing is genuinely useful as a self-diagnostic, and quietly radical for how teams get staffed and rewarded Where it leaves me unsure: it describes a steady-state team that already exists, and says less about how you grow people into these shapes, or what happens to the people who do not fit any of them A short post on X has been rattling around my head for a few days. Boris Cherny, who built Claude Code at Anthropic, was reflecting on what happens to roles when the old functional boundaries stop meaning much. His observation: when he looks at the Claude Code team, he does not really see engineers, designers, PMs, and data scientists. He sees five archetypes that cut across all of them. ...

June 29, 2026 · 14 min · James M
OpenAI IPO filing and ChatGPT market share falling below 50% for the first time

The $2.22 Problem: OpenAI's IPO and the First Crack in the ChatGPT Monopoly

TL;DR On June 8, 2026, OpenAI filed a confidential S-1 with the SEC, targeting a September 2026 public listing with Goldman Sachs and Morgan Stanley as underwriters The private valuation sits at $852 billion, with analysts projecting a debut above $1 trillion - one of the five largest IPOs in US history The same week, ChatGPT’s market share fell below 50% for the first time - to 46.4%, with Gemini at 27.7% and Claude at 10.3% OpenAI’s Q1 2026 non-GAAP operating margin was negative 122%: it spends $2.22 for every dollar it earns Noam Shazeer - co-author of Attention Is All You Need and the AI talent Google paid $2.7 billion to retain in 2024 - just left Google to join OpenAI Anthropic filed its own S-1 a week earlier, on June 1, targeting October, at a $965 billion valuation - the two biggest AI labs are racing to Wall Street simultaneously The timing is almost too perfect to be coincidence - and yet it is. On June 8, 2026, OpenAI submitted a confidential S-1 registration with the SEC, beginning the legal process toward a public listing. The same week, for the first time since ChatGPT launched in November 2022, OpenAI’s flagship product held less than half of the global AI assistant market. The company is going to Wall Street at the precise moment it is no longer the only name in the room. ...

June 27, 2026 · 10 min · James M
SpaceX acquires Cursor AI code editor

SpaceX's $60 Billion Cursor Acquisition: Why It Matters

TL;DR SpaceX filed a $60 billion all-stock acquisition of Cursor on June 16, 2026 - marking one of the largest AI/developer tools acquisitions ever (confirmed via SEC filing) Cursor’s revenue metrics are impressive: ~$4 billion annualized revenue with $2.6 billion from enterprise customers, suggesting strong product-market fit Strategic pivot: SpaceX is moving beyond rockets and satellites into the software infrastructure layer that powers AI development itself Signal to the market: This acquisition suggests major tech companies are betting heavily on owning the entire stack - from hardware to the tools developers use to build AI systems Enterprise focus: The majority of Cursor’s revenue coming from enterprise (65%) indicates this is a B2B infrastructure play, not just a consumer developer tool Why SpaceX Acquiring Cursor Matters On the surface, it might seem odd that a company known for rockets and space exploration would acquire an AI code editor. But this acquisition reveals something fundamental about how the largest technology companies are thinking about AI development infrastructure. ...

June 16, 2026 · 5 min · James M
Evaluating agents in production with trajectory metrics

Evaluating Agents in Production: Trajectory Metrics, Not Just Final Answers

TL;DR Endpoint evals miss the failure mode that hurts in production - an agent can reach the right answer through a reckless path: wrong tool first, lucky recovery, ignored constraints that did not bite this time Trajectory evaluation scores the run: which tools were called, in what order, with what arguments, and whether each step satisfied policy The minimum viable setup: 50–200 real examples, per-step rubrics, 10+ runs per example, statistical regression tracking, and a held-out set you never tune against Replay harnesses let you re-run a captured trace against a new model or policy without re-hitting production systems This is the measurement layer that connects broken public benchmarks to agent security - you cannot harden what you cannot observe AI Evals Are Broken argued that leaderboard numbers stopped measuring production capability. Securing AI Agents argued that the tool layer must enforce policy the model cannot be trusted to enforce. This post is the bridge: how you measure whether an agent actually behaves before and after you ship. ...

June 14, 2026 · 6 min · James M
World Models - What Comes After the Language-Only Era Banner

World Models: What Comes After the Language-Only Era

TL;DR Language-only models do not contain a reliable simulator of physical reality - they contain a statistical shadow of one, good enough for many tasks and dangerously wrong for others. A world model is a system that learns to predict how an environment evolves and can plan inside that prediction - not just describe it in text. The gap matters for agents that must act in physical space, manipulate objects, or reason about counterfactuals where the answer is not in the training corpus. The 2026 frontier includes generative world simulators, vision-language-action models for robotics, and sim-to-real pipelines - not one breakthrough but a stack assembling in parallel. For builders today: language agents with MCP tools are the right architecture for knowledge work. World models are the path to agents that can competently act in the physical world. Almost everything I have written about AI agents assumes a model whose understanding of the world arrives through text. That assumption has carried the field a long way. Context engineering, tool use via MCP, memory across sessions - all of it sits on top of language models that read, reason, and call APIs. ...

June 13, 2026 · 9 min · James M
Government directive to suspend Fable 5 and Mythos 5 access

Pulled From The Shelf: The Government Order to Suspend Fable 5 and Mythos 5

TL;DR On 12 June 2026 at 5:21pm ET, the US government issued an export control directive ordering Anthropic to suspend all access to Fable 5 and Mythos 5 - globally, for every user, including Anthropic’s own employees The stated reason is national security: the government believes it has identified a method of jailbreaking Fable 5. Anthropic says the evidence was verbal only and describes a narrow, non-universal technique - essentially asking the model to read a codebase and fix software flaws Anthropic reviewed a demonstration and found it surfaced a small number of previously known, minor vulnerabilities that are widely available from other models Anthropic disagrees that a narrow jailbreak justifies recalling a commercial model deployed to hundreds of millions of people, and warns the same standard would “essentially halt all new model deployments for all frontier model providers” All other Anthropic models are unaffected. The company says it believes this is a misunderstanding and is working to restore access Four days. That is how long Mythos-class capability lasted as a publicly available product before the US government ordered it off the shelf. ...

June 13, 2026 · 10 min · James M
Inside Anthropic Bloomberg The Circuit Documentary Banner

Inside Anthropic: What The Bloomberg Documentary Reveals

TL;DR Bloomberg’s The Circuit with Emily Chang went inside Anthropic in a rare, in-depth episode released June 10, 2026. Dario and Daniela Amodei discuss the founding story, the Pentagon dispute, and why they say safety and commercial success are the same bet. Anthropic is now valued at $965 billion, eclipsing OpenAI’s $852 billion for the first time, after an 80-fold revenue surge in Q1 2026. The Pentagon story is not PR - Anthropic refused to remove safety guardrails from its military contract, was blacklisted by the Trump administration, and sued. A federal judge sided with Anthropic. A confidential S-1 IPO filing in June 2026 means this stops being a private company conversation soon. The Bloomberg Documentary: Emily Chang Inside Anthropic Bloomberg’s The Circuit has done this kind of access piece before - Zuckerberg, Musk, Jensen Huang. But the Anthropic episode feels different in tone. Emily Chang is not sitting across from a founder who has already won. She is sitting across from two founders in the middle of one of the most consequential moments in the company’s short history: record valuation, Pentagon litigation, IPO on the horizon, and model releases arriving fast enough that the competitive landscape changes every few months. ...

June 12, 2026 · 7 min · James M
Policy on the AI Exponential Banner

Policy on the AI Exponential: Dario Amodei's Case for Acting While the Window Is Open

Dario Amodei has published a new essay, Policy on the AI Exponential, and it reads like the third act of a trilogy. Machines of Loving Grace made the case for what powerful AI could give us. The Adolescence of Technology catalogued what could go wrong. This one is about the machinery in between - the laws, agencies, and international arrangements that will decide which of those two essays turns out to be the better prediction. ...

June 11, 2026 · 8 min · James M
When Machines Stop Speaking Our Language Banner

When Machines Stop Speaking Our Language - Binary Agents and the End of Compilers

TL;DR When two AI agents talk to each other in English, they are doing something faintly absurd: serialising rich internal state into a lossy human language, transmitting it, and decoding it back. English between machines is a compatibility layer, not a natural medium. Machines have already shown they will drop that layer the moment we let them - negotiation bots drifting out of English in 2017, agents switching to sound-based data protocols in 2025, and research systems now sharing internal model state directly with no language in between. The same logic applies to programming languages. Python and Rust exist for human readers. If agents write, maintain, and consume the software, the human-readability requirement quietly disappears - and with it, eventually, the need for source code and compilers as we know them. I do not think compilers vanish so much as sink. Like assembly, the layers below us stop being something humans write or read, while the guarantees they provide get absorbed into the agents’ toolchain. The part worth worrying about is not efficiency, it is legibility. Human language and human-readable code are our audit trail into what machines are doing. This is all speculation on my part, and I sketch where I think the line should be held. Human Language Is a Compatibility Layer Think about what actually happens when two AI agents have a conversation in English today. ...

June 10, 2026 · 11 min · James M