This section is organised around one question: what has to be true before you can trust AI to do real work? Reliability, context, economics, security, evaluation, and eventually physical action - each post is a different angle on the same problem.

Start here

I want to build

I want context

Resources

Link indexes and tool directories - useful for discovery, not the narrative spine:

Junior Developer Pipeline Problem Banner

The Junior Developer Pipeline Problem: Where Do Tomorrow's Seniors Come From?

TL;DR The work AI now automates - boring tickets, bug hunts, boilerplate - was the unspoken apprenticeship that turned juniors into seniors The skills that work built (pattern recognition, systems intuition, taste, calibration) are built by doing, not by reading - and that doing is now cheapest to delegate The new apprenticeship shifts toward reading over writing, debugging agent output, earlier architectural decisions, and deliberate practice of things agents do badly There is a coordination problem: individual organisations rationally skip junior investment in the short term, but the senior pipeline thins industry-wide a few years later If you are starting out today, optimise for proximity to a great senior engineer above salary, title, or any other variable The views in this post are my own personal reflections on the industry as a whole, written in my own time. They are not about any specific employer, team, or colleague, past or present. ...

March 12, 2026 · 11 min · James M
Learning to buy intelligence as infrastructure

We Are Learning to Buy Intelligence

TL;DR For most of history, usable intelligence - the kind that solves complex problems - required hiring expensive specialists or spending years acquiring expertise yourself Research shows the cost of running AI capability has been falling roughly an order of magnitude every one to two years, making intelligence increasingly affordable Intelligence is becoming infrastructure - like electricity or compute, available on demand through APIs rather than locked inside individuals or institutions When intelligence is cheap and abundant, creativity becomes the limiting factor, not knowledge, credentials, or access to experts This democratisation is extraordinary, but the question of how we deploy these tools wisely matters as much as the capability itself For most of human history, intelligence has been scarce. Not intelligence in the biological sense - people have always been clever - but usable intelligence. The kind that helps you design a system, debug a problem, write code, plan a strategy, analyse data, or turn a vague idea into something real. ...

March 11, 2026 · 5 min · James M
OpenClaw - AI agent framework for computer interfaces

OpenClaw Is Absolutely Wild

TL;DR OpenClaw is an open-source AI agent framework that enables language models to operate software directly through computer interfaces - clicking, typing, and navigating the same way a human does Unlike chatbots that only respond to prompts, OpenClaw acts as an operator - automating any software without requiring custom APIs or integrations This makes legacy enterprise software, complex dashboards, and multi-application workflows instantly automatable using computer vision and reasoning models Because it uses reasoning models rather than fixed scripts, it can adapt to unexpected states and recover from mistakes - closer to digital labor than traditional automation This represents a shift in computing: software that can build, run, and manage other software, driven by open projects improving rapidly every month Every now and then a piece of technology appears that quietly changes the rules. Not in a loud marketing way. Not with a huge product launch. Just a project sitting on GitHub that makes you stop, stare at the screen for a second, and think: ...

March 10, 2026 · 4 min · James M
Claude Code multi-agent code review feature

Claude Code Just Got a Serious Code Review Feature

TL;DR Claude Code’s new Code Review feature dispatches multiple AI agents in parallel to review a PR from different angles, rather than running a single shallow model pass over the diff The motivation is real: Anthropic’s internal code output per engineer increased by around 200%, making human review the bottleneck - and humans consistently miss subtle bugs on large diffs Multi-agent review cross-checks findings, filters false positives, and ranks issues by severity before posting a clean, high-signal review comment plus inline annotations Review depth scales with PR size; typical runs take about 20 minutes and cost $15 - $25, which is cheap compared to the cost of a production bug Humans still approve PRs - the tool’s role is a thorough pre-review pass, not automated sign-off, making it a complement to human judgment rather than a replacement I genuinely think a lot of people still underestimate how fast the AI developer tooling ecosystem is evolving. ...

March 9, 2026 · 5 min · James M
Hybrid AI stack for developers hitting Claude Code limits

Hitting Claude Code Limits? Here’s the Setup I’m Moving Toward

TL;DR Hitting Claude Code Pro usage limits does not mean upgrading to the $200/month plan - a hybrid AI stack is a smarter and cheaper alternative The tiering strategy: local models (free) for quick edits, cheap cloud APIs for general coding, and frontier models only for architecture or complex multi-file reasoning Tools like Ollama or LM Studio with coding models such as DeepSeek Coder or Qwen2.5 handle the majority of everyday tasks locally at no cost Cheap cloud inference providers (Groq, Together AI, DeepInfra) offer capable open models at fractions of a cent per session for heavier work A realistic usage split of 80% local / 15% cheap APIs / 5% frontier models dramatically reduces limit burn while keeping Claude available when it genuinely matters I keep running into the same problem with Claude Code Pro ($20/month): I burn through the usage limits faster than I expect. The obvious solution is upgrading to the $200/month plan, but that feels excessive for how I actually use it. ...

March 9, 2026 · 4 min · James M
GitHub Spec Kit - spec-driven development with AI

GitHub Spec Kit and the Rise of Spec-Driven Development (SDD) 🤯

TL;DR GitHub Spec Kit is a structured framework of version-controlled markdown files (spec.md, constitution.md, boundaries.md, etc.) that serve as the single source of truth for a software project Spec-Driven Development (SDD) means writing the specification first, then generating and refactoring code in alignment with it - preventing architectural drift over time Integrating Spec Kit with Cursor AI turns the spec from a static document into an active constraint the AI understands and respects The spec-first loop (define, implement, refine, repeat) creates development that is clearer, faster, and easier to maintain than ad-hoc planning SDD is especially powerful for long-term projects and large teams where shared mental models and consistent architecture matter most Spec-Driven Development is starting to reshape how modern software is planned, built, and maintained. Among the tools pushing this shift forward, GitHub Spec Kit stands out as one of the clearest, cleanest ways to bring structure and intention into your workflow. It turns the usual chaos of planning into something organised, navigable, and repeatable - and when combined with AI-powered editors like Cursor, it becomes even more powerful. ...

December 3, 2025 · 4 min · James M
Cursor AI spec-driven development workflow

Cursor AI, Spec-Driven Magic, and Why My Entire Development Workflow Just Leveled Up 🤯

Reading path: For the canonical stack essay, start with AI Dev Tooling and What Actually Belongs in My AI Dev Stack in 2026. TL;DR Cursor AI is an AI-native editor that reads your repo with architectural awareness, reasons across files, and turns complex refactors into simple conversations Integrating GitHub Spec Kit (spec.md, constitution.md, acceptance criteria) gives Cursor a structured foundation it treats as living, authoritative constraints The combined workflow creates a tight loop: refine the spec, ask Cursor to implement, update the spec, generate more code - documentation and code feed each other in real time Key benefits include automatic consistency between spec and code, safer large-scale refactors, and faster onboarding for new contributors These tools don’t replace developers - they eliminate friction between thought and execution, letting you think at a higher level Every so often a tool appears that doesn’t just streamline your workflow - it rewires the way you think about building software. Cursor AI has done exactly that. After years of bouncing between editors, IDEs, extensions, and automation layers, nothing has delivered the same sense of “this is the future of development” as Cursor. ...

December 3, 2025 · 3 min · James M
Human in-demand jobs after AI

Top 5 Human In-Demand Jobs in 10 Years

TL;DR When AI handles execution, the jobs that survive are those built on judgement, empathy, embodied skill, accountability, and taste - none of which AI can fully replicate The five durable categories are: human relationship professionals, AI wranglers and system architects, skilled trades, creative producers, and human trust and accountability roles Skilled trades (plumbers, electricians, HVAC engineers) are particularly resilient because general-purpose robots that handle unpredictable physical environments remain a hard unsolved problem AI wranglers - people who set objectives, constraints, and guardrails for AI systems - are a new and growing category driven by regulation like the EU AI Act and NIST’s AI Risk Management Framework The meta-pattern is interpretation vs execution: AI excels at execution, so high-value humans are those who bring judgment and responsibility to the question of what should be built and why Assume AI is “everywhere” - what still needs actual humans? ...

November 27, 2025 · 6 min · James M
Sergey Brin - Google Co-Founder

Sergey Brin Interviews

Most founders of Sergey Brin’s vintage and net worth do not come back to write code. Brin did. He stepped away from Google in 2019, and when the frontier of AI started moving faster than anyone expected, he returned in 2023 to work hands-on on Gemini - by his own account because staying retired through this particular moment in computing would have been a mistake. That makes his public commentary an unusually direct read on how Google sees the race, and it is why I keep this page. It is a growing, chronological index of his interviews, talks, and appearances, with enough context around each to know what you are clicking into. ...

May 20, 2025 · 7 min · James M
AI Agents Emergency Debate - jobs and the future of work

AI Agents Emergency Debate

TL;DR An “emergency debate” framing the case that AI agents will displace large parts of the workforce inside a 24-month horizon Contributors disagree on speed but agree the direction is settled - the question is which sectors move first, not whether they move Near-term pressure is on roles built around predictable, repeatable cognitive work; durable roles cluster around judgment, taste, and accountability Education and training systems are slower to adapt than the technology, which creates a real workforce mismatch in the meantime Worth watching as a snapshot of the 2025 conversation - useful frame even where you disagree with the specific predictions About This debate explores urgent questions about AI’s impact on employment and the workforce. Contributors discuss the timeline for AI-driven job displacement and the societal preparations needed to adapt to rapid automation. ...

May 12, 2025 · 2 min · James M