AI | jamesm.blog

Learning How to Learn in the Age of AI

TL;DR
- AI removed the information bottleneck on learning - the new bottleneck is whether you actually retain anything
- You can finish tasks with AI and still have outputs without understanding six weeks later
- Rebuild friction: predict before lookup, practice without autocomplete, teach-back sessions, spaced retrieval
- Weekly rhythm: daily retrieval, twice-weekly deliberate practice offline, weekly teach-back, monthly honest review
- The meta-skill of this era is learning in an environment that will happily let you stop
The Problem Nobody Warned You About
For most of history, learning was gated by access. You wanted to understand a topic, you had to find a book, a teacher, a course, or a mentor. The bottleneck was information. If you could get your hands on the material, the rest was time and effort. ...

An AI Tooling Learning Path: Logical Phases for 2026

TL;DR
- The order you learn AI tools matters as much as which tools you learn - most people start with terminal agents or editors before they understand how models actually fail
- The seven-phase path runs: fundamentals, chat interfaces, AI-native editors, terminal agents, local models, orchestration, and review and evaluation
- Terminal agents (Claude Code, Cline, Aider) represent the biggest mindset shift - you move from driving with suggestions to specifying and letting the model execute
- Local models via Ollama belong in phase five, once you have felt the pain of API costs and know which tasks actually need frontier capability
- Review, evaluation, and capture (phase seven) is the phase most developers skip - and the one that separates AI-curious from AI-competent
The hardest part of learning AI tooling in 2026 is not any single tool. It is the order you meet them in. ...

Amazon Doubles Down: The $25 Billion Anthropic Bet

TL;DR
- Amazon announced up to $25 billion in additional investment in Anthropic on April 20, 2026, bringing total committed capital past $33 billion
- In return, Anthropic committed to spending over $100 billion on AWS over the next decade - effectively a closed loop where Amazon’s capital funds Anthropic’s compute bill
- The deal gives Amazon a flagship AI workload to prove out its Trainium custom silicon against Nvidia, while countering Microsoft’s OpenAI advantage on Azure
- For developers building with Claude, expect more capacity, more aggressive pricing on Bedrock, and deeper AWS service integration as the compute comes online
- The arrangement signals that frontier AI has fully consolidated into a small number of hyperscaler-aligned labs - the era of independent AI startups is effectively over
On April 20, 2026, Amazon announced it would invest up to an additional $25 billion in Anthropic, stacking on top of the $8 billion it has already poured into the AI startup over recent years. In return, Anthropic committed to spending more than $100 billion on Amazon Web Services over the next ten years. ...

Hermes Agent: Persistent Autonomy That Learns and Grows

TL;DR
- Hermes Agent by Nous Research is an open-source persistent autonomous system that builds memory across conversations, auto-generates reusable skills from repeated tasks, and compounds in capability over time
- Unlike stateless agents, Hermes accumulates project context - learning codebase quirks, team conventions, and recurring workflows so it stops asking questions it has already answered
- It works across Telegram, Discord, Slack, WhatsApp, Signal, Email, and CLI - meeting teams on the platforms they already use rather than requiring a dedicated app
- Running cost is roughly $20 to $60 per month for a solo developer (a $5-$10 VPS plus LLM API calls); it is MIT licensed with no seat fees or vendor lock-in
- The honest trade-off: Hermes beats alternatives on persistence and learning depth, but raises open questions about memory scaling, skill auditing, and what happens when an agent learns something wrong
Most AI agents are forgettable. You ask them to do something, they do it, you close the window. The next time you need help, they start from zero - no context, no learning, no continuity. Hermes Agent works differently. Nous Research built it as a persistent system that remembers what it learns and gets measurably more capable the longer it runs. ...

MacWhisper vs Wispr Flow vs Superwhisper: The 2026 Dictation Stack Compared

TL;DR
- MacWhisper is a file transcription tool (audio in, text out) that runs entirely on-device - the right pick for journalists, researchers, and anyone transcribing recordings
- Wispr Flow is the easiest system-wide dictation option, with AI-powered prose cleanup and cross-platform sync, but it sends audio to the cloud with no on-device option
- Superwhisper matches Wispr Flow’s system-wide dictation but processes audio locally, with bring-your-own-key LLM cleanup and deep customisation for power users
- The core decision is simple: if your audio can leave your machine, use Wispr Flow; if it must stay local, use Superwhisper; if you just need transcription, use MacWhisper
- The real product differentiation is no longer the underlying Whisper model - it is hotkey ergonomics, auto-edit prompts, and workflow integration
Voice input on the Mac used to mean fighting with the built-in Dictation feature or paying Nuance a small fortune. In 2026, the landscape looks completely different. A handful of indie and venture-backed apps have turned Whisper-class models into genuinely fast, accurate tools that sit quietly in your menu bar until you hold a hotkey. ...

Claude Opus 4.7 Lands on Databricks: Enterprise Reasoning Meets the Lakehouse

TL;DR
- Databricks has made Claude Opus 4.7 available on the platform, days after the model’s 16 April 2026 release across the Anthropic API, Bedrock, Vertex AI, and Foundry
- Databricks’ own benchmarking shows 21% fewer errors than Opus 4.6 on OfficeQA Pro, its internal benchmark for agentic reasoning over business documents
- The model is exposed through three surfaces: built-in SQL and Python functions, Lakeflow Declarative Pipelines, and Agent Bricks, where it is now the recommended reasoning model
- Unity Catalog governance, lineage tracking, and audit logging apply to every call - data never leaves the governed boundary
- Pricing is unchanged at $5 per million input tokens and $25 per million output tokens
- The bigger story is distribution: Claude is now a first-class model inside all four major enterprise data planes
Databricks announced this week that Anthropic’s Claude Opus 4.7 is now live on the platform. The headline from Databricks’ own benchmarking is the part worth pausing on - 21% fewer errors than Opus 4.6 on the OfficeQA Pro document-reasoning benchmark when the model is grounded in source information. ...
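The per-token pricing quoted in this excerpt translates into a per-call cost with simple arithmetic. The sketch below is illustrative only: the two prices come from the excerpt, while the function name and the example token counts are assumptions made up for the illustration, and real bills may include platform charges not modelled here.

```python
# Prices as quoted in the excerpt (USD per million tokens).
INPUT_PRICE_PER_MILLION = 5.00
OUTPUT_PRICE_PER_MILLION = 25.00


def call_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one call from its token counts.

    Hypothetical helper: it only applies the two list prices above and
    ignores any discounts, caching, or platform fees.
    """
    input_cost = input_tokens * INPUT_PRICE_PER_MILLION
    output_cost = output_tokens * OUTPUT_PRICE_PER_MILLION
    return (input_cost + output_cost) / 1_000_000


# Hypothetical document-reasoning call: a large grounded prompt, a short answer.
example_cost = call_cost_usd(input_tokens=200_000, output_tokens=20_000)
print(f"${example_cost:.2f}")
```

Because output tokens cost five times as much as input tokens at these prices, a short answer over a long grounded prompt can still be dominated by the input side, as it is in this made-up example.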

AI Cloud Subscriptions: Comparing Pricing and Features in 2026

AI cloud subscriptions have fragmented into a crowded market. Frontier-lab APIs compete with open-weights challengers, consumer chat plans compete with agent platforms, and every provider is reshuffling model tiers every few months. This guide organizes the 2026 landscape so you can pick a plan without reading six pricing pages. For background on how these costs behave over time, see Token Economics: Why Costs Aren’t Going Down and Local vs Cloud AI in 2026. ...

DGX Spark vs Mac Studio: Which Personal AI Supercomputer Should You Buy?

TL;DR
- Best value: Mac Studio M4 Max at $1,999 for most local LLM work
- Best prefill speed: DGX Spark at $4,699 (3.8× faster prompt processing)
- Best token generation: Mac Studio M3 Ultra at $3,999 (819 GB/s bandwidth)
- Best for fine-tuning: DGX Spark (CUDA ecosystem wins)
- Best combined setup: DGX Spark + M3 Ultra = 2.8× faster than either alone
Introduction
The market for personal AI supercomputers has exploded in 2025-2026. Two standout options have emerged: NVIDIA’s DGX Spark and Apple’s Mac Studio lineup. Both promise desktop-scale AI compute, but they approach the problem very differently. This guide breaks down the specs, costs, and real-world performance to help you decide which is right for you. ...
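One way to see why memory bandwidth is the figure quoted for token generation: under the common rule of thumb that decoding is memory-bound and each generated token reads every model weight once, bandwidth divided by model size gives a rough ceiling on tokens per second. The sketch below assumes that rule of thumb; the 819 GB/s figure is from the excerpt, while the function name and the 40 GB model size are hypothetical. Real throughput is typically lower, and the estimate ignores KV-cache reads, sparsity, and batching.

```python
def decode_ceiling_tokens_per_second(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Rough upper bound on single-stream decode speed.

    Assumes generation is memory-bound and that producing each token
    requires streaming all model weights from memory exactly once.
    """
    if model_size_gb <= 0:
        raise ValueError("model_size_gb must be positive")
    return bandwidth_gb_s / model_size_gb


# 819 GB/s is the M3 Ultra bandwidth quoted above; 40 GB is a made-up
# footprint for a quantised model, used only for illustration.
ceiling = decode_ceiling_tokens_per_second(819, 40)
print(f"~{ceiling:.1f} tokens/s ceiling")
```

Prompt processing (prefill) is usually compute-bound rather than bandwidth-bound, which fits the excerpt's split between a prefill winner and a token-generation winner.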

The Complete AI Developer's Guide: Resources and Best Practices

TL;DR
- Prompt engineering, token efficiency, and structured outputs are the core skills for working effectively with any AI model
- System design patterns - streaming, caching, structured outputs, graceful fallbacks - matter as much as prompting fluency
- Testing and validation in AI systems requires clear evaluation criteria and production monitoring, not just pre-launch checks
- Official documentation from model providers (Anthropic, OpenAI, Google) is the most reliable source of best practices
- The curated resources table covers everything from GitHub Copilot to local model deployment with Ollama
Most AI tutorials teach you how to get started. Few teach you how to get it right. This post curates the most valuable resources and practices for working effectively with modern AI systems - from prompt engineering fundamentals through to production system design and evaluation. ...

The Token Efficiency Mindset - Why Your Claude Conversations Cost More Than They Should

TL;DR
- Token costs don’t scale linearly with productivity - the context window compounds with every follow-up message, so a five-message conversation can cost 2-3x more than one well-structured request
- Compression is your biggest lever: cutting a prompt in half before sending it reduces cost and often improves answer quality by removing noise
- Batch tasks that share context together; don’t batch unrelated tasks - real batching spreads the setup cost across related work
- Build reusable systems (templates, project files, prompt prefixes) instead of solving the same problem repeatedly and paying the context cost each time
- Prompt caching can cut input token costs by 80-90% on workloads with stable prefixes - the single biggest structural saving most teams are missing
If you’re paying attention to your Claude usage, you’ve probably noticed something: your token bills don’t scale linearly with your productivity. Sometimes a conversation that feels quick costs three times more than expected. Other conversations that took hours feel suspiciously cheap. ...
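The compounding effect described here can be sketched with a toy model. It assumes every request re-sends the full conversation history as input and that nothing is cached; the function name and all token counts are hypothetical, chosen only to show the shape of the growth, not to reproduce the 2-3x figure.

```python
def conversation_input_tokens(turns):
    """Total input tokens billed across a conversation.

    turns: list of (user_tokens, reply_tokens) pairs, one per exchange.
    Assumes each request carries the entire prior history as input.
    """
    total_input = 0
    history = 0
    for user_tokens, reply_tokens in turns:
        history += user_tokens    # the new message joins the context
        total_input += history    # the whole context is sent as input
        history += reply_tokens   # the reply is re-sent on later turns
    return total_input


# Hypothetical sizes: five short exchanges vs. one well-structured request.
five_messages = [(400, 300)] * 5
one_request = [(2000, 1500)]

print(conversation_input_tokens(five_messages))
print(conversation_input_tokens(one_request))
```

In this toy setup both conversations contain the same 2,000 tokens of user text, but the five-message version is billed for far more input because earlier messages and replies are re-sent each turn. The actual cost multiple depends on message sizes, output pricing, and whether prompt caching applies.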