<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Reliability on jamesm.blog</title>
    <link>https://jamesm.blog/tags/reliability/</link>
    <description>Recent content in Reliability on jamesm.blog</description>
    <image>
      <title>jamesm.blog</title>
      <url>https://jamesm.blog/papermod-cover.png</url>
      <link>https://jamesm.blog/papermod-cover.png</link>
    </image>
    <generator>Hugo</generator>
    <language>en</language>
    <lastBuildDate>Fri, 15 May 2026 06:30:00 +0100</lastBuildDate>
    <atom:link href="https://jamesm.blog/tags/reliability/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>The Agent Reliability Problem: Debugging Non-Deterministic Systems</title>
      <link>https://jamesm.blog/ai/agent-reliability-debugging-non-deterministic/</link>
      <pubDate>Fri, 15 May 2026 06:30:00 +0100</pubDate>
      <guid>https://jamesm.blog/ai/agent-reliability-debugging-non-deterministic/</guid>
      <description>AI agents that work in the demo and fail in production are the standard story of the last two years. A practical look at why agent reliability is genuinely hard, what patterns have emerged for handling it, and which testing approaches actually work.</description>
    </item>
    <item>
      <title>AI Agents That Actually Work: Patterns From Real Projects</title>
      <link>https://jamesm.blog/ai/ai-agents-that-actually-work/</link>
      <pubDate>Fri, 01 May 2026 08:00:00 +0100</pubDate>
      <guid>https://jamesm.blog/ai/ai-agents-that-actually-work/</guid>
      <description>The patterns that separate agents that ship and stay shipped from the ones that demo well and fall over the moment they meet a real workload.</description>
    </item>
    <item>
      <title>AI Safety From First Principles: What Actually Matters vs What&#39;s Hype</title>
      <link>https://jamesm.blog/ai/ai-safety-first-principles/</link>
      <pubDate>Thu, 30 Apr 2026 08:00:00 +0100</pubDate>
      <guid>https://jamesm.blog/ai/ai-safety-first-principles/</guid>
      <description>Stripping the AI safety conversation down to its load-bearing parts, separating the engineering problems that matter today from the speculative scenarios that absorb most of the attention.</description>
    </item>
    <item>
      <title>AI Hallucinations: Understanding and Mitigating False Outputs</title>
      <link>https://jamesm.blog/ai/ai-hallucinations-understanding-and-mitigating/</link>
      <pubDate>Tue, 28 Apr 2026 00:02:00 +0100</pubDate>
      <guid>https://jamesm.blog/ai/ai-hallucinations-understanding-and-mitigating/</guid>
      <description>What hallucinations actually are, why the word is a slightly misleading import from psychiatry, why no model will ever stop producing them entirely, and the small set of techniques that genuinely move the dial on false outputs in production.</description>
    </item>
    <item>
      <title>AI Reliability Is Weird: Why Testing LLMs Breaks Everything You Know</title>
      <link>https://jamesm.blog/ai/ai-reliability-is-weird/</link>
      <pubDate>Thu, 09 Apr 2026 22:00:00 +0000</pubDate>
      <guid>https://jamesm.blog/ai/ai-reliability-is-weird/</guid>
      <description>Traditional testing methods are failing in the age of autonomous AI agents. We need a new approach to ensure reliability when the &amp;#39;builder&amp;#39; is non-deterministic.</description>
    </item>
  </channel>
</rss>
