DevOps GitHub Projects

Most of what makes a productive DevOps engineer is not hidden inside vendor portals - it lives in open source, on GitHub, and it is free. The projects below are the ones I return to most often, whether for learning, daily tooling, or reference implementations of patterns that would otherwise take weeks to work out alone. DevOps and Site Reliability Engineering (SRE) Resources to calibrate what good looks like in the discipline. ...

May 29, 2023 · 3 min · James M

DevOps Books

A working DevOps engineer draws from several disciplines at once - distributed systems, operating systems, network engineering, software development, and the organisational side that keeps it all moving. The books below are the ones I have either read cover-to-cover or regularly pull off the shelf to reference. I have added a one-line note on why each one is on the list.

Culture and Practice

The hardest problems in DevOps are usually not technical. ...

April 21, 2023 · 3 min · James M

DevOps Blogs

Good engineering blogs are one of the cheapest forms of mentorship available. The posts below are from teams and individuals I return to when I want to see how real organisations solve real problems - outages, scaling walls, migrations, and the occasional cultural mistake.

Vendor and Platform Blogs

These blogs publish architectural deep-dives and reference implementations. They are partly marketing, but the engineering detail is usually genuine.

Atlassian DevOps Blog - practitioner posts on pipelines, incident response, and team topology
AWS DevOps Blog - pipeline patterns, CDK/CodePipeline how-tos, and multi-account guidance
Google Cloud Blog - the SRE-flavoured material that originated at Google
Microsoft DevOps Blog - Azure DevOps, GitHub Actions, and developer platform posts
GitLab Blog - CI/CD and platform engineering content from the GitLab team
HashiCorp Blog - Terraform, Vault, Consul, and Nomad in production

Individual and Community Voices

Ricard Bejarano - SRE at Cisco with sharp posts on minimal container images and infrastructure hygiene
Charity Majors - co-founder of Honeycomb, writing extensively on observability and on-call culture
Julia Evans - illustrated explainers on Linux, networking, and debugging fundamentals
Gergely Orosz - The Pragmatic Engineer - deep dives into how large engineering organisations actually operate
High Scalability - architecture breakdowns of well-known systems

SRE-Specific Reading

Google SRE Books - the foundational texts on SRE as a discipline
Increment Magazine - long-form essays on on-call, incident response, and reliability
Everything DevOps (Reddit) - less polished, but a useful pulse on what practitioners are struggling with this week

How I Use This List

Blog posts age quickly. A Kubernetes best-practices post from 2019 may actively mislead you in 2026. When I read any of these, I check the date first and treat anything older than three years as historical context rather than current guidance. ...

April 21, 2023 · 2 min · James M