Stream vs Batch Processing Banner

Real-Time Data Processing: Stream Processing vs Batch Processing

If you spend enough time in data engineering, you will eventually encounter the conviction that batch processing is dying and streaming is the future. This is the third or fourth time the industry has had this conversation in my career, and the answer has been the same every time. Streaming is not the future. Batch is not the past. They are different tools with different operational profiles, and the systems that age well use both, with discipline about which is the right choice for which problem. ...

May 10, 2026 · 9 min · James M
Multimodal AI in 2026 Banner

Multimodal AI in 2026: Vision + Text + Audio - What's Actually Useful

TL;DR Document understanding is the unglamorous killer application - invoices, contracts, and scanned PDFs that were painful to extract data from are now tractable without dedicated pipelines Vision models still under-deliver on precise spatial reasoning, object counting, and subtle medical or scientific imagery - these remain jobs for specialist models Audio is the modality with the most upside: beyond transcription, it carries tone, pace, and hesitation that text loses, enabling fault detection, emotional analysis, and richer inputs The teams getting real value treat multimodal as an invisible enabling capability within a workflow, not a feature to demo - and they verify high-stakes outputs just as they would text The right question when evaluating multimodal is not “can we use this” but “what specific user problem becomes tractable that previously was not” When the first multimodal frontier models shipped, the demos were genuinely impressive. A photo of a fridge interior with the model suggesting a recipe. A handwritten napkin sketch becoming working code. A short audio clip of a meeting being transcribed, summarised, and structured. It looked, briefly, like the boundary between modalities had collapsed and we were entering a new regime in which models could reason fluidly across text, images, and sound. ...

May 9, 2026 · 10 min · James M
The Modern Lakehouse Stack Banner

The Modern Lakehouse Stack: What Actually Belongs in Production

The word “lakehouse” has been doing a lot of work for the last five years. It has been used to describe everything from a thin SQL layer over object storage to a fully integrated platform with governance, lineage, ML training, and BI built on top. Like most umbrella terms, this elasticity has been useful for marketers and confusing for engineers. This post is the version of the conversation I would have with a senior engineer who has been asked to “build out our lakehouse” and wants to know which pieces are load-bearing and which are noise. It draws on what I have actually seen ship and survive in production data platforms in 2026, and it tries to be specific about why each layer is in the stack rather than just describing the picture as a fait accompli. ...

May 8, 2026 · 9 min · James M
Scott Galloway on AI - The Marketing Professor's Case That the Rich Don't Need You Anymore Banner

Scott Galloway on AI: The Marketing Professor's Case That the Rich Don't Need You Anymore

Scott Galloway is the kind of commentator the AI conversation rarely produces: not a researcher, not a founder, not a doomer, not a booster. He is a marketing professor and a serial entrepreneur with a record of correctly reading the corporate stories of the last two decades, and he has spent the last two years pointing at the AI story with increasing concern. The headline of his pitch - that AI was not built for ordinary people and that the rich no longer need them - is provocative on purpose. The argument underneath is more careful, and worth pulling apart on its own terms. ...

May 4, 2026 · 14 min · James M
Hybrid Systems Montage MC-707 Banner

Hybrid Systems: Montage + MC-707 Architecture and Workflow

The Yamaha Montage M and the Roland MC-707 are both, on paper, complete instruments. The Montage is a flagship synth workstation with three distinct sound engines and the kind of polyphony and DSP headroom that makes most studio plugins look slow. The MC-707 is a compact groovebox with eight tracks, an internal sequencer, sample playback, and the kind of immediate hands-on workflow that makes laptop production feel laborious by comparison. ...

May 4, 2026 · 9 min · James M
Yamaha Montage M Six Months In Banner

The Yamaha Montage M: 6 Months In Real World Usage

A six-month review is a different beast from a release-day one. The honeymoon is over. The early enthusiasm has cooled. The features that demoed well in the showroom have either earned their place in your daily workflow or quietly been abandoned, and the features you initially overlooked have either continued to be irrelevant or become indispensable. This is a six-month review of the Yamaha Montage M, the M8X variant specifically, from the perspective of someone using it as the centrepiece of a hybrid hardware rig rather than a stage instrument or a sound-design lab. The conclusions are mine, the use case is specific, and your mileage will genuinely vary, but the patterns I have noticed are likely to repeat across other working setups. ...

May 4, 2026 · 9 min · James M
ETL Tools and Data Integration

ETL Tools & Data Integration Platforms

What is ETL? ETL is a foundational data engineering process that powers modern analytics: Extract - Retrieve data from various sources (databases, APIs, files, cloud services, streaming platforms) Transform - Clean, validate, deduplicate, and reshape data into required data models Load - Move processed data into data warehouses, data lakes, or analytical systems ETL ensures data quality, consistency, and accessibility for analytics and reporting. In 2026 the dominant pattern is ELT (Extract-Load-Transform), which leverages cloud data warehouse compute for transformation, and increasingly EtLT (adding lightweight pre-load transforms for streaming and schema drift). See the Fundamentals of Data Engineering book for a deeper framing. ...

May 4, 2026 · 9 min · James M
The State of Blockchain in 2026 Banner

The State of Blockchain in 2026

TL;DR The blockchain industry in 2026 is no longer arguing about whether it has a future. The arguments are about which layers do which jobs. Bitcoin remains the reserve asset and the most credible neutral settlement layer. Ethereum is the dominant smart-contract base layer, with most activity now happening on its Layer 2s. Solana has taken the high-throughput application crown. Polkadot is mid-pivot from infrastructure to applications. The two structural shifts that define 2026 are modular blockchains (Celestia, EigenLayer) and the stablecoin economy, where annual settlement volume now exceeds Visa. Real-world asset tokenization has gone from a slide-deck thesis to a $30B+ live market, led by BlackRock’s BUIDL and tokenized US treasuries. The destination for the next two years is clear: payments, treasuries, and AI agents using crypto rails - and most users will not know they are using a blockchain. What Actually Survived It is worth saying out loud: most of the things that called themselves “the future of finance” in 2021 are gone. The 2022-2023 unwind cleared out the projects that had no users, no revenue, and no reason to exist. What remains in 2026 is a much smaller, much more boring, and much more useful set of networks. ...

May 4, 2026 · 15 min · James M
Interstellar Physics and Philosophy Banner

The Physics and Philosophy of Interstellar

There are not many films where the visual effects pipeline produces a peer-reviewed physics paper. Christopher Nolan’s Interstellar is one of them. The visualisation of the supermassive black hole Gargantua was rigorous enough that it ended up in Classical and Quantum Gravity, co-authored by the visual effects team and Nobel laureate Kip Thorne. That single fact captures what makes the film unusual. It is, on the surface, a story about love, time, and survival. Underneath, it is a serious attempt to take Einstein’s general relativity and put it on a 70mm IMAX screen with as little fudging as Hollywood would allow. ...

May 4, 2026 · 14 min · James M
China Space Programme 2026 Banner

China's Space Programme in 2026 - Tiangong, Chang'e, Lunar Plans

TL;DR China’s space programme in 2026 is one of the most consistently executed national space efforts in history. Where Western programmes have lurched between budgets and political cycles, China’s CNSA has shipped roughly what it announced, on roughly the timelines it announced. The Tiangong space station is fully operational, continuously crewed, and has hosted both domestic and international experiments. The Chang’e lunar series has progressed from sample return (Chang’e 5, 6) to the precursors of a crewed lunar landing programme planned before 2030. China has now returned samples from both the near and far sides of the Moon - the only nation to have done so. The lunar plan centres on the International Lunar Research Station (ILRS) - a long-term, China-led, multinational lunar surface base, with crewed landings as a milestone rather than the goal. Mars sample return, deep-space exploration, and a permanent lunar presence are all on a credible timeline. The realistic 2030 picture is two distinct, durable lunar architectures - American and Chinese - running in parallel. Why It Is Worth Looking Carefully It is easy in Western coverage to treat China’s space programme as a backdrop to the Artemis story. That undersells what is actually happening. ...

May 3, 2026 · 9 min · James M