
Apache Iceberg in 2026: The Open Table Format That Won

In 2023, the question was “which open table format will survive - Iceberg, Delta, or Hudi?” In 2026, that debate is over. Apache Iceberg won, and it won for reasons that have almost nothing to do with its raw performance. It won because it is the only format that both Snowflake and Databricks now treat as a first-class citizen, because the vendors picked sides on catalogs rather than table formats, and because enterprise buyers decided that multi-engine portability was worth more than a small performance edge. ...

April 22, 2026 · 11 min · James M

Snowflake Storage for Apache Iceberg: Enterprise Open Data Comes to AWS and Azure

A New Era for Open Data Formats

Snowflake has announced the general availability of Snowflake Storage for Apache Iceberg on both AWS and Azure, marking a significant shift in how enterprises can build open, interoperable data lakehouses. This development combines Snowflake’s enterprise reliability and governance capabilities with the flexibility and openness of Apache Iceberg, one of the most promising open table formats in the data ecosystem.

What is Snowflake Storage for Apache Iceberg?

Snowflake Storage for Apache Iceberg enables users to query and manage Iceberg tables using Snowflake’s SQL engine while storing data in their own cloud object storage. This is fundamentally different from traditional Snowflake architectures - you get: ...

April 18, 2026 · 4 min · James M

Unity Catalog in Practice: Lessons From the Field

Unity Catalog sounds straightforward: “one governance layer for all your data and AI assets.” In theory, it’s elegant. In practice, you’ll run into gotchas that docs don’t prepare you for. This post is from the field - patterns that work, mistakes I’ve seen repeated, and how to actually build a sustainable governance layer in 2026.

What Unity Catalog Is (And Isn’t)

What It Is

A unified access control and metadata layer for: ...

April 5, 2026 · 10 min · James M

Databricks vs Snowflake in 2026: An Honest Comparison

The question “Databricks or Snowflake?” has dominated data engineering conversations for the past five years. In 2026, it’s still the wrong question. But let me answer it anyway, because sometimes you have to pick one.

The Honest Framing

By 2026, both platforms have converged in surprising ways:

- Databricks started as a Spark compute engine and added warehouse features
- Snowflake started as a cloud data warehouse and added Iceberg support for lakehouse semantics
- Both now claim to be “lakehouses” that combine data lake flexibility with warehouse performance

The difference isn’t in capability - it’s in architectural DNA, operational model, and what they expect you to optimize for. ...

April 5, 2026 · 11 min · James M

Databricks Training & Certification

Overview

Databricks offers certification tracks aligned to common roles: Data Engineer, Data Analyst, Apache Spark Developer, Machine Learning Engineer, and Generative AI Engineer.

All certifications:

- Validity: 2 years from pass date
- Cost: $200 per exam attempt
- Format: Multiple choice, proctored online
- Recent Updates (2026): Emphasis on Lakeflow Declarative Pipelines (the evolution of DLT), Unity Catalog, liquid clustering, predictive optimization, AUTO CDC, Lakehouse Federation, and serverless compute

Choose a certification based on your: ...

April 4, 2026 · 4 min · James M