A 3-day bug hunt on a 3-person team costs up to €7,200 in lost engineering time. This workshop teaches you to prevent that: unit tests, data tests, and integration tests for PySpark and Databricks Lakeflow, including Spark Declarative Pipelines.
One of the biggest changes to the Apache Spark Structured Streaming API over the past few years is undoubtedly the introduction of the declarative API, AKA Spark Declarative Pipelines. This post kicks off a three-part series dedicated to this new functionality. By the end of these articles, you will be able to effectively leverage declarative programming in your workflows and gain a deeper understanding of what happens under the hood when you do.
Last week, we explored Spark Declarative Pipelines as a new way of writing streaming pipelines. However, writing the pipelines is only half the battle; the other, and perhaps more critical, half is understanding exactly what happens once they are in motion. That is exactly what we are going to dive into today.
Welcome back to our series on Spark Declarative Pipelines (SDP)! So far, we've tackled the fundamentals of building jobs and the logistics of operationalizing them in production. Now that your pipelines are running smoothly, it's time to pop the hood and see what's actually happening underneath.
Even though I've wrapped up my exploration of Spark Declarative Pipelines, there is still one topic on my mind. How does "vanilla" SDP relate to the Databricks version, known as Lakeflow Spark Declarative Pipelines? I'll try to answer that today and, hopefully, share some interesting insights with you.
Welcome to the second blog post on Lakeflow Spark Declarative Pipelines. Today we are moving beyond the environment setup to see how to declare the processing jobs, as in the sketch below.
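To give a rough feel for what such a declaration looks like, here is a minimal, illustrative sketch using the classic `dlt` Python module, which Lakeflow pipelines still accept. The table names, the landing path, and the expectation are invented for the example, and the `spark` session is assumed to be injected by the pipeline runtime rather than created in the file.

```python
# Minimal sketch of declaring tables in a Lakeflow / DLT pipeline.
# Assumes it runs inside a Databricks pipeline, where the `dlt` module
# and a `spark` session are provided by the runtime; table names, the
# landing path, and the quality rule below are illustrative only.
import dlt
from pyspark.sql.functions import col


@dlt.table(comment="Raw events ingested from cloud storage.")
def events_raw():
    # Incremental streaming read from an illustrative landing path via Auto Loader.
    return (
        spark.readStream
        .format("cloudFiles")
        .option("cloudFiles.format", "json")
        .load("/landing/events/")
    )


@dlt.table(comment="Events with basic quality filtering applied.")
@dlt.expect_or_drop("valid_id", "id IS NOT NULL")
def events_clean():
    # Reading the table declared above is what links the two steps;
    # the framework derives the dependency graph from such references.
    return dlt.read_stream("events_raw").where(col("event_type") != "heartbeat")
```

The key point of the declarative style is visible even in this tiny sketch: you describe the tables and how each one is derived, and the runtime works out the execution order and the plumbing between them instead of you wiring up an explicit job.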