2025 retrospective on waitingforcode.com

Before I share the usual retrospective for the past year, I want to thank you for following along in 2025! Even though I'm primarily writing for "me-from-the-future", it's always great to know that people other than my future self find these posts helpful ;)

4-day workshop · In-person or online

What would it take for you to trust your Databricks pipelines in production?

A 3-day bug hunt on a 3-person team costs up to €7,200 in lost engineering time. This workshop teaches you to prevent that — unit tests, data tests, and integration tests for PySpark and Databricks Lakeflow, including Spark Declarative Pipelines.

Unit, data & integration tests
Medallion architecture & Lakeflow SDP
Max 10 participants · production-ready templates
See the full curriculum → €7,000 flat fee · cohort of up to 10
Bartosz Konieczny
Bartosz
Konieczny

Data engineering design patterns

Last year (cf. retrospective for 2024) I revealed the so-called Secret project I had been working on past months. This year I'm excited to share the Data Engineering Design Patterns book has been finally released!

Technically, the book has even been already upgraded! Thanks to Buf's sponsorship, there is an additional, 11th chapter on streaming data design patterns! This extra chapter is only available in the PDF version, though!

And besides pure facts, let me share some community feedback, including comments on Amazon's page, few blog posts, and community reviews; I might have mentioned that already, but devoting my Miracle Morning routine for writing the book for the past two years was totally worth it! Here are some of most recent screenshots I captured from social media:

Blog in 2025

After spending my 2024 mostly on writing the book, I did a bit better this year for the blog posts, as you can see in the summary table below:

YearBlog posts
202440
202428
202353
202268
202193
2020105
2019114
2018139

Besides, I'm pretty satisfied with the blog posts distribution. As planned, I could finally write a bit more about Databricks which is translated by 9 blog posts published in 2025. The single deception for me is Apache Flink. I wanted to spend some extra time without putting too much pressure on it but I couldn't. Not having this pressure was probably a mistake leading to 0 Apache Flink blog posts in 2025. The overall distribution looks like in the next table:

TopicBlog posts in 2025Blog posts in 2024
Databricks 9 0
Apache Spark SQL 7 1
Delta Lake 7 4
Apache Spark Structured Streaming 6 10
General data engineering 6 7
SQL 2 0
Data engineering patterns 2 1
PySpark 1 0

Plans for 2025

Before I share with you my plans for 2026, let's go back to 2025 and see what I have been expecting from it and what was the realty:

Data Engineering Design Patterns:

Blogging:

Freelancing:

Become a Better Data Engineer:

Data engineering patterns on the cloud:

Speaker:

Cloud data engineer:

Plans for 2026

What about next year?

Data Engineering Design Patterns:

Blogging:

Freelancing:

Speaker:

Cloud data engineer:

I like this retrospective series. Not because it's close to my birthday but because it shows the progress made over a longer period of time, and because it helps organize the incoming year (insisting on "helps" as everything remains flexible and the single known part in life is change).

Wishing you all a healthy and successful 2026!
Best,
Bartosz

Data Engineering Design Patterns

Looking for a book that defines and solves most common data engineering problems? I wrote one on that topic! You can read it online on the O'Reilly platform, or get a print copy on Amazon.

I also help solve your data engineering problems contact@waitingforcode.com đź“©