Software applications, including the data engineering ones you're working on, may require flexible input parameters. These parameters matter because they often identify the tables or data stores the job interacts with and define the expected outputs. Despite their utility, they can also cause confusion within the code, especially when not managed properly. Let's see how to handle them for PySpark jobs on Databricks.
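To give you a taste before diving in, here is a minimal sketch of one common approach: parsing named arguments in a Python task with argparse. The --input_table and --output_table parameters are illustrative placeholders, not necessarily the ones discussed in the post.

```python
import argparse

from pyspark.sql import SparkSession


def parse_arguments() -> argparse.Namespace:
    # The parameter names below are only illustrative; a Databricks Jobs
    # Python task would pass them through the task's parameters list.
    parser = argparse.ArgumentParser(description="Example parameterized PySpark job")
    parser.add_argument("--input_table", required=True, help="Fully qualified source table")
    parser.add_argument("--output_table", required=True, help="Fully qualified target table")
    return parser.parse_args()


if __name__ == "__main__":
    args = parse_arguments()
    spark = SparkSession.builder.getOrCreate()
    # Read the configured input and write it to the configured output.
    spark.table(args.input_table).write.mode("overwrite").saveAsTable(args.output_table)
```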
I discovered recursive CTEs during my in-depth SQL exploration back in 2018. However, I never had an opportunity to implement them in production. Until recently, when I was migrating workflows from SQL Server to Databricks and one of them used recursive CTEs to build a hierarchy table. If this is the first time you've heard of recursive CTEs, let me share my findings with you!
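To show the kind of problem they solve, here is a minimal PySpark sketch of the loop-based alternative you typically end up writing when the target engine doesn't support recursive CTEs. The employees/manager_id schema is a made-up example, not the hierarchy table from the migrated workflow.

```python
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()

# Made-up example data: (id, manager_id); None marks the hierarchy root.
employees = spark.createDataFrame(
    [(1, None), (2, 1), (3, 1), (4, 2)], ["id", "manager_id"]
)

# Seed level: the roots, equivalent to the anchor member of a recursive CTE.
hierarchy = employees.where(F.col("manager_id").isNull()).select("id", F.lit(0).alias("level"))
current_level = hierarchy

# Iterate until no new rows appear, mimicking the recursive member.
# Note: this assumes the hierarchy is acyclic; a cycle would loop forever.
while True:
    next_level = (
        employees.alias("e")
        .join(current_level.alias("h"), F.col("e.manager_id") == F.col("h.id"))
        .select(F.col("e.id"), (F.col("h.level") + 1).alias("level"))
    )
    if next_level.limit(1).count() == 0:
        break
    hierarchy = hierarchy.unionByName(next_level)
    current_level = next_level

hierarchy.show()
```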
Databricks Jobs is still one of the best ways to run data processing code on Databricks. It supports a wide range of processing modes, from native Python and Scala jobs to framework-based dbt queries. It doesn't require installing anything yourself, as it's a fully serverless offering. Finally, it's also flexible enough to cover most common data engineering use cases. One of these flexibility features is support for different input arguments via the For Each task.
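To picture how a For Each iteration consumes its input, here is a hypothetical sketch of the nested notebook task; the table_name parameter is my own example of a value the For Each task could pass to each iteration.

```python
# Notebook task executed by each For Each iteration (Databricks notebook
# context, where `dbutils` and `spark` are provided by the runtime).
# The "table_name" widget is a made-up example: the For Each task would pass
# one element of its input collection into this parameter.
dbutils.widgets.text("table_name", "")
table_name = dbutils.widgets.get("table_name")

# Each iteration processes its own table.
row_count = spark.table(table_name).count()
print(f"{table_name} has {row_count} rows")
```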
Dealing with numbers can be both easy and challenging at the same time. When you operate on integers, you can encounter integer overflow. When you deal with floating-point types, which are the topic of this blog post, you can encounter rounding issues.
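To make the rounding issue concrete, here is the classic plain-Python illustration (not taken from the post's examples):

```python
from decimal import Decimal

# Binary floating-point cannot represent 0.1 or 0.2 exactly,
# so the sum is slightly off and the equality check fails.
print(0.1 + 0.2)                 # 0.30000000000000004
print(0.1 + 0.2 == 0.3)          # False

# Decimal keeps an exact decimal representation, at the cost of performance.
print(Decimal("0.1") + Decimal("0.2"))                    # 0.3
print(Decimal("0.1") + Decimal("0.2") == Decimal("0.3"))  # True
```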
Databricks Asset Bundles (DAB) greatly simplify managing Databricks jobs and resources. They are also flexible: besides the YAML-based declarative approach, you can add dynamic behavior with scripts.
One of the recommended ways of sharing a library on Databricks is to store the packages in Unity Catalog volumes. That's the theory, but the question is: how do you connect the dots between release preparation and the release process? I'll try to answer this in the blog post.
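As a hint of one possible glue step, here is a hedged sketch that uses the Databricks SDK for Python to push a freshly built wheel into a Unity Catalog volume. The catalog, schema, volume, and wheel names are placeholders, and this is not necessarily the workflow described in the post.

```python
from databricks.sdk import WorkspaceClient

# Placeholder names: adapt the catalog/schema/volume and the wheel path.
WHEEL_PATH = "dist/my_library-1.0.0-py3-none-any.whl"
VOLUME_TARGET = "/Volumes/main/libraries/wheels/my_library-1.0.0-py3-none-any.whl"

# Authentication comes from the environment (e.g. DATABRICKS_HOST / DATABRICKS_TOKEN).
workspace = WorkspaceClient()

# Upload the wheel built during release preparation to the shared volume.
with open(WHEEL_PATH, "rb") as wheel_file:
    workspace.files.upload(VOLUME_TARGET, wheel_file, overwrite=True)
```

The uploaded wheel can then be referenced by its /Volumes/... path in a job or cluster library configuration.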
In the last blog post of the data quality on Databricks series, we're going to discover a Databricks Labs product, the DQX library.
Previously, we learned how to control data quality with Delta Live Tables. Now it's time to see an open-source library in action: Spark Expectations.
Data quality is one of the key factors in a successful data project. Without good quality, even the most advanced engineering or analytics work will not be trusted, and therefore not used. Unfortunately, data quality controls are very often treated as a work item to implement at the end, which sometimes translates to never.
If you have already been working with Apache Airflow, you have certainly met XComs at some point. You know, those variables that you can "exchange" between tasks within the same DAG. If, after switching to Databricks Workflows for data orchestration, you're wondering how to do the same, there is good news: Databricks supports this exchange capability natively with Task values.
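For a quick taste of the API before reading further, here is a minimal sketch; the task and key names are made up:

```python
# In the upstream task's notebook (Databricks notebook context, where
# `dbutils` is provided by the runtime): publish a value for downstream tasks.
dbutils.jobs.taskValues.set(key="processed_rows", value=42)

# In a downstream task of the same job run: read it back.
# debugValue is returned when the notebook runs outside of a job.
processed_rows = dbutils.jobs.taskValues.get(
    taskKey="upstream_task",
    key="processed_rows",
    default=0,
    debugValue=0,
)
print(processed_rows)
```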
For over two years now, you have been able to leverage file triggers in Databricks Jobs to start processing as soon as a new file gets written to your storage. The feature looks amazing but hides some implementation challenges that we're going to cover in this blog post.