Articles about bucketing in Spark SQL on waitingforcode.com

September 15, 2019 • Apache Spark SQL

Buckets in Apache Spark SQL

Partitioning is the most popular method to divide a dataset into smaller parts. It's important to know that it can be completed with another technique called bucketing.

Continue Reading →

January 2, 2019 • Apache Spark SQL

Apache Spark 2.4.0 features - bucket pruning

This post begins a new series dedicated to Apache Spark 2.4.0 features. The first covered topic will be bucket pruning.

Continue Reading →

bucketing in Spark SQL articles

Buckets in Apache Spark SQL

Apache Spark 2.4.0 features - bucket pruning