Partitioning is the most popular method of dividing a dataset into smaller parts. It's worth knowing that it can be complemented with another technique called bucketing.
This post begins a new series dedicated to Apache Spark 2.4.0 features. The first topic covered is bucket pruning.