Articles about Apache Spark 4.0.0 features on waitingforcode.com

August 20, 2025 • Apache Spark Structured Streaming

What's new in Apache Spark 4.0.0 - Arbitrary state API v2 - batch

To close the topic of the new arbitrary stateful processing API in Apache Spark Structured Streaming let's focus on its...batch counterpart!

Continue Reading →

August 13, 2025 • Apache Spark Structured Streaming

What's new in Apache Spark 4.0.0 - Arbitrary state API v2 - internals

Last week we discovered the new way to write arbitrary stateful transformations in Apache Spark 4 with the transformWithState API. Today it's time to delve into the implementation details and try to understand the internal logic a bit better.

Continue Reading →

August 6, 2025 • Apache Spark Structured Streaming

What's new in Apache Spark 4.0 - Arbitrary state API v2 - introduction

Arbitrary stateful processing has been evolving a lot in Apache Spark. The initial version with updateStateByKey evolved to mapWithState in Apache Spark 2. When Structured Streaming was released, the framework got mapGroupsWithState and flatMapGroupsWithState. Now, Apache Spark 4 introduces a completely new way to interact with the arbitrary stateful processing logic, the Arbitrary state API v2!

Continue Reading →

Apache Spark 4.0.0 features articles

What's new in Apache Spark 4.0.0 - Arbitrary state API v2 - batch

What's new in Apache Spark 4.0.0 - Arbitrary state API v2 - internals

What's new in Apache Spark 4.0 - Arbitrary state API v2 - introduction