Apache Spark data sources articles

on waitingforcode.com
Articles tagged with Apache Spark data sources. There are 1 article(s) corresponding to the tag Apache Spark data sources. If you don't find what you're looking for, please check related tags: AWS EC2, Big Data patterns implemented, Change Data Capture, completable future, horizontal scalability, hybrid orchestration and coordination, idempotent consumer, Kubernetes, POC, random algorithms.

Apache Spark 2.4.0 features - Avro data source

Apache Avro became one of the serialization standards, among others because of its use in Apache Kafka's schema registry. Previously to work with Avro files with Apache Spark we needed Databrick's external package. But it's no longer the case starting from 2.4.0 release where Avro became first-class citizen data source. Continue Reading →