waiting for code
  • Home
  • About
  • Tips
  • #Tags
  • Data engineering
    • Apache Airflow
    • Big Data algorithms
    • Big Data problems - solutions
    • Data engineering patterns
    • General Big Data
    • General data engineering
    • Graphs
    • SQL
  • Data processing
    • Apache Beam
    • Apache Spark
    • Apache Spark GraphFrames
    • Apache Spark GraphX
    • Apache Spark SQL
    • Apache Spark Streaming
    • Apache Spark Structured Streaming
    • PySpark
  • Storage
    • Apache Avro
    • Apache Cassandra
    • Apache Hudi
    • Apache Iceberg
    • Apache Parquet
    • Apache ZooKeeper
    • Delta Lake
    • Elasticsearch
    • Embedded databases
    • HDFS
    • MySQL
    • PostgreSQL
    • Time series
  • Messaging
    • Apache Kafka
    • Apache Pulsar
    • RabbitMQ
  • Cloud
    • Data engineering on AWS
    • Data engineering on Azure
    • Data engineering on GCP
    • Data engineering on the cloud
  • JVM
    • Java
    • Scala
  • Software engineering
    • Programming
    • Testing
    • Web security
Home External contributions

This blog is the main place where I'm contributing to. However, from time to time I'm also active in other places. I give some talks to my work colleagues or publish the posts in other places. You can find here the list of my external contributions:

14.06.2022 - Airbyte is in the air - data ingestion with Airbyte

26.05.2022 - Lunch & Learn internal talk about Airbyte

11.2020 - Extending Apache Spark – Beyond Spark Session Extensions

Extending Apache Spark – Beyond Spark Session Extensions from Databricks

02.2020 - Happy New Year! Paris Kafka Meetup Février: Apache Kafka + Apache Spark = ♡

Apache Spark Structured Streaming + Apache Kafka = ♡ from Bartosz Konieczny

11.2019 - Paris.py #22 @ Meilleurs Agents: Using Cerberus and PySpark to validate semi-structured datasets

Using Cerberus and PySpark to validate semi-structured datasets from Bartosz Konieczny

10.2019 - Spark+AI 2019: Using Apache Spark to Solve Sessionization Problem in Batch and Streaming

Using Apache Spark to Solve Sessionization Problem in Batch and Streaming from Databricks

09.2019 - Apache Spark Meetup chez AWS le jeudi 5 septembre 2019: Apache Spark in your likeness - low and high level customization

Apache Spark in your likeness - low and high level customization from Bartosz Konieczny

11.2018 - "Graphs - going distributed internal talk

Distributed graph processing from Bartosz Konieczny

New ebook 🔥

📚 Newsletter

Get new posts, recommended reading and other exclusive information every week. SPAM free - no 3rd party ads, only the information about waitingforcode! Curious about the content ? Check some of already sent newsletters

You want to learn data engineering but have no idea where and how to start in this wide domain? Check if Become a Data Engineer can help you 💪

  • Data engineering
  • Data processing
  • Storage
  • Messaging
  • Cloud
  • JVM
  • Software engineering

privacy policy © 2014 - 2023 waitingforcode.com. All rights reserved | Design: Jakub Kędziora