Standard data sources, such as files, queues or sockets are natively implement in Spark Streaming context. But the framework allows the creation of more flexible data consumers called receivers.
You want to learn data engineering but have no idea where and how to start in this wide domain? Check if Become a Data Engineer can help you 💪
📚 Newsletter
Get new posts, recommended reading and other exclusive information every week. SPAM free - no 3rd party ads, only the information about waitingforcode! Curious about the content ? Check some of already sent newsletters
"So what's the interest of using Data Catalog? Why we simply cannot write data on S3 directly from Firehose or any other data source without all that Glue thing?" 🤔 Check if you can answer this and +300 other questions about AWS Big Data services in my AWS Big Data services quizz