ETL streaming pipeline using python. generating a constant stream of fake web server logs, analyzing it, and storing in a PostgreSQL DB
-
Updated
Mar 4, 2020 - Python
ETL streaming pipeline using python. generating a constant stream of fake web server logs, analyzing it, and storing in a PostgreSQL DB
This project demonstrates a modern ETL (Extract, Transform, Load) streaming pipeline using various open-source technologies.
TweetPipe Apache Flink AWS Kinesis Consumer. A Flink-based consumer that reads from an AWS Kinesis source and maps the input stream elements to a domain model. Future iterations will output the transformed data to a sink.
Add a description, image, and links to the etl-streaming-pipeline topic page so that developers can more easily learn about it.
To associate your repository with the etl-streaming-pipeline topic, visit your repo's landing page and select "manage topics."