This repo contains the material and projects for Udacity Data Streaming ND

nesreensada/Data-Streaming-Udacity-Nanodegree

Data-Streaming-Udacity-Nanodegree

Introduction

This repo contains the material and projects for the Udacity Data Streaming Nanodegree, including the exercises, projects, and extracurricular material.

Link to the certificate: https://confirm.udacity.com/QPLCM63E

Table of Contents

Lessons

  1. Data Ingestion with Kafka & Kafka Streaming

  2. Streaming API Development and Documentation

Projects

  1. Optimize Chicago Bus and Train Availability Using Kafka: a streaming event pipeline built around Apache Kafka and its ecosystem. Using public data from the Chicago Transit Authority, the project constructs an event pipeline around Kafka that simulates and displays the status of train lines in real time. Tools: Python, Kafka, the Faust stream processor, and KSQL.

  2. Analyze San Francisco Crime Rate with Apache Spark Streaming: statistical analyses of a real-world dataset on San Francisco crime incidents, extracted from Kaggle, using Apache Spark Structured Streaming. Tools: Python, Kafka, Spark Streaming.

Certificate

DataStreaming

Licence

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. Please refer to Udacity Terms of Service for further information.
