Data Engineering Capstone

Write Up

Scoping the Project

[ok] The write up includes an outline of the steps taken in the project.
[ok] The purpose of the final data model is made explicit.

Addressing Other Scenarios

The write up describes a logical approach to this project under the following scenarios:

[ok] The data was increased by 100x.
[ok] The pipelines would be run on a daily basis by 7 am every day.
[ok] The database needed to be accessed by 100+ people.

Defending Decisions

[ok] The choice of tools, technologies, and data model are justified well.

Execution

Project code is clean and modular.

[ok] All coding scripts have an intuitive, easy-to-follow structure with code separated into logical functions. Naming for variables and functions follows the PEP8 style guidelines. The code should run without errors.

Quality Checks

[ok] The project includes at least two data quality checks.

Data Model

[ok] The ETL processes result in the data model outlined in the write-up.
[doing] A data dictionary for the final data model is included.
[ok] The data model is appropriate for the identified purpose.

Datasets

The project includes:

[ok] At least 2 data sources
[ok] More than 1 million lines of data.
[ok - csv and orc] At least two data sources/formats (csv, api, json)

https://classroom.udacity.com/nanodegrees/nd027/parts/55db5e68-a304-4bed-a756-ef368985d7e2/modules/6263097f-417a-40de-9099-0602babef3d2/lessons/111b82aa-f94b-4158-901d-764bdc7d12d7/concepts/e9f4e2a3-e2a9-4b7d-8e83-d4d263ee7d90

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

checklist.md

checklist.md

Data Engineering Capstone

Write Up

Scoping the Project

Addressing Other Scenarios

Defending Decisions

Execution

Project code is clean and modular.

Quality Checks

Data Model

Datasets

Files

checklist.md

Latest commit

History

checklist.md

File metadata and controls

Data Engineering Capstone

Write Up

Scoping the Project

Addressing Other Scenarios

Defending Decisions

Execution

Project code is clean and modular.

Quality Checks

Data Model

Datasets