Skip to content

Latest commit

 

History

History
47 lines (27 loc) · 1.55 KB

checklist.md

File metadata and controls

47 lines (27 loc) · 1.55 KB

Data Engineering Capstone

Write Up

Scoping the Project

  • [ok] The write up includes an outline of the steps taken in the project.
  • [ok] The purpose of the final data model is made explicit.

Addressing Other Scenarios

The write up describes a logical approach to this project under the following scenarios:

  • [ok] The data was increased by 100x.
  • [ok] The pipelines would be run on a daily basis by 7 am every day.
  • [ok] The database needed to be accessed by 100+ people.

Defending Decisions

[ok] The choice of tools, technologies, and data model are justified well.

Execution

Project code is clean and modular.

[ok] All coding scripts have an intuitive, easy-to-follow structure with code separated into logical functions. Naming for variables and functions follows the PEP8 style guidelines. The code should run without errors.

Quality Checks

[ok] The project includes at least two data quality checks.

Data Model

  • [ok] The ETL processes result in the data model outlined in the write-up.
  • [doing] A data dictionary for the final data model is included.
  • [ok] The data model is appropriate for the identified purpose.

Datasets

The project includes:

  • [ok] At least 2 data sources
  • [ok] More than 1 million lines of data.
  • [ok - csv and orc] At least two data sources/formats (csv, api, json)

https://classroom.udacity.com/nanodegrees/nd027/parts/55db5e68-a304-4bed-a756-ef368985d7e2/modules/6263097f-417a-40de-9099-0602babef3d2/lessons/111b82aa-f94b-4158-901d-764bdc7d12d7/concepts/e9f4e2a3-e2a9-4b7d-8e83-d4d263ee7d90