Data Engineering Project Documentation Templates

This repository is used to provide guidance in a standard data engineering project that consists of a data lake and data warehouse. The documentation originated out of a need to standardize a requirements gathering methodology. It is derived from the CRISP-DM (Cross Industry Standard Process for Data Mining).

Usage

There are nine templates numbered in logical order within the templates directory. These templates have text in italics that is used for reference purposes. You may clone, modify, or fork the repository at your leisure.

Note that some documentation processes may overlap as you learn more about your project. Do not feel obligated to fill everything out in sequence. Generally you will fill out the first few documents in order and adjust as needed. For more details, learn about CRISP-DM.

Contributing

My goal in publishing these templates is to make it teach others how to formalize a process around data engineering. It would be awesome for the community to expand on some of the templates to make them more featureful.

To contribute, fork this repository and open a pull request with your changes.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
templates		templates
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Engineering Project Documentation Templates

Usage

Contributing

About

Releases

Packages

License

tylerwmarrs/data-engineering-project-doc-templates

Folders and files

Latest commit

History

Repository files navigation

Data Engineering Project Documentation Templates

Usage

Contributing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages