Build Status

Coverage Status

Cookiecutter EasyData

A flexible (but opinionated) toolkit for doing and sharing reproducible data science.

EasyData started life as an experimental fork of cookiecutter-data-science where we could try out ideas before proposing them as fixes to the upstream branch. It has grown into its own toolkit for implementing a reproducible data science workflow, and is the basis of our Bus Number tutorial on Reproducible Data Science.

Tutorial

For a tutorial on making use of this framework, visit: https://github.com/hackalog/bus_number/

Requirements to use this cookiecutter template:

$ pip install cookiecutter

or

$ conda config --add channels conda-forge
$ conda install cookiecutter

To start a new project, run:


cookiecutter https://github.com/hackalog/cookiecutter-easydata

The resulting directory structure


The directory structure of your new project looks like this:

Installing development requirements

The first time:

make create_environment
git init
git add .
git commit -m "initial import"
git branch easydata   # tag for future easydata upgrades

Subsequent updates:

make update_environment

In case you need to delete the environment later:

conda deactivate
make delete_environment