Skip to content

Light `cookiecutter` template for starting data science projects

Notifications You must be signed in to change notification settings

tnwei/cookiecutter-datascience-lite

Repository files navigation

cookiecutter-datascience-lite (WIP, overhaul in progress)

Light cookiecutter template for starting data science projects. Minimal enough to stay out of your way when starting a project, sufficiently adaptable to remain useful as the project evolves.

Motivation

I used cookiecutter-data-science and Kedro for a bit, personally grew to prefer starting projects with a clutter-free bare minimum project structure, and gradually expanding the project as needed. Thus this repo.

Example usage

After installing the cookiecutter Python package, run cookiecutter gh:tnwei/cookiecutter-datascience-lite to create a new project repo. Example:

tnwei@rama:~/projects$ cookiecutter gh:tnwei/cookiecutter-datascience-lite

project_name [Project name]: Iris Species Classification
project_slug [iris-species-classification]: iris
New project directory created at /home/tnwei/projects/iris

A default .pre-commit-config.yaml is included in the project dir.
To use it, install `pre-commit`, `cd` into the project dir and run:

$ cd iris
$ git init
$ pre-commit install -t commit-msg -t pre-commit

Refer to https://pre-commit.com/ for more info.

Created repo structure:

tnwei@rama:~/projects/iris$ tree
.
├── assets
├── data
├── iris
│   ├── data.py
│   └── utils.py
├── logs
├── output
└── README.md

About

Light `cookiecutter` template for starting data science projects

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages