https://github.com/vvssttkk/dst

yet another custom data science template via cookiecutter

codestyle cookiecutter datascience deeplearning deeplearning-ai github-template github-templates githubcli machine-learning-projects machinelearning machinelearning-python machinelearningprojects python python-package template template-project template-repository


Host: GitHub
URL: https://github.com/vvssttkk/dst
Owner: vvssttkk
License: mit
Created: 2019-08-11T12:20:37.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2023-04-21T11:56:22.000Z (over 2 years ago)
Last Synced: 2024-11-14T09:07:25.786Z (about 1 year ago)
Topics: codestyle, cookiecutter, datascience, deeplearning, deeplearning-ai, github-template, github-templates, githubcli, machine-learning-projects, machinelearning, machinelearning-python, machinelearningprojects, python, python-package, template, template-project, template-repository
Language: Python
Homepage: https://vvssttkk.github.io/dst/
Size: 343 KB
Stars: 67
Watchers: 2
Forks: 5
Open Issues: 0
Metadata Files:
- Readme: readme.md
- Contributing: contributing.md
- License: license
- Citation: citation.cff


# data science template

this repo provides a default template for ds/ml/dl and similar projects

## how to use

* **before creating a new project from this template, install the following dependencies**

* https://github.com/cookiecutter/cookiecutter

```bash
pip install cookiecutter
```

* https://github.com/cli/cli?tab=readme-ov-file#installation

* **then go to the directory where you want to create your project and run**

```bash
cookiecutter gh:vvssttkk/dst
```

* **follow the instructions in the prompts**
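
the steps above assume that both `cookiecutter` and `gh` are already on your `PATH`. a minimal sketch of a pre-flight check is shown here; the helper name `missing_tools` is hypothetical and not part of this template

```bash
# hypothetical helper (not part of the template):
# prints every tool from the argument list that is not found on PATH
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# the two dependencies named above: cookiecutter and the github cli (gh)
missing="$(missing_tools cookiecutter gh)"
if [ -n "$missing" ]; then
    # left unquoted on purpose so the names are printed on one line
    echo "install these first:" $missing
else
    echo "all set, run: cookiecutter gh:vvssttkk/dst"
fi
```

the check only looks for the executables; it does not verify versions or whether `gh` is authenticated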

## project structure

```markdown
├── .github/
│   ├── workflows/                      <- some actions
│   │   ├── ci.yml
│   │   └── dependency-review.yml
│   │
│   └── dependabot.yml
│
├── config/                             <- often it's yaml-files with some parameters
│
├── data/
│   ├── external/                       <- data from third party sources
│   ├── interim/                        <- intermediate data that has been transformed
│   ├── processed/                      <- the final, canonical data sets for modeling
│   ├── raw/                            <- the original, immutable data dump
│   ├── features/                       <- another
│   └── README.md
│
├── docs/                               <- a default sphinx project (see sphinx-doc.org for details)
│
├── experiments/                        <- for any experiments
│   └── README.md
│
├── models/                             <- trained & serialized models, model predictions, or model summaries
│   └── README.md
│
├── notebooks/                          <- notebooks for research
│                                          naming convention is a number (for ordering), the creator's initials, and a short `-` delimited description, eg `1.0-jqp-initial-data-exploration`
│
├── references/                         <- data dictionaries, manuals, and all other explanatory materials
│   └── README.md
│
├── tests/                              <- test for project
│   └── __init__.py
│
├── {{ cookiecutter.project_name }}/    <- source code
│   ├── __init__.py                     <- propose generate with `mkinit`
│   ├── data/                           <- scripts to download or generate data
│   ├── models/                         <- scripts to train models and then use trained models to make predictions
│   └── visualization/                  <- scripts to create exploratory and results oriented visualizations
│
├── .gitignore                          <- default for python
├── .pre-commit-config.yaml             <- custom pcc with `reorder_python_imports`, `black`, `flake8`, `pyright`, `mypy`, `pre-commit-hooks`..
├── LICENSE                             <- will be created if u choose
├── README.md
└── requirements.txt                    <- propose generate with `pipreqs`
```
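
the notebook naming convention (a number for ordering, the creator's initials, and a short `-` delimited description) can be checked mechanically. the sketch below is one possible reading of that convention, not something shipped with the template: the helper name `is_valid_notebook_name` and the exact regex (lowercase initials, lowercase alphanumeric words) are assumptions

```bash
# hypothetical helper (not part of the template):
# succeeds when the name looks like <number>-<initials>-<short-description>,
# eg 1.0-jqp-initial-data-exploration
# assumption: initials are lowercase letters, description words are lowercase alphanumerics
is_valid_notebook_name() {
    printf '%s\n' "$1" | grep -Eq '^[0-9]+(\.[0-9]+)?-[a-z]+-[a-z0-9]+(-[a-z0-9]+)*$'
}

# try it on a few sample names (given without the .ipynb extension)
for name in 1.0-jqp-initial-data-exploration exploration 2-ab-baseline; do
    if is_valid_notebook_name "$name"; then
        echo "ok:  $name"
    else
        echo "bad: $name"
    fi
done
```

loosen the pattern if your team uses uppercase initials or other characters in descriptions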

## other similar templates

* https://github.com/drivendata/cookiecutter-data-science
* https://github.com/quantumblacklabs/kedro
