Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/icoxfog417/datascience-template
Data science project template
https://github.com/icoxfog417/datascience-template
amazon-sagemaker-lab datascience jupyter-notebook machine-learning
Last synced: 4 months ago
JSON representation
Data science project template
- Host: GitHub
- URL: https://github.com/icoxfog417/datascience-template
- Owner: icoxfog417
- License: mit
- Created: 2022-08-11T01:42:26.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-12-08T00:56:38.000Z (about 2 years ago)
- Last Synced: 2024-04-15T12:50:35.549Z (10 months ago)
- Topics: amazon-sagemaker-lab, datascience, jupyter-notebook, machine-learning
- Language: Jupyter Notebook
- Homepage:
- Size: 20.5 KB
- Stars: 19
- Watchers: 3
- Forks: 4
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# datascience-template
[data:image/s3,"s3://crabby-images/c3d6b/c3d6b8f85639420a678241169d29ee582748cac0" alt="Source Code Check"](https://github.com/icoxfog417/datascience-template/actions/workflows/ci.yml)
[data:image/s3,"s3://crabby-images/65ea5/65ea56a782a92978a9d1c30ea7b56b2bd4e42d2a" alt="pre-commit"](https://github.com/pre-commit/pre-commit)
[data:image/s3,"s3://crabby-images/98647/986475842f2907062b79c4bb27fdd075d638e5b9" alt="Code style: black"](https://github.com/psf/black)
[data:image/s3,"s3://crabby-images/10899/1089942a0d6d800df6d58a079d247a7e57ce2676" alt="Code style: flake8"](https://github.com/PyCQA/flake8)
[data:image/s3,"s3://crabby-images/2b8ac/2b8ac0b40688968d026c257731b2e37213f350f1" alt="Imports: isort"](https://pycqa.github.io/isort/)
[data:image/s3,"s3://crabby-images/a7d5b/a7d5bc2d612b4ef6b27b7893e146f851535cd219" alt="Typing: mypy"](https://github.com/python/mypy)Well structured and tested data science project template. You can use this [template](https://docs.github.com/ja/repositories/creating-and-managing-repositories/creating-a-repository-from-a-template) when creating the data sicence repository.
📁 **Organized**: The project structure is refereed to [Cookiecutter Data Science](https://github.com/drivendata/cookiecutter-data-science)
🚀 **Prepared**: Major libraries are prepared in `environment.yml`. If you are familiar with [Colaboratory](https://colab.research.google.com/?utm_source=scs-index) environment, please use `environment-colab.yml` .
✅ **Tested**: `scripts` are checked by common linter when [pre-commit](https://pre-commit.com/).
Here is the notebook link to provide the quick access to your analysis. You can create the conda environment by Right click `Build Conda Environment` or `conda create -f environment.yml` in Studio Lab.
[data:image/s3,"s3://crabby-images/93f0e/93f0eb76f7b1999493dd777417858b495378833c" alt="Open in SageMaker Studio Lab"](https://studiolab.sagemaker.aws/import/github/icoxfog417/datascience-template/blob/main/notebooks/example.ipynb)
## Project Structure
```bash
.
├── data
│ ├── external # data from third party sources.
│ ├── processed # data after processing
│ ├── interim # data that transformed
│ └── raw # raw data
├── models # store models
├── notebooks # store notebooks
├── docs # documentation for your project
├── .gitignore # ignore files that cannot commit to Git
├── .pre-commit-config.yaml # configurations for pre-commit
├── pyproject.toml # dependencies for poetry
├── README.md # describe your project
├── scripts # store source code used in notebook
│ └── __init__.py # make src a Python module
└── tests # store tests
└── __init__.py # make tests a Python module
```## Customization
* `environment.yml`: Please specify the packages and versions. As a default, no version is specified.
* `.pre-commit-config.yaml`: Please check the `rev` to check the code.
* Change the Notebook url for `Open in Studio Lab`.