{"id":13487216,"url":"https://github.com/minerva-ml/steppy","last_synced_at":"2025-03-27T21:32:06.780Z","repository":{"id":37694931,"uuid":"117526101","full_name":"minerva-ml/steppy","owner":"minerva-ml","description":"Lightweight, Python library for fast and reproducible experimentation :microscope:","archived":true,"fork":false,"pushed_at":"2018-11-23T09:48:51.000Z","size":141,"stargazers_count":134,"open_issues_count":13,"forks_count":32,"subscribers_count":13,"default_branch":"master","last_synced_at":"2024-06-15T04:39:12.384Z","etag":null,"topics":["data-science","deep-learning","image-processing","machine-learning","minimal-interface","nlp","open-source","pipeline","python","python-library","python3","reproducibility","reproducible-research","steppy","steppy-library","steppy-toolkit","steps"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/minerva-ml.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-01-15T09:40:49.000Z","updated_at":"2024-06-15T04:39:12.385Z","dependencies_parsed_at":"2022-09-26T17:40:26.699Z","dependency_job_id":null,"html_url":"https://github.com/minerva-ml/steppy","commit_stats":null,"previous_names":["neptune-ml/steppy"],"tags_count":16,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minerva-ml%2Fsteppy","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minerva-ml%2Fsteppy/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minerva-ml%2Fsteppy/releases","manifests_url":"https://repos.eco
syste.ms/api/v1/hosts/GitHub/repositories/minerva-ml%2Fsteppy/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/minerva-ml","download_url":"https://codeload.github.com/minerva-ml/steppy/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245927364,"owners_count":20695221,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-science","deep-learning","image-processing","machine-learning","minimal-interface","nlp","open-source","pipeline","python","python-library","python3","reproducibility","reproducible-research","steppy","steppy-library","steppy-toolkit","steps"],"created_at":"2024-07-31T18:00:56.752Z","updated_at":"2025-03-27T21:32:06.416Z","avatar_url":"https://github.com/minerva-ml.png","language":"Python","funding_links":[],"categories":["The Data Science Toolbox","工作流程和实验跟踪","Python"],"sub_categories":["Miscellaneous Tools"],"readme":"# Steppy\n[![license](https://img.shields.io/github/license/mashape/apistatus.svg?maxAge=2592000)](https://github.com/minerva-ml/steppy/blob/master/LICENSE)\n\n### What is Steppy?\n1. Steppy is a lightweight, open-source, Python 3 library for fast and reproducible experimentation.\n1. Steppy lets data scientists focus on data science, not on software development issues.\n1. Steppy's minimal interface does not impose constraints, yet it enables clean machine learning pipeline design.\n\n### What problem does steppy solve?\n#### Problems\nIn the course of a project, a data scientist faces two problems:\n1. 
Difficulties with reproducibility in data science / machine learning projects.\n1. Lack of the ability to prepare or extend experiments quickly.\n\n#### Solution\nSteppy addresses both problems by introducing two simple abstractions: `Step` and `Transformer`. We consider them a minimal interface for building machine learning pipelines.\n1. `Step` is a wrapper over the transformer and handles multiple aspects of the execution of the pipeline, such as saving intermediate results (if needed), checkpointing the model during training and much more.\n1. `Transformer`, in turn, is a purely computational, data scientist-defined piece that takes input data and produces output data. Typical Transformers are neural networks, machine learning algorithms, and pre- or post-processing routines.\n\n# Start using steppy\n### Installation\nSteppy requires `python3.5` or above.\n```bash\npip3 install steppy\n```\n_(you probably want to install it in your [virtualenv](https://virtualenv.pypa.io/en/stable))_\n\n### Resources\n1. :ledger: [Documentation](https://steppy.readthedocs.io/en/latest)\n1. :computer: [Source](https://github.com/minerva-ml/steppy)\n1. :name_badge: [Bug reports](https://github.com/minerva-ml/steppy/issues)\n1. :rocket: [Feature requests](https://github.com/minerva-ml/steppy/issues)\n1. 
:star2: Tutorial notebooks ([their repository](https://github.com/minerva-ml/steppy-examples)):\n    - :arrow_forward: [Getting started](https://github.com/minerva-ml/steppy-examples/blob/master/tutorials/1-getting-started.ipynb)\n    - :arrow_forward: [Steps with multiple inputs](https://github.com/minerva-ml/steppy-examples/blob/master/tutorials/2-multi-step.ipynb)\n    - :arrow_forward: [Advanced adapters](https://github.com/minerva-ml/steppy-examples/blob/master/tutorials/3-adapter_advanced.ipynb)\n    - :arrow_forward: [Caching and persistence](https://github.com/minerva-ml/steppy-examples/blob/master/tutorials/4-caching-persistence.ipynb)\n    - :arrow_forward: [Steppy with Keras](https://github.com/minerva-ml/steppy-examples/blob/master/tutorials/5-steps-with-keras.ipynb)\n\n### Feature Requests\nPlease send us your ideas on how to improve the steppy library! We are looking for your comments here: [Feature requests](https://github.com/minerva-ml/steppy/issues).\n\n### Roadmap\n:fast_forward: At this point steppy is an early-stage library, heavily tested on multiple machine learning challenges ([data-science-bowl](https://github.com/minerva-ml/open-solution-data-science-bowl-2018 \"Kaggle's data science bowl 2018\"), [toxic-comment-classification-challenge](https://github.com/minerva-ml/open-solution-toxic-comments \"Kaggle's Toxic Comment Classification Challenge\"), [mapping-challenge](https://github.com/minerva-ml/open-solution-mapping-challenge \"CrowdAI's Mapping Challenge\")) and educational projects ([minerva-advanced-data-scientific-training](https://github.com/minerva-ml/minerva-training-materials \"minerva.ml -\u003e advanced data scientific training\")).\n\n:fast_forward: We are developing steppy towards a practical tool for data scientists, so that they can run their experiments easily and change their pipelines with just a few manipulations in the code.\n\n### Related projects\nWe are also building [steppy-toolkit](https://github.com/minerva-ml/steppy-toolkit \"steppy 
toolkit\"), a collection of high quality implementations of the top deep learning architectures -\u003e all of them with the same, intuitive interface.\n\n### Contributing\nYou are welcome to contribute to the Steppy library. Please check [CONTRIBUTING](https://github.com/minerva-ml/steppy/blob/master/CONTRIBUTING.md) for more information.\n\n### Terms of use\nSteppy is [MIT-licensed](https://github.com/minerva-ml/steppy/blob/master/LICENSE).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fminerva-ml%2Fsteppy","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fminerva-ml%2Fsteppy","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fminerva-ml%2Fsteppy/lists"}