Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/datasciencetoolbox/datasciencetoolbox
A complete environment for busy polyglot data scientists
https://github.com/datasciencetoolbox/datasciencetoolbox
Last synced: 5 days ago
JSON representation
A complete environment for busy polyglot data scientists
- Host: GitHub
- URL: https://github.com/datasciencetoolbox/datasciencetoolbox
- Owner: datasciencetoolbox
- License: mit
- Created: 2013-12-22T21:42:09.000Z (almost 11 years ago)
- Default Branch: master
- Last Pushed: 2021-04-14T13:06:00.000Z (over 3 years ago)
- Last Synced: 2024-08-01T12:36:30.581Z (3 months ago)
- Language: Makefile
- Homepage:
- Size: 76.2 KB
- Stars: 468
- Watchers: 35
- Forks: 127
- Open Issues: 14
-
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
- awesome-starred - datasciencetoolbox/datasciencetoolbox - Complete environments for busy polyglot data scientists (others)
README
# Data Science Toolbox
[![License: MIT](https://img.shields.io/badge/License-MIT-green.svg)](https://opensource.org/licenses/MIT)
If you're a data scientist, installing all the software you need can be quite involved. The goal of the Data Science Toolbox is to provide a virtual environment that will enable you to start doing data science in a matter of minutes.
The Data Science Toolbox is currently being revived for the upcoming second edition of [Data Science at the Command Line](https://www.datascienceatthecommandline.com).
At the moment there's only a basic Docker image ([datasciencetoolbox/dsatcl2e](https://hub.docker.com/repository/docker/datasciencetoolbox/dsatcl2e)), which is based on Ubuntu 20.04 and includes tools such as:* jq
* xmlstarlet
* GNU parallel
* xsv
* pup
* vowpal wabbitUnder the hood, this project employs Packer, Ansible, and Docker. We'll soon add support for other platforms such as Vagrant, VirtualBox, VMware, and AWS. Expect many breaking changes in the coming months as we're learning this on-the-fly. Stay tuned.
## License
The Data Science Toolbox is licensed under the MIT License.