{"id":13448150,"url":"https://github.com/vhoulbreque/dafter","last_synced_at":"2025-03-22T08:31:42.478Z","repository":{"id":62566311,"uuid":"152755612","full_name":"vhoulbreque/dafter","owner":"vhoulbreque","description":"📥 Command-line downloader for public datasets","archived":false,"fork":false,"pushed_at":"2019-06-22T18:55:44.000Z","size":117,"stargazers_count":24,"open_issues_count":37,"forks_count":3,"subscribers_count":2,"default_branch":"master","last_synced_at":"2024-09-30T09:18:58.944Z","etag":null,"topics":["brew-style","command-line","data","database","dataset","download","fetcher","linux","osx","public-data","unix"],"latest_commit_sha":null,"homepage":"https://vinzeebreak.github.io/dafter-loader/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/vhoulbreque.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-10-12T13:35:46.000Z","updated_at":"2024-02-19T09:12:51.000Z","dependencies_parsed_at":"2022-11-03T16:15:53.876Z","dependency_job_id":null,"html_url":"https://github.com/vhoulbreque/dafter","commit_stats":null,"previous_names":["vinzeebreak/dafter"],"tags_count":2,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vhoulbreque%2Fdafter","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vhoulbreque%2Fdafter/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vhoulbreque%2Fdafter/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vhoulbreque%2Fdafter/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/vhoulbreque","download_url":"https://codeload.github.com/vhoulbreque/dafter/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":221826271,"owners_count":16887130,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["brew-style","command-line","data","database","dataset","download","fetcher","linux","osx","public-data","unix"],"created_at":"2024-07-31T05:01:37.075Z","updated_at":"2024-10-28T12:30:21.759Z","avatar_url":"https://github.com/vhoulbreque.png","language":"Python","funding_links":[],"categories":["Python"],"sub_categories":[],"readme":"# dafter : the data fetcher\n\n![dafter-logo](docs/dafter_logo.png)\n\n## You have just found dafter.\n\nDafter is a command line downloader of public datasets. It takes care of downloading and formatting the datasets' files so that you can spend hours building models instead of looking for datasets and their urls.\n\n - [Install](#install-dafter)\n - [Commands](#commands)\n - [How to contribute](#how-to-contribute)\n\n\n## Install dafter\n\nTo install dafter, just do:\n```bash\npip install dafter\n```\n\n## Commands\n\nTo download the MNIST dataset:\n```bash\ndafter get mnist\n```\n\nTo delete MNIST from your machine:\n```bash\ndafter delete mnist\n```\n\nTo search among downloadable datasets:\n```bash\n# Search all available datasets\ndafter search\n# Search all available datasets that have the tags \"image\" and \"deep-learning\"\n# and whose name contains \"mni\"\ndafter search mni --tags image deep-learning\n```\n\nTo list all the datasets that have been downloaded and are stored on your machine:\n```bash\n# Lists all datasets in database\ndafter list\n# Lists all datasets in database that have the tag \"twitter\" and whose name\n# contains \"sentiment\"\ndafter list sentiment --tags twitter\n```\n\n## Update\n\nTo update `dafter`, do:\n```bash\npip install --upgrade dafter\n```\n\n## Uninstall\n\nTo uninstall `dafter`, do:\n\n```bash\npip uninstall dafter\n```\n\n## How to contribute?\n\n### Add a new dataset\n\nTo add a new dataset, just add a `json` file called `name-of-the-dataset.json` in the `datasets-configs` folder.\n\n```json\n{\n  \"name\": \"name-of-the-dataset\",\n  \"urls\": [\n    {\n      \"url\": \"https://site.com/file1.tar.gz\",\n      \"bytes\": 45221\n    },\n    {\n      \"url\": \"https://site.com/file2.tar.gz\",\n      \"bytes\": 1147803\n    }\n  ],\n  \"type\": \"tar.gz\",\n  \"tags\": [\"tag1\", \"tag2\", \"tag3\"],\n  \"description\": \"This is a description of the dataset\",\n  \"source\": \"https://site.com/\"\n}\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvhoulbreque%2Fdafter","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fvhoulbreque%2Fdafter","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvhoulbreque%2Fdafter/lists"}