{"id":19448634,"url":"https://github.com/neurodata/llf_tidy_images","last_synced_at":"2026-04-19T15:06:17.615Z","repository":{"id":140562630,"uuid":"245201280","full_name":"neurodata/LLF_tidy_images","owner":"neurodata","description":"Clean image data for use in LifeLong Forests experiments","archived":false,"fork":false,"pushed_at":"2020-05-08T14:30:26.000Z","size":3674,"stargazers_count":2,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-02-25T08:54:13.949Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/neurodata.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-03-05T15:36:06.000Z","updated_at":"2021-09-18T15:35:24.000Z","dependencies_parsed_at":null,"dependency_job_id":"1d1c3525-45e0-4d4c-b735-5b6202e37eb8","html_url":"https://github.com/neurodata/LLF_tidy_images","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/neurodata/LLF_tidy_images","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/neurodata%2FLLF_tidy_images","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/neurodata%2FLLF_tidy_images/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/neurodata%2FLLF_tidy_images/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/neurodata%2FLLF_tidy_images/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/neurodata","download_url":"https://codeload.github.com/neurodata/LLF_tidy_images/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/neurodata%2FLLF_tidy_images/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32010977,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-18T20:23:30.271Z","status":"online","status_checked_at":"2026-04-19T02:00:07.110Z","response_time":55,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-10T16:27:55.465Z","updated_at":"2026-04-19T15:06:17.592Z","avatar_url":"https://github.com/neurodata.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# LLF_tidy_images\n\nWe assume that the below datasets have been downloaded to `../datasets/` relative\nto the root of this git repository and stored in individual folders\nappropriately named.\n\n## Datasets\n\n\n| Dataset Name | Source | Classes |\n|:-------------|:-------|:--------|\n| Food-101 | [Link](https://data.vision.ee.ethz.ch/cvl/datasets_extra/food-101/) | 101 |\n| CIFAR-10     | [Link](https://www.cs.toronto.edu/~kriz/cifar.html) | 10 | \n| CIFAR-100    | [Link](https://www.cs.toronto.edu/~kriz/cifar.html) | 100 |\n| BirdSnap     | [Link](http://thomasberg.org)  | 500 | \n| SUN397 | [Link](https://vision.princeton.edu/projects/2010/SUN/) | 397 | \n| Stanford Cars| [Link](http://ai.stanford.edu/~jkrause/cars/car_dataset.html) | 196 |\n| FGVC Aircraft| [Link](https://www.robots.ox.ac.uk/~vgg/data/fgvc-aircraft/) | 100 |\n| PASCAL VOC 2007 | [Link](http://host.robots.ox.ac.uk/pascal/VOC/voc2007/) | 20 |\n| DTD | [Link](http://www.robots.ox.ac.uk/~vgg/data/dtd/) | 47 | \n| Pets | [Link](http://www.robots.ox.ac.uk/~vgg/data/pets/) | 37 |\n| Caltech-101 | [Link](http://www.vision.caltech.edu/Image_Datasets/Caltech101/Caltech101.html) | 101\n| Flowers | [Link](https://www.robots.ox.ac.uk/~vgg/data/flowers/102/index.html) | 102 |\n\n\n\nThe total number of images should be 312,252\n\n\n| Dataset Name  | train|  test| classes| Labels |\n|:--------------|-----:|-----:|-------:|-------:|\n|birdsnap       | 47386|  2443| 500    | All    |\n|cifar10        | 50000| 10000| 10     | All    |\n|cifar100       | 50000| 10000| 100    | All    |\n|DTD            |  3760|  1880| 47     | All    |\n|FGVC_Aircraft  |  6667|  3333| 100    | All    |\n|flowers        |  2040|  6149| 102    | All    |\n|stanford-cars  |  8144|  8041| 196    | All    |\n|Oxford_Pets    |  3680|  3369| 37     | All    |\n|food-101       | 75750| 25250| 101    | All    |\n|PASCAL_VOC2007 |  5011|  4952| 20     | All    |\n|Caltech-101    |  3060|  6084| 102    | All    |\n|SUN397         | 19850| 19850| 397    | ALL    |\n\n\n# Downloading\n\nThe links to all of the datasets are save in the `download.sh` script\nwhich sets up the proper folders and downloads and unpacks the datasets.\n\nRun with `sh download.sh`\n\n\n# Image Pre-processing\n\nSee [preprocessing.md](./preprocessing.md)\n\n\n# Output labels\n\nThen to create the csv file that holds all of the file paths and labels\nfor all of the images across all of the datasets run\n\n```\npython3 tidyLabels.py\n```\n\n\n\n\n---\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fneurodata%2Fllf_tidy_images","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fneurodata%2Fllf_tidy_images","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fneurodata%2Fllf_tidy_images/lists"}