{"id":20768197,"url":"https://github.com/softwaremill/lemon-dataset","last_synced_at":"2026-02-07T16:32:29.947Z","repository":{"id":137884273,"uuid":"283124758","full_name":"softwaremill/lemon-dataset","owner":"softwaremill","description":"Lemons quality control dataset ","archived":false,"fork":false,"pushed_at":"2020-07-29T09:40:28.000Z","size":84485,"stargazers_count":104,"open_issues_count":0,"forks_count":14,"subscribers_count":4,"default_branch":"master","last_synced_at":"2025-07-23T05:44:05.513Z","etag":null,"topics":["dataset","lemonade","machine-learning","segmentation"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/softwaremill.png","metadata":{"files":{"readme":"README.MD","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-07-28T06:42:25.000Z","updated_at":"2025-03-17T15:03:48.000Z","dependencies_parsed_at":null,"dependency_job_id":"a7bd17da-ce48-486d-985b-3769eb3f880d","html_url":"https://github.com/softwaremill/lemon-dataset","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/softwaremill/lemon-dataset","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/softwaremill%2Flemon-dataset","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/softwaremill%2Flemon-dataset/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/softwaremill%2Flemon-dataset/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/softwaremill%2Flemon-dataset/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/softwaremill","download_url":"https://codeload.github.com/softwaremill/lemon-dataset/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/softwaremill%2Flemon-dataset/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29199788,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-07T16:28:23.579Z","status":"ssl_error","status_checked_at":"2026-02-07T16:28:22.566Z","response_time":63,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["dataset","lemonade","machine-learning","segmentation"],"created_at":"2024-11-17T11:36:21.354Z","updated_at":"2026-02-07T16:32:29.932Z","avatar_url":"https://github.com/softwaremill.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"![](docs/img/logo.png)\n\n# Lemons quality control dataset :lemon:\n[![DOI](https://zenodo.org/badge/283124758.svg)](https://zenodo.org/badge/latestdoi/283124758)\n[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)\n[![version](https://img.shields.io/badge/version-1.0.0-yellow.svg)](https://semver.org)\n\nLemon dataset has been prepared to investigate the possibilities to tackle the issue of fruit quality control. It contains 2690 annotated images (1056 x 1056 pixels). Raw lemon images have been captured using the procedure described in the following [blogpost](https://blog.softwaremill.com/when-life-gives-you-lemons-create-a-dataset-70522d6b1aa0) and manually annotated using [CVAT](https://github.com/opencv/cvat).\n\nHere's an example of raw unannotated data:\n\n![](docs/img/lemon-dataset-raw-sprite.png)\n\nand some annotated samples:\n\n![](docs/img/lemon-dataset-annotated-sprite.png)\n\n## Labels\n\n| Name               | Attribute       | Type             | Default | Description                                                                                                                                                                                                            | Example                                                                                                        |\n| ---                | ---             | ---              | ---     | ---                                                                                                                                                                                                                    | ---                                                                                                            |\n| condition          | healthy         | boolean          | true    | Determine whether the fruit is healthy. If not regions with identified issues are annotated                                                                                                                             | ![](docs/img/examples/healthy.png)                                                                             |\n| condition          | greening        | boolean          | false   | Determine whether the fruit contains areas that are not uniformly yellow and have green areas.                                                                                                                         | ![](docs/img/examples/greening.png)                                                                            |\n| image_quality      | blurry          | boolean          | false   | Fruit image is blurry                                                                                                                                                                                                  | ![](docs/img/examples/blurry.png)                                                                              |\n| image_quality      | cropped         | boolean          | false   | Not all fruit parts are on the image                                                                                                                                                                                   | ![](docs/img/examples/cropped.png)                                                                             |\n| image_quality      | unnatural_color | boolean          | false   | There are issues with color representation.                                                                                                                                                                            | ![](docs/img/examples/unnatural_color.png)                                                                     |\n| image_quality      | no_data         | boolean          | false   | There are black spots on the fruit image that do not contain data.                                                                                                                                                     | ![](docs/img/examples/no_data.png)                                                                             |\n| illness            | -               | region           | -       | -                                                                                                                                                                                                                      | ![](docs/img/examples/illness_1.png) ![](docs/img/examples/illness_2.png) ![](docs/img/examples/illness_3.png) |\n| gangrene           | -               | region           | -       | -                                                                                                                                                                                                                      | ![](docs/img/examples/gangrene.png)                                                                            |\n| mould              | -               | region           | -       | -                                                                                                                                                                                                                      | ![](docs/img/examples/mould.png)                                                                               |\n| blemish            | artificial      | region + boolean | -       | -                                                                                                                                                                                                                      | ![](docs/img/examples/blemish_1.png) ![](docs/img/examples/blemish_2.png)                                      |\n| dark_style_remains | -               | region           | -       | After pollination the remains of style are preserved in the fruit. A dark area around the remain of style indicates an unhealthy fruit. This place is the region from which the fruit starts rotting or catches mould. | ![](docs/img/examples/dark_style_remains_1.png) ![](docs/img/examples/dark_style_remains_2.png)                |\n| pedicel            | -               | region           | -       | Pedicel refers to a structure connecting a single flower to its inflorescence.                                                                                                                                         | ![](docs/img/examples/pedicel.png)                                                                             |\n| artifact           | -               | region           | -       | Image contains artifacts i.e. regions that are not related to a fruit and are a result of wrong image processing. Those regions should be identified and described.                                                    | ![](docs/img/examples/artifact.png)                                                                            |\n\n## File name\n\nYou will notice that file names are composed to form a specific identifier e.g.:\n`0037_G_I_120_A`: `0037` (individual fruit instance), `120` (relative photo angle), `A` (photo position). Some of them are restricted to the original project and cannot be published.\n\n## Download data\n\n| File                                    | Format | Version |\n| ---                                     | ---    | ---     |\n| [Lemon Dataset](data/lemon-dataset.zip) | COCO   | v1      |\n\n[COCO API](https://github.com/cocodataset/cocoapi) can be utilized to read the dataset.\n\n```python\nfrom pycocotools.coco import COCO\n\ncoco = COCO('../lemon-dataset/annotations/instances_default.json')\n```\n\n## Citing\nIf you use the lemons data set in a scientific publication, we would appreciate references to the following paper:\n\nBiblatex entry:\n```latex\n@misc{softwaremill_2020,\n  author       = {Maciej Adamiak},\n  title        = {Lemons quality control dataset},\n  institution  = {SoftwareMill},\n  month        = jul,\n  year         = 2020,\n  doi          = {10.5281/zenodo.3965568},\n  url          = {https://github.com/softwaremill/lemon-dataset}\n}\n```\n\n## License\n\nMIT License\n\nCopyright (c) 2020 SoftwareMill\n\nPermission is hereby granted, free of charge, to any person obtaining a copy\nof this software and associated documentation files (the \"Software\"), to deal\nin the Software without restriction, including without limitation the rights\nto use, copy, modify, merge, publish, distribute, sublicense, and/or sell\ncopies of the Software, and to permit persons to whom the Software is\nfurnished to do so, subject to the following conditions:\n\nThe above copyright notice and this permission notice shall be included in all\ncopies or substantial portions of the Software.\n\nTHE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\nIMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\nFITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\nAUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\nLIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\nOUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE\nSOFTWARE.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsoftwaremill%2Flemon-dataset","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsoftwaremill%2Flemon-dataset","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsoftwaremill%2Flemon-dataset/lists"}