https://github.com/datasets/breast-cancer
Breast cancer occurrences.
https://github.com/datasets/breast-cancer
Last synced: about 1 year ago
JSON representation
Breast cancer occurrences.
- Host: GitHub
- URL: https://github.com/datasets/breast-cancer
- Owner: datasets
- Created: 2018-01-04T11:24:07.000Z (over 8 years ago)
- Default Branch: main
- Last Pushed: 2024-10-25T14:25:32.000Z (over 1 year ago)
- Last Synced: 2025-06-22T13:08:25.963Z (about 1 year ago)
- Language: Python
- Homepage: https://datahub.io/machine-learning/breast-cancer
- Size: 10.7 KB
- Stars: 24
- Watchers: 10
- Forks: 70
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
This is a dataset about breast cancer occurrences.
## Data
This dataset is taken from [OpenML - breast-cancer](https://www.openml.org/d/13)
This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.
Please include this citation if you plan to use this database.
Matjaz Zwitter & Milan Soklic (physicians) Institute of Oncology University Medical Center Ljubljana, Yugoslavia -- Donors: Ming Tan and Jeff Schlimmer (Jeffrey.Schlimmer@a.gp.cs.cmu.edu) -- Date: 11 July 1988.
* 286 instances
* 10 attributes
* Missing values: yes
Class Distribution:
* no-recurrence-events: 201 instances
* recurrence-events: 85 instances
### Output data
Output data is located in directory `data`
`data/breast-cancer.csv`
## Scripts
Scripts for dataset are located in directory `scripts`
`scripts/main.py`
## Licence
Licensed under the [Public Domain Dedication and License][pddl] (assuming
either no rights or public domain license in source data).
[pddl]: http://opendatacommons.org/licenses/pddl/1.0/