{"id":13870050,"url":"https://github.com/openml/openml-python","last_synced_at":"2025-04-10T04:48:24.610Z","repository":{"id":15210624,"uuid":"17939040","full_name":"openml/openml-python","owner":"openml","description":"OpenML's Python API for a World of Data and More 💫","archived":false,"fork":false,"pushed_at":"2025-04-01T16:01:56.000Z","size":203877,"stargazers_count":291,"open_issues_count":129,"forks_count":147,"subscribers_count":21,"default_branch":"develop","last_synced_at":"2025-04-03T02:13:58.969Z","etag":null,"topics":["benchmarking","data","datascience","machine-learning","meta-learning","openml","python","tabular-data"],"latest_commit_sha":null,"homepage":"http://openml.github.io/openml-python/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/openml.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":"CITATION.cff","codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"github":"openml","open_collective":"openml"}},"created_at":"2014-03-20T10:46:41.000Z","updated_at":"2025-03-19T12:54:36.000Z","dependencies_parsed_at":"2023-01-14T00:30:12.396Z","dependency_job_id":"af1ceddf-aabd-4ca7-9f12-9f4af214acee","html_url":"https://github.com/openml/openml-python","commit_stats":{"total_commits":1282,"total_committers":52,"mean_commits":"24.653846153846153","dds":0.6801872074882995,"last_synced_commit":"d45cf37d6ec388d3032c3d6b2c505e110aca693e"},"previous_names":[],"tags_count":22,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/openml%2Fopenml-python","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/openml%2Fopenml-python/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/openml%2Fopenml-python/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/openml%2Fopenml-python/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/openml","download_url":"https://codeload.github.com/openml/openml-python/tar.gz/refs/heads/develop","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248161242,"owners_count":21057552,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["benchmarking","data","datascience","machine-learning","meta-learning","openml","python","tabular-data"],"created_at":"2024-08-05T20:01:26.814Z","updated_at":"2025-04-10T04:48:24.583Z","avatar_url":"https://github.com/openml.png","language":"Python","readme":"\n\n\u003cdiv align=\"center\"\u003e\n\n\u003cdiv id=\"user-content-toc\"\u003e\n  \u003cul align=\"center\" style=\"list-style: none;\"\u003e\n    \u003csummary\u003e\n      \u003cimg src=\"https://github.com/openml/openml.org/blob/master/app/public/static/svg/logo.svg\" width=\"50\" alt=\"OpenML Logo\"/\u003e \n      \u003ch1\u003eOpenML-Python\u003c/h1\u003e\n      \u003cimg src=\"https://github.com/openml/docs/blob/master/docs/img/python.png\" width=\"50\" alt=\"Python Logo\"/\u003e\n    \u003c/summary\u003e\n  \u003c/ul\u003e\n\u003c/div\u003e\n\n## The Python API for a World of Data and More :dizzy:\n\n[![Latest Release](https://img.shields.io/github/v/release/openml/openml-python)](https://github.com/openml/openml-python/releases)\n[![Python Versions](https://img.shields.io/badge/python-3.8%20%7C%203.9%20%7C%203.10%20%7C%203.11%20%7C%203.12%20%7C%203.13-blue)](https://pypi.org/project/openml/)\n[![Downloads](https://static.pepy.tech/badge/openml)](https://pepy.tech/project/openml)\n[![License](https://img.shields.io/badge/License-BSD%203--Clause-blue.svg)](https://opensource.org/licenses/BSD-3-Clause)\n\u003c!-- Add green badges for CI and precommit --\u003e\n\n[Installation](https://openml.github.io/openml-python/main/#how-to-get-openml-for-python) | [Documentation](https://openml.github.io/openml-python) | [Contribution guidelines](https://github.com/openml/openml-python/blob/develop/CONTRIBUTING.md)\n\u003c/div\u003e\n\nOpenML-Python provides an easy-to-use and straightforward Python interface for [OpenML](http://openml.org), an online platform for open science collaboration in machine learning.\nIt can download or upload data from OpenML, such as datasets and machine learning experiment results.\n\n## :joystick: Minimal Example\n\nUse the following code to get the [credit-g](https://www.openml.org/search?type=data\u0026sort=runs\u0026status=active\u0026id=31) [dataset](https://docs.openml.org/concepts/data/):\n\n```python\nimport openml\n\ndataset = openml.datasets.get_dataset(\"credit-g\") # or by ID get_dataset(31)\nX, y, categorical_indicator, attribute_names = dataset.get_data(target=\"class\")\n```\n\nGet a [task](https://docs.openml.org/concepts/tasks/) for [supervised classification on credit-g](https://www.openml.org/search?type=task\u0026id=31\u0026source_data.data_id=31):\n\n```python\nimport openml\n\ntask = openml.tasks.get_task(31)\ndataset = task.get_dataset()\nX, y, categorical_indicator, attribute_names = dataset.get_data(target=task.target_name)\n# get splits for the first fold of 10-fold cross-validation\ntrain_indices, test_indices = task.get_train_test_split_indices(fold=0)\n```\n\nUse an [OpenML benchmarking suite](https://docs.openml.org/concepts/benchmarking/) to get a curated list of machine-learning tasks:\n```python\nimport openml\n\nsuite = openml.study.get_suite(\"amlb-classification-all\")  # Get a curated list of tasks for classification\nfor task_id in suite.tasks:\n    task = openml.tasks.get_task(task_id)\n```\n\n## :magic_wand: Installation\n\nOpenML-Python is supported on Python 3.8 - 3.13 and is available on Linux, MacOS, and Windows.\n\nYou can install OpenML-Python with:\n\n```bash\npip install openml\n```\n\n## :page_facing_up: Citing OpenML-Python\n\nIf you use OpenML-Python in a scientific publication, we would appreciate a reference to the following paper:\n\n[Matthias Feurer, Jan N. van Rijn, Arlind Kadra, Pieter Gijsbers, Neeratyoy Mallik, Sahithya Ravi, Andreas Müller, Joaquin Vanschoren, Frank Hutter\u003cbr/\u003e\n**OpenML-Python: an extensible Python API for OpenML**\u003cbr/\u003e\nJournal of Machine Learning Research, 22(100):1−5, 2021](https://www.jmlr.org/papers/v22/19-920.html)\n\nBibtex entry:\n```bibtex\n@article{JMLR:v22:19-920,\n  author  = {Matthias Feurer and Jan N. van Rijn and Arlind Kadra and Pieter Gijsbers and Neeratyoy Mallik and Sahithya Ravi and Andreas Müller and Joaquin Vanschoren and Frank Hutter},\n  title   = {OpenML-Python: an extensible Python API for OpenML},\n  journal = {Journal of Machine Learning Research},\n  year    = {2021},\n  volume  = {22},\n  number  = {100},\n  pages   = {1--5},\n  url     = {http://jmlr.org/papers/v22/19-920.html}\n}\n```\n","funding_links":["https://github.com/sponsors/openml","https://opencollective.com/openml"],"categories":["Python"],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fopenml%2Fopenml-python","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fopenml%2Fopenml-python","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fopenml%2Fopenml-python/lists"}