{"id":13422533,"url":"https://jakevdp.github.io/PythonDataScienceHandbook/","last_synced_at":"2025-03-15T12:30:45.197Z","repository":{"id":37359336,"uuid":"65388917","full_name":"jakevdp/PythonDataScienceHandbook","owner":"jakevdp","description":"Python Data Science Handbook: full text in Jupyter Notebooks","archived":false,"fork":false,"pushed_at":"2024-04-23T02:48:00.000Z","size":43642,"stargazers_count":41685,"open_issues_count":218,"forks_count":17612,"subscribers_count":1779,"default_branch":"master","last_synced_at":"2024-05-23T02:42:47.134Z","etag":null,"topics":["jupyter-notebook","matplotlib","numpy","pandas","python","scikit-learn"],"latest_commit_sha":null,"homepage":"http://jakevdp.github.io/PythonDataScienceHandbook","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/jakevdp.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE-CODE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2016-08-10T14:24:36.000Z","updated_at":"2024-07-30T21:38:02.622Z","dependencies_parsed_at":"2024-07-30T21:53:55.734Z","dependency_job_id":null,"html_url":"https://github.com/jakevdp/PythonDataScienceHandbook","commit_stats":{"total_commits":207,"total_committers":18,"mean_commits":11.5,"dds":"0.14492753623188404","last_synced_commit":"d66231454ef753818dc9213c9b5942e067266966"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jakevdp%2FPythonDataScienceHandbook","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jakevdp%2FPythonDataScienceHandbook/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jakevdp%2FPythonDataScienceHandbook/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jakevdp%2FPythonDataScienceHandbook/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/jakevdp","download_url":"https://codeload.github.com/jakevdp/PythonDataScienceHandbook/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243526885,"owners_count":20305109,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["jupyter-notebook","matplotlib","numpy","pandas","python","scikit-learn"],"created_at":"2024-07-30T23:00:47.292Z","updated_at":"2025-03-15T12:30:41.725Z","avatar_url":"https://github.com/jakevdp.png","language":"Jupyter Notebook","readme":"# Python Data Science Handbook\n\n[![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jakevdp/PythonDataScienceHandbook/master?filepath=notebooks%2FIndex.ipynb)\n[![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/jakevdp/PythonDataScienceHandbook/blob/master/notebooks/Index.ipynb)\n\nThis repository contains the entire [Python Data Science Handbook](http://shop.oreilly.com/product/0636920034919.do), in the form of (free!) Jupyter notebooks.\n\n![cover image](notebooks/figures/PDSH-cover.png)\n\n## How to Use this Book\n\n- Read the book in its entirety online at https://jakevdp.github.io/PythonDataScienceHandbook/\n\n- Run the code using the Jupyter notebooks available in this repository's [notebooks](notebooks) directory.\n\n- Launch executable versions of these notebooks using [Google Colab](http://colab.research.google.com): [![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/jakevdp/PythonDataScienceHandbook/blob/master/notebooks/Index.ipynb)\n\n- Launch a live notebook server with these notebooks using [binder](https://beta.mybinder.org/): [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jakevdp/PythonDataScienceHandbook/master?filepath=notebooks%2FIndex.ipynb)\n\n- Buy the printed book through [O'Reilly Media](http://shop.oreilly.com/product/0636920034919.do)\n\n## About\n\nThe book was written and tested with Python 3.5, though other Python versions (including Python 2.7) should work in nearly all cases.\n\nThe book introduces the core libraries essential for working with data in Python: particularly [IPython](http://ipython.org), [NumPy](http://numpy.org), [Pandas](http://pandas.pydata.org), [Matplotlib](http://matplotlib.org), [Scikit-Learn](http://scikit-learn.org), and related packages.\nFamiliarity with Python as a language is assumed; if you need a quick introduction to the language itself, see the free companion project,\n[A Whirlwind Tour of Python](https://github.com/jakevdp/WhirlwindTourOfPython): it's a fast-paced introduction to the Python language aimed at researchers and scientists.\n\nSee [Index.ipynb](http://nbviewer.jupyter.org/github/jakevdp/PythonDataScienceHandbook/blob/master/notebooks/Index.ipynb) for an index of the notebooks available to accompany the text.\n\n## Software\n\nThe code in the book was tested with Python 3.5, though most (but not all) will also work correctly with Python 2.7 and other older Python versions.\n\nThe packages I used to run the code in the book are listed in [requirements.txt](requirements.txt) (Note that some of these exact version numbers may not be available on your platform: you may have to tweak them for your own use).\nTo install the requirements using [conda](http://conda.pydata.org), run the following at the command-line:\n\n```\n$ conda install --file requirements.txt\n```\n\nTo create a stand-alone environment named ``PDSH`` with Python 3.5 and all the required package versions, run the following:\n\n```\n$ conda create -n PDSH python=3.5 --file requirements.txt\n```\n\nYou can read more about using conda environments in the [Managing Environments](http://conda.pydata.org/docs/using/envs.html) section of the conda documentation.\n\n\n## License\n\n### Code\nThe code in this repository, including all code samples in the notebooks listed above, is released under the [MIT license](LICENSE-CODE). Read more at the [Open Source Initiative](https://opensource.org/licenses/MIT).\n\n### Text\nThe text content of the book is released under the [CC-BY-NC-ND license](LICENSE-TEXT). Read more at [Creative Commons](https://creativecommons.org/licenses/by-nc-nd/3.0/us/legalcode).\n","funding_links":[],"categories":["Books and Academic Journals","Topics","📊 Data Science and Machine Learning","Literature and Media",":octocat: GitHub Repositories","📖 Computer Books","Interesting Non EO parts Python","Data Science","Learning Resources","Other courses","Misc","Lists of libraries","Uncategorized","Table of Contents","Tutorial"],"sub_categories":["📖 Data Science","Books","GEDI","Python","Guides","Take my tips with a grain of salt","Design Tools","Mathematics","Uncategorized","Free Books","Machine Learning"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/jakevdp.github.io%2FPythonDataScienceHandbook%2F","html_url":"https://awesome.ecosyste.ms/projects/jakevdp.github.io%2FPythonDataScienceHandbook%2F","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/jakevdp.github.io%2FPythonDataScienceHandbook%2F/lists"}