{"id":18509086,"url":"https://github.com/unpingco/python-for-probability-statistics-and-machine-learning-2e","last_synced_at":"2025-04-06T08:14:05.751Z","repository":{"id":63585409,"uuid":"184765105","full_name":"unpingco/Python-for-Probability-Statistics-and-Machine-Learning-2E","owner":"unpingco","description":"Second edition of Springer Book Python for Probability, Statistics, and Machine Learning ","archived":false,"fork":false,"pushed_at":"2021-09-21T15:42:40.000Z","size":16802,"stargazers_count":355,"open_issues_count":0,"forks_count":122,"subscribers_count":13,"default_branch":"master","last_synced_at":"2025-03-30T07:08:57.046Z","etag":null,"topics":["machine-learning","probability-statistics","python-modules","springer-book-python"],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/unpingco.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-05-03T14:01:49.000Z","updated_at":"2025-03-21T13:13:19.000Z","dependencies_parsed_at":"2022-11-21T22:00:48.429Z","dependency_job_id":null,"html_url":"https://github.com/unpingco/Python-for-Probability-Statistics-and-Machine-Learning-2E","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/unpingco%2FPython-for-Probability-Statistics-and-Machine-Learning-2E","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/unpingco%2FPython-for-Probability-Statistics-and-Machine-Learning-2E/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/unpingco%2FPython-for-Probability-Statistics-and-Machine-Learning-2E/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/unpingco%2FPython-for-Probability-Statistics-and-Machine-Learning-2E/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/unpingco","download_url":"https://codeload.github.com/unpingco/Python-for-Probability-Statistics-and-Machine-Learning-2E/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247451665,"owners_count":20940944,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["machine-learning","probability-statistics","python-modules","springer-book-python"],"created_at":"2024-11-06T15:16:26.308Z","updated_at":"2025-04-06T08:14:05.677Z","avatar_url":"https://github.com/unpingco.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Python-for-Probability-Statistics-and-Machine-Learning-2E\n\n![Draft cover](./Python_probability_statistics_machine_learning_2E.png)\n\n\nSecond edition of [Springer text **Python for Probability, Statistics, and Machine Learning**](https://www.springer.com/gp/book/9783030185442)\n\nThis book, fully updated for Python version 3.6+, covers the key ideas that link probability, statistics, and machine learning illustrated using Python modules in these areas.  All the figures and numerical results are reproducible using the Python codes provided. The author develops key intuitions in machine learning by working meaningful examples using multiple analytical methods and Python codes, thereby connecting theoretical concepts to concrete implementations. Detailed proofs for certain important results are also provided. Modern Python modules like Pandas, Sympy, Scikit-learn, Tensorflow, and Keras are applied to simulate and visualize important machine learning concepts like the bias/variance trade-off, cross-validation, and regularization. Many abstract mathematical ideas, such as convergence in probability theory, are developed and illustrated with numerical examples. \n\nThis updated edition now includes the Fisher Exact Test and the Mann-Whitney-Wilcoxon Test. A new section on survival analysis has been included as well as substantial development of Generalized Linear Models. The new deep learning section for image processing includes an in-depth discussion of gradient descent methods that underpin all deep learning algorithms.   As with the prior edition, there are new and updated *Programming Tips* that the illustrate effective Python modules and methods for scientific programming and machine learning. There are 445 run-able code blocks with corresponding outputs that have been tested for accuracy.  Over 158 graphical visualizations (almost all generated using Python) illustrate the concepts that are developed both in code and in mathematics. We also discuss and use key Python modules such as Numpy, Scikit-learn, Sympy,  Scipy, Lifelines, CvxPy, Theano, Matplotlib, Pandas, Tensorflow, Statsmodels,  and Keras.\n\nThis book is suitable for anyone with an undergraduate-level exposure to probability, statistics, or machine learning and with rudimentary knowledge of Python programming.\n\n\n## Conda setup instructions\n\nIf you are using `conda`, you can get started by cloning this \nrepository and using the `environment.yaml` file as in the \nfollowing:\n\n    conda env create -n pyPSML -f environment.yaml\n\nand then activate the environment using the following,\n\n    conda activate pyPSML\n\nThen, you can run `jupyter notebook` and navigate the Jupyter\nnotebooks for the individual chapters. All of the notebooks are fully\nfunctional in this so-created environment. Note that there are\nembedded figures in the Jupyter notebooks that are meant to validate\nthe outputs of the Matplotlib codes therein.\n\n\n## Docker setup instructions\n\n\nIf you are using docker, there is a `Dockerfile` included. After cloning\nthis repository, you can build the image with the following,\n\n    docker build -t pypsml2e .\n\nand then run it locally using,\n\n    docker container run -it -p 8888:8888 pypsml2e\n\nThen, navigate to the output URL and you can explore the Jupyter\nnotebooks for each chapter. Alternately, if you don't want to build\nyour own image, you can do\n\n    docker run -p 8888:8888 unpingco/pypsml2e\n\nto get the docker image from\nhttps://hub.docker.com/r/unpingco/pypsml2e. Note that this may\n**not** be as up-to-date as building it yourself from this\nrepository, but should still work fine.\n\nYour comments (including errata) are welcome in the [Issues](https://github.com/unpingco/Python-for-Probability-Statistics-and-Machine-Learning-2E/issues) link above.\n\nGood luck! I hope you find these materials helpful.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Funpingco%2Fpython-for-probability-statistics-and-machine-learning-2e","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Funpingco%2Fpython-for-probability-statistics-and-machine-learning-2e","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Funpingco%2Fpython-for-probability-statistics-and-machine-learning-2e/lists"}