https://github.com/pymc-devs/pymc

Bayesian Modeling and Probabilistic Programming in Python
https://github.com/pymc-devs/pymc

bayesian-inference mcmc probabilistic-programming pytensor python statistical-analysis variational-inference

Last synced: 15 days ago
JSON representation

Bayesian Modeling and Probabilistic Programming in Python

Host: GitHub
URL: https://github.com/pymc-devs/pymc
Owner: pymc-devs
License: other
Created: 2009-05-05T09:43:50.000Z (almost 17 years ago)
Default Branch: main
Last Pushed: 2026-02-08T22:26:34.000Z (27 days ago)
Last Synced: 2026-02-09T04:09:36.799Z (27 days ago)
Topics: bayesian-inference, mcmc, probabilistic-programming, pytensor, python, statistical-analysis, variational-inference
Language: Python
Homepage: https://www.pymc.io
Size: 516 MB
Stars: 9,479
Watchers: 214
Forks: 2,203
Open Issues: 463
Metadata Files:
- Readme: README.rst
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Citation: CITATION.cff
- Governance: GOVERNANCE.md

Awesome Lists containing this project

awesome-python-data-science - PyMC - Bayesian Stochastic Modelling in Python. (Probabilistic Methods / NLP)
AiTreasureBox - pymc-devs/pymc - 11-03_9336_0](https://img.shields.io/github/stars/pymc-devs/pymc.svg)|Bayesian Modeling and Probabilistic Programming in Python| (Repos)
awesome-sciml - pymc-devs/pymc: Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Aesara
awesome-list - PyMC - Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Aesara. (Linear Algebra / Statistics Toolkit / Statistical Toolkit)
bayesian-inference - PyMC
awesome-python-machine-learning-resources - GitHub - 6% open · ⏱️ 25.08.2022): (概率统计)
awesome-systematic-trading - PyMC - commit/statsmodels/statsmodels/main) ![GitHub Repo stars](https://img.shields.io/github/stars/pymc-devs/pymc?style=social) | Python | - Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Aesara (Basic Components / Fundamental libraries)
awesome-meteo - PyMC3
StarryDivineSky - pymc-devs/pymc - Turn Sampler，允许对具有数千个参数的复杂模型进行建模。PyMC 还支持变分推断，包括 ADVI 和 mini-batch ADVI，用于快速近似后验估计。它依赖于 PyTensor，提供计算优化、动态 C 或 JAX 编译、NumPy广播和高级索引以及线性代数运算符。PyMC 还透明地支持缺失值插补。例如，可以使用 PyMC 建立线性回归模型，以预测植物生长与阳光照射时间、浇水量和土壤氮含量之间的关系。 (其他_机器学习与深度学习)
awesome-machine-master - PyMC - Markov Chain Monte Carlo sampling toolkit. (Python / General-Purpose Machine Learning)
awesome-machine-learning - PyMC - Markov Chain Monte Carlo sampling toolkit. (Python / General-Purpose Machine Learning)
awesome-machine-learning - PyMC - Markov Chain Monte Carlo sampling toolkit. (Python / General-Purpose Machine Learning)
awesome-machine-learning - PyMC - Markov Chain Monte Carlo sampling toolkit. (Python / General-Purpose Machine Learning)
my-awesome-starred - pymc - PyMC: Bayesian Stochastic Modelling in Python (for PyMC3: https://github.com/pymc-devs/pymc3) (FORTRAN)
awesome-python-resources - GitHub - 6% open · ⏱️ 25.08.2022): (科学计算和数据分析)
awesome-data-analysis - PyMC - A probabilistic programming library for Python that allows for flexible Bayesian modeling. (🎲 Statistics & Probability / Tools)
awesome-systematic-trading - PyMC - devs/pymc) | ![made-with-python](https://img.shields.io/badge/Made%20with-Python-1f425f.svg) | (Data Science / Cryptocurrencies)
fucking-awesome-machine-learning - PyMC - Markov Chain Monte Carlo sampling toolkit. (Python / General-Purpose Machine Learning)
awesome-ai-for-science - PyMC - Probabilistic programming (💻 Computing Frameworks / Specialized Frameworks)
awesome-machine-learning - PyMC - Markov Chain Monte Carlo sampling toolkit. (Python / General-Purpose Machine Learning)
awesome-machine-learning - PyMC - Markov Chain Monte Carlo sampling toolkit. (Python / General-Purpose Machine Learning)
awesome-python-data-science - PyMC - Bayesian Stochastic Modelling in Python. (Probabilistic Methods / Others)
awesome-machine-learning - PyMC - Markov Chain Monte Carlo sampling toolkit. (Python / General-Purpose Machine Learning)
awesome-advanced-metering-infrastructure - PyMC - Markov Chain Monte Carlo sampling toolkit. (Python / General-Purpose Machine Learning)

README

          .. image:: https://cdn.rawgit.com/pymc-devs/pymc/main/docs/logos/svg/PyMC_banner.svg

    :height: 100px

    :alt: PyMC logo

    :align: center

|Build Status| |Coverage| |NumFOCUS_badge| |Binder| |Dockerhub| |DOIzenodo| |Conda Downloads|

PyMC (formerly PyMC3) is a Python package for Bayesian statistical modeling

focusing on advanced Markov chain Monte Carlo (MCMC) and variational inference (VI)

algorithms. Its flexibility and extensibility make it applicable to a

large suite of problems.

Check out the `PyMC overview `__,  or

one of `the many examples `__!

For questions on PyMC, head on over to our `PyMC Discourse `__ forum.

Features

========

-  Intuitive model specification syntax, for example, ``x ~ N(0,1)``

   translates to ``x = Normal('x',0,1)``

-  **Powerful sampling algorithms**, such as the `No U-Turn

   Sampler `__, allow complex models

   with thousands of parameters with little specialized knowledge of

   fitting algorithms.

-  **Variational inference**: `ADVI `__

   for fast approximate posterior estimation as well as mini-batch ADVI

   for large data sets.

-  Relies on `PyTensor `__ which provides:

    *  Computation optimization and dynamic C or JAX compilation

    *  NumPy broadcasting and advanced indexing

    *  Linear algebra operators

    *  Simple extensibility

-  Transparent support for missing value imputation

Linear Regression Example

==========================

Plant growth can be influenced by multiple factors, and understanding these relationships is crucial for optimizing agricultural practices.

Imagine we conduct an experiment to predict the growth of a plant based on different environmental variables.

.. code-block:: python

   import pymc as pm

   # Taking draws from a normal distribution

   seed = 42

   x_dist = pm.Normal.dist(shape=(100, 3))

   x_data = pm.draw(x_dist, random_seed=seed)

   # Independent Variables:

   # Sunlight Hours: Number of hours the plant is exposed to sunlight daily.

   # Water Amount: Daily water amount given to the plant (in milliliters).

   # Soil Nitrogen Content: Percentage of nitrogen content in the soil.

   # Dependent Variable:

   # Plant Growth (y): Measured as the increase in plant height (in centimeters) over a certain period.

   # Define coordinate values for all dimensions of the data

   coords={

    "trial": range(100),

    "features": ["sunlight hours", "water amount", "soil nitrogen"],

   }

   # Define generative model

   with pm.Model(coords=coords) as generative_model:

      x = pm.Data("x", x_data, dims=["trial", "features"])

      # Model parameters

      betas = pm.Normal("betas", dims="features")

      sigma = pm.HalfNormal("sigma")

      # Linear model

      mu = x @ betas

      # Likelihood

      # Assuming we measure deviation of each plant from baseline

      plant_growth = pm.Normal("plant growth", mu, sigma, dims="trial")

   # Generating data from model by fixing parameters

   fixed_parameters = {

    "betas": [5, 20, 2],

    "sigma": 0.5,

   }

   with pm.do(generative_model, fixed_parameters) as synthetic_model:

      idata = pm.sample_prior_predictive(random_seed=seed) # Sample from prior predictive distribution.

      synthetic_y = idata.prior["plant growth"].sel(draw=0, chain=0)

   # Infer parameters conditioned on observed data

   with pm.observe(generative_model, {"plant growth": synthetic_y}) as inference_model:

      idata = pm.sample(random_seed=seed)

      summary = pm.stats.summary(idata, var_names=["betas", "sigma"])

      print(summary)

From the summary, we can see that the mean of the inferred parameters are very close to the fixed parameters

=====================  ======  =====  ========  =========  ===========  =========  ==========  ==========  =======

Params                  mean     sd    hdi_3%    hdi_97%    mcse_mean    mcse_sd    ess_bulk    ess_tail    r_hat

=====================  ======  =====  ========  =========  ===========  =========  ==========  ==========  =======

betas[sunlight hours]   4.972  0.054     4.866      5.066        0.001      0.001        3003        1257        1

betas[water amount]    19.963  0.051    19.872     20.062        0.001      0.001        3112        1658        1

betas[soil nitrogen]    1.994  0.055     1.899      2.107        0.001      0.001        3221        1559        1

sigma                   0.511  0.037     0.438      0.575        0.001      0            2945        1522        1

=====================  ======  =====  ========  =========  ===========  =========  ==========  ==========  =======

.. code-block:: python

   # Simulate new data conditioned on inferred parameters

   new_x_data = pm.draw(

      pm.Normal.dist(shape=(3, 3)),

      random_seed=seed,

   )

   new_coords = coords | {"trial": [0, 1, 2]}

   with inference_model:

      pm.set_data({"x": new_x_data}, coords=new_coords)

      pm.sample_posterior_predictive(

         idata,

         predictions=True,

         extend_inferencedata=True,

         random_seed=seed,

      )

   pm.stats.summary(idata.predictions, kind="stats")

The new data conditioned on inferred parameters would look like:

================ ======== ======= ======== =========

Output            mean     sd      hdi_3%   hdi_97%

================ ======== ======= ======== =========

plant growth[0]   14.229   0.515   13.325   15.272

plant growth[1]   24.418   0.511   23.428   25.326

plant growth[2]   -6.747   0.511   -7.740   -5.797

================ ======== ======= ======== =========

.. code-block:: python

   # Simulate new data, under a scenario where the first beta is zero

   with pm.do(

    inference_model,

    {inference_model["betas"]: inference_model["betas"] * [0, 1, 1]},

   ) as plant_growth_model:

      new_predictions = pm.sample_posterior_predictive(

         idata,

         predictions=True,

         random_seed=seed,

      )

   pm.stats.summary(new_predictions, kind="stats")

The new data, under the above scenario would look like:

================ ======== ======= ======== =========

Output            mean     sd      hdi_3%   hdi_97%

================ ======== ======= ======== =========

plant growth[0]   12.149   0.515   11.193   13.135

plant growth[1]   29.809   0.508   28.832   30.717

plant growth[2]   -0.131   0.507   -1.121    0.791

================ ======== ======= ======== =========

Getting started

===============

If you already know about Bayesian statistics:

----------------------------------------------

-  `API quickstart guide `__

-  The `PyMC tutorial `__

-  `PyMC examples `__ and the `API reference `__

Learn Bayesian statistics with a book together with PyMC

--------------------------------------------------------

-  `Bayesian Analysis with Python  `__ (third edition) by Osvaldo Martin: Great introductory book.

-  `Probabilistic Programming and Bayesian Methods for Hackers `__: Fantastic book with many applied code examples.

-  `PyMC port of the book "Doing Bayesian Data Analysis" by John Kruschke `__ as well as the `first edition `__.

-  `PyMC port of the book "Statistical Rethinking A Bayesian Course with Examples in R and Stan" by Richard McElreath `__

-  `PyMC port of the book "Bayesian Cognitive Modeling" by Michael Lee and EJ Wagenmakers `__: Focused on using Bayesian statistics in cognitive modeling.

See also the section on books using PyMC on `our website `__.

Audio & Video

-------------

- Here is a `YouTube playlist `__ gathering several talks on PyMC.

- You can also find all the talks given at **PyMCon 2020** `here `__.

- The `"Learning Bayesian Statistics" podcast `__ helps you discover and stay up-to-date with the vast Bayesian community. Bonus: it's hosted by Alex Andorra, one of the PyMC core devs!

Installation

============

To install PyMC on your system, follow the instructions on the `installation guide `__.

Citing PyMC

===========

Please choose from the following:

- |DOIpaper| *PyMC: A Modern and Comprehensive Probabilistic Programming Framework in Python*, Abril-Pla O, Andreani V, Carroll C, Dong L, Fonnesbeck CJ, Kochurov M, Kumar R, Lao J, Luhmann CC, Martin OA, Osthege M, Vieira R, Wiecki T, Zinkov R. (2023)

  - BibTex version

    .. code:: bibtex

       @article{pymc2023,

         title = {{PyMC}: A Modern and Comprehensive Probabilistic Programming Framework in {P}ython},

         author = {Oriol Abril-Pla and Virgile Andreani and Colin Carroll and Larry Dong and Christopher J. Fonnesbeck and Maxim Kochurov and Ravin Kumar and Junpeng Lao and Christian C. Luhmann and Osvaldo A. Martin and Michael Osthege and Ricardo Vieira and Thomas Wiecki and Robert Zinkov },

         journal = {{PeerJ} Computer Science},

         volume = {9},

         number = {e1516},

         doi = {10.7717/peerj-cs.1516},

         year = {2023}

       }

- |DOIzenodo| A DOI for all versions.

- DOIs for specific versions are shown on Zenodo and under `Releases `_

.. |DOIpaper| image:: https://img.shields.io/badge/DOI-10.7717%2Fpeerj--cs.1516-blue.svg

     :target: https://doi.org/10.7717/peerj-cs.1516

.. |DOIzenodo| image:: https://zenodo.org/badge/DOI/10.5281/zenodo.4603970.svg

   :target: https://doi.org/10.5281/zenodo.4603970

Contact

=======

We are using `discourse.pymc.io `__ as our main communication channel.

To ask a question regarding modeling or usage of PyMC we encourage posting to our Discourse forum under the `“Questions” Category `__. You can also suggest feature in the `“Development” Category `__.

Requests for non-technical information about the project are also welcome on Discourse,

we also use Discourse internally for general announcements or governance related processes.

You can also follow us on these social media platforms for updates and other announcements:

- `LinkedIn @pymc `__

- `YouTube @PyMCDevelopers `__

- `X @pymc_devs `__

- `Mastodon @pymc@bayes.club `__

To report an issue with PyMC please use the `issue tracker `__.

License

=======

`Apache License, Version

2.0 `__

Software using PyMC

===================

General purpose

---------------

- `Bambi `__: BAyesian Model-Building Interface (BAMBI) in Python.

- `calibr8 `__: A toolbox for constructing detailed observation models to be used as likelihoods in PyMC.

- `gumbi `__: A high-level interface for building GP models.

- `SunODE `__: Fast ODE solver, much faster than the one that comes with PyMC.

- `pymc-learn `__: Custom PyMC models built on top of pymc3_models/scikit-learn API

Domain specific

---------------

- `Exoplanet `__: a toolkit for modeling of transit and/or radial velocity observations of exoplanets and other astronomical time series.

- `beat `__: Bayesian Earthquake Analysis Tool.

- `CausalPy `__: A package focusing on causal inference in quasi-experimental settings.

- `PyMC-Marketing `__: Bayesian marketing toolbox for marketing mix modeling, customer lifetime value, and more.

See also the `ecosystem page `__ on our website. Please contact us if your software is not listed here.

Papers citing PyMC

==================

See Google Scholar `here `__ and `here `__ for a continuously updated list.

Contributors

============

The `GitHub contributor page `__ shows the people who have added content to this repo

which includes a large portion of contributors to the PyMC project but not all of them. Other

contributors have added content to other repos of the ``pymc-devs`` GitHub organization or have contributed

through other project spaces outside of GitHub like `our Discourse forum `__.

If you are interested in contributing yourself, read our `Code of Conduct `__

and `Contributing guide `__.

Support

=======

PyMC is a non-profit project under NumFOCUS umbrella. If you want to support PyMC financially, you can donate `here `__.

Professional Consulting Support

===============================

You can get professional consulting support from `PyMC Labs `__.

Sponsors

========

|NumFOCUS|

|PyMCLabs|

|OpenWoundResearch|

Thanks to our contributors

==========================

|contributors|

.. |Binder| image:: https://mybinder.org/badge_logo.svg

   :target: https://mybinder.org/v2/gh/pymc-devs/pymc/main?filepath=%2Fdocs%2Fsource%2Fnotebooks

.. |Build Status| image:: https://github.com/pymc-devs/pymc/workflows/tests/badge.svg

   :target: https://github.com/pymc-devs/pymc/actions?query=workflow%3Atests+branch%3Amain

.. |Coverage| image:: https://codecov.io/gh/pymc-devs/pymc/branch/main/graph/badge.svg

   :target: https://codecov.io/gh/pymc-devs/pymc

.. |Dockerhub| image:: https://img.shields.io/docker/automated/pymc/pymc.svg

   :target: https://hub.docker.com/r/pymc/pymc

.. |NumFOCUS_badge| image:: https://img.shields.io/badge/powered%20by-NumFOCUS-orange.svg?style=flat&colorA=E1523D&colorB=007D8A

   :target: http://www.numfocus.org/

.. |NumFOCUS| image:: https://github.com/pymc-devs/brand/blob/main/sponsors/sponsor_logos/sponsor_numfocus.png?raw=true

   :target: http://www.numfocus.org/

.. |PyMCLabs| image:: https://github.com/pymc-devs/brand/blob/main/sponsors/sponsor_logos/sponsor_pymc_labs.png?raw=true

   :target: https://pymc-labs.com

.. |OpenWoundResearch| image:: https://github.com/pymc-devs/brand/blob/main/sponsors/sponsor_logos/owr/sponsor_owr.png?raw=true

   :target: https://www.openwoundresearch.com/

.. |contributors| image:: https://contrib.rocks/image?repo=pymc-devs/pymc

   :target: https://github.com/pymc-devs/pymc/graphs/contributors

.. |Conda Downloads| image:: https://anaconda.org/conda-forge/pymc/badges/downloads.svg

   :target: https://anaconda.org/conda-forge/pymc

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/pymc-devs/pymc

Awesome Lists containing this project

README