{"id":19015227,"url":"https://github.com/sdv-dev/copulas","last_synced_at":"2026-02-05T20:13:37.136Z","repository":{"id":38356081,"uuid":"110606051","full_name":"sdv-dev/Copulas","owner":"sdv-dev","description":"A library to model multivariate data using copulas.","archived":false,"fork":false,"pushed_at":"2025-04-23T21:33:09.000Z","size":28854,"stargazers_count":584,"open_issues_count":53,"forks_count":116,"subscribers_count":20,"default_branch":"main","last_synced_at":"2025-04-23T22:29:05.375Z","etag":null,"topics":["copulas","data-generation","generative-ai","generative-model","machine-learning","synthetic-data","synthetic-data-generation","tabular-data"],"latest_commit_sha":null,"homepage":"https://sdv.dev/Copulas/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/sdv-dev.png","metadata":{"files":{"readme":"README.md","changelog":"HISTORY.md","contributing":"CONTRIBUTING.rst","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":".github/CODEOWNERS","security":null,"support":null,"governance":null,"roadmap":null,"authors":"AUTHORS.rst","dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-11-13T21:45:29.000Z","updated_at":"2025-04-15T01:45:24.000Z","dependencies_parsed_at":"2023-02-12T07:01:42.973Z","dependency_job_id":"309180f6-88d4-4d23-aff9-e8d880f38038","html_url":"https://github.com/sdv-dev/Copulas","commit_stats":{"total_commits":686,"total_committers":46,"mean_commits":14.91304347826087,"dds":0.7201166180758017,"last_synced_commit":"827851fbcbf4f024531e7883e18dc8220d32c812"},"previous_names":[],"tags_count":29,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sdv-dev%2FCopulas","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sdv-dev%2FCopulas/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sdv-dev%2FCopulas/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/sdv-dev%2FCopulas/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/sdv-dev","download_url":"https://codeload.github.com/sdv-dev/Copulas/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":254069476,"owners_count":22009557,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["copulas","data-generation","generative-ai","generative-model","machine-learning","synthetic-data","synthetic-data-generation","tabular-data"],"created_at":"2024-11-08T19:36:16.818Z","updated_at":"2026-02-05T20:13:37.088Z","avatar_url":"https://github.com/sdv-dev.png","language":"Python","readme":"\u003cp style=\"text-align:center\"\u003e\n    \u003ci\u003eThis repository is part of \u003ca href=\"https://sdv.dev\"\u003eThe Synthetic Data Vault Project\u003c/a\u003e, a project from \u003ca href=\"https://datacebo.com\"\u003eDataCebo\u003c/a\u003e.\u003c/i\u003e\n\u003c/p\u003e\n\n[![Development Status](https://img.shields.io/badge/Development%20Status-2%20--%20Pre--Alpha-yellow)](https://pypi.org/search/?c=Development+Status+%3A%3A+2+-+Pre-Alpha)\n[![PyPi Shield](https://img.shields.io/pypi/v/copulas.svg)](https://pypi.python.org/pypi/copulas)\n[![Downloads](https://pepy.tech/badge/copulas)](https://pepy.tech/project/copulas)\n[![Unit Tests](https://github.com/sdv-dev/Copulas/actions/workflows/unit.yml/badge.svg)](https://github.com/sdv-dev/Copulas/actions/workflows/unit.yml)\n[![Coverage Status](https://codecov.io/gh/sdv-dev/Copulas/branch/main/graph/badge.svg)](https://codecov.io/gh/sdv-dev/Copulas)\n[![Slack](https://img.shields.io/badge/Community-Slack-blue?style=plastic\u0026logo=slack)](https://bit.ly/sdv-slack-invite)\n\n\u003cbr/\u003e\n\u003cp align=\"center\" style=\"text-align:center\"\u003e\n\u003ca href=\"https://github.com/sdv-dev/Copulas\"\u003e\n\u003cimg width=40% src=\"https://github.com/sdv-dev/SDV/blob/stable/docs/images/Copulas-DataCebo.png?raw=true\"\u003e\u003c/img\u003e\n\u003c/a\u003e\n\u003c/p\u003e\n\n# Overview\n\n**Copulas** is a Python library for modeling multivariate distributions and sampling from them\nusing copula functions.\nGiven a table of numerical data, use Copulas to learn the distribution and\ngenerate new synthetic data following the same statistical properties.\n\n**Key Features:**\n\n* **Model multivariate data.** Choose from a variety of univariate\ndistributions and copulas – including Archimedian Copulas, Gaussian Copulas and Vine Copulas.\n\n* **Compare real and synthetic data visually** after building your model. Visualizations\nare available as 1D histograms, 2D scatterplots and 3D scatterplots.\n\n* **Access \u0026 manipulate learned parameters.** With complete access to the internals\nof the model, set or tune parameters to your choosing.\n\n# Install\n\nInstall the Copulas library using pip or conda.\n\n```bash\npip install copulas\n```\n\n```bash\nconda install -c conda-forge copulas\n```\n\n# Usage\n\nGet started using a demo dataset. This dataset contains 3 numerical columns.\n\n```python\nfrom copulas.datasets import sample_trivariate_xyz\n\nreal_data = sample_trivariate_xyz()\nreal_data.head()\n```\n\n\u003cimg src=\"docs/images/copulas_sample_dataset.png\" width=\"300\"\u003e\n\nModel the data using a copula and use it to create synthetic data.\nThe Copulas library offers many options including Gaussian Copula,\nVine Copulas and Archimedian Copulas.\n\n```python\nfrom copulas.multivariate import GaussianMultivariate\n\ncopula = GaussianMultivariate()\ncopula.fit(real_data)\n\nsynthetic_data = copula.sample(len(real_data))\n```\n\nVisualize the real and synthetic data side-by-side. Let's do this in 3D so see our full dataset.\n\n```python\nfrom copulas.visualization import compare_3d\n\ncompare_3d(real_data, synthetic_data)\n```\n\n![Quickstart](docs/images/quickstart.png)\n\n# Tutorials\nClick below to run the code yourself on a Colab Notebook and discover new features.\n\n[![Tutorial Notebook](https://img.shields.io/badge/Tutorial-Colab-F9AB00?style=for-the-badge\u0026logo=googlecolab\u0026color=525252)](https://bit.ly/copulas-demo)\n\n# Community \u0026 Support\n\nLearn more about Copulas library from our [documentation](https://sdv.dev/Copulas/) site.\n\n**Questions or issues?** Join our [Slack channel](https://bit.ly/sdv-slack-invite)\nto discuss more about Copulas and synthetic data.\nIf you find a bug or have a feature request, you can also\n[open an issue](https://github.com/sdv-dev/Copulas/issues/new/choose) on our GitHub.\n\n**Interested in contributing to Copulas?** Read our\n[Contribution Guide](https://sdv.dev/Copulas/contributing.html) to get started.\n\n# Credits\n\nThe Copulas open source project first started at the Data to AI Lab at MIT in 2018.\nThank you to our team of contributors who have built and maintained the library over the years!\n\n[View Contributors](https://github.com/sdv-dev/Copulas/graphs/contributors)\n\n---\n\n\n\u003cdiv align=\"center\"\u003e\n\u003ca href=\"https://datacebo.com\"\u003e\u003cimg align=\"center\" width=40% src=\"https://github.com/sdv-dev/SDV/blob/stable/docs/images/DataCebo.png\"\u003e\u003c/img\u003e\u003c/a\u003e\n\u003c/div\u003e\n\u003cbr/\u003e\n\u003cbr/\u003e\n\n[The Synthetic Data Vault Project](https://sdv.dev) was first created at MIT's [Data to AI Lab](\nhttps://dai.lids.mit.edu/) in 2016. After 4 years of research and traction with enterprise, we\ncreated [DataCebo](https://datacebo.com) in 2020 with the goal of growing the project.\nToday, DataCebo is the proud developer of SDV, the largest ecosystem for\nsynthetic data generation \u0026 evaluation. It is home to multiple libraries that support synthetic\ndata, including:\n\n* 🔄 Data discovery \u0026 transformation. Reverse the transforms to reproduce realistic data.\n* 🧠 Multiple machine learning models -- ranging from Copulas to Deep Learning -- to create tabular,\n  multi table and time series data.\n* 📊 Measuring quality and privacy of synthetic data, and comparing different synthetic data\n  generation models.\n\n[Get started using the SDV package](https://sdv.dev/SDV/getting_started/install.html) -- a fully\nintegrated solution and your one-stop shop for synthetic data. Or, use the standalone libraries\nfor specific needs.\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsdv-dev%2Fcopulas","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsdv-dev%2Fcopulas","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsdv-dev%2Fcopulas/lists"}