https://github.com/datajoint/datajoint-python
Relational data pipelines for the science lab
https://github.com/datajoint/datajoint-python
cloud-computing data-analysis data-pipelines databases datajoint mysql pipeline-framework python relational-algebra relational-databases relational-model s3 scientific-computing workflow-management
Last synced: 7 days ago
JSON representation
Relational data pipelines for the science lab
- Host: GitHub
- URL: https://github.com/datajoint/datajoint-python
- Owner: datajoint
- License: lgpl-2.1
- Created: 2012-09-19T03:50:15.000Z (over 13 years ago)
- Default Branch: master
- Last Pushed: 2025-05-03T18:54:36.000Z (10 months ago)
- Last Synced: 2025-05-03T19:39:37.517Z (10 months ago)
- Topics: cloud-computing, data-analysis, data-pipelines, databases, datajoint, mysql, pipeline-framework, python, relational-algebra, relational-databases, relational-model, s3, scientific-computing, workflow-management
- Language: Python
- Homepage: https://datajoint.com/docs
- Size: 19.7 MB
- Stars: 176
- Watchers: 17
- Forks: 86
- Open Issues: 38
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# DataJoint for Python
DataJoint is a framework for scientific data pipelines based on the **Relational Workflow Model** — a paradigm where your database schema is an executable specification of your workflow.
- **Tables represent workflow steps** — Each table is a step in your pipeline
- **Foreign keys encode dependencies** — Parent tables must be populated before child tables
- **Computations are declarative** — Define *what* to compute; DataJoint handles *when*
- **Results are immutable** — Full provenance and reproducibility
**Documentation:** https://docs.datajoint.com
> **📘 Upgrading from legacy DataJoint (pre-2.0)?**
> See the **[Migration Guide](https://docs.datajoint.com/how-to/migrate-to-v20/)** for a step-by-step upgrade path.
## Installation
```bash
pip install datajoint
```
or with Conda:
```bash
conda install -c conda-forge datajoint
```
## Example Pipeline

[Yatsenko et al., bioRxiv 2021](https://doi.org/10.1101/2021.03.30.437358)
## Resources
- **[Documentation](https://docs.datajoint.com)** — Complete guides and reference
- [Tutorials](https://docs.datajoint.com/tutorials/) — Learn by example
- [How-To Guides](https://docs.datajoint.com/how-to/) — Task-oriented guides
- [API Reference](https://docs.datajoint.com/api/) — Complete API documentation
- [Migration Guide](https://docs.datajoint.com/how-to/migrate-to-v20/) — Upgrade from legacy versions
- **[DataJoint Elements](https://datajoint.com/docs/elements/)** — Example pipelines for neuroscience
- **[GitHub Discussions](https://github.com/datajoint/datajoint-python/discussions)** — Community support
## Contributing
See [CONTRIBUTING.md](CONTRIBUTING.md) for development setup and guidelines.