{"id":27452206,"url":"https://github.com/epigen/open_pipelines","last_synced_at":"2025-08-11T16:05:35.503Z","repository":{"id":53749578,"uuid":"58192715","full_name":"epigen/open_pipelines","owner":"epigen","description":"Pipelines for NGS data preprocessing by the Bock lab and friends","archived":false,"fork":false,"pushed_at":"2022-08-29T14:12:00.000Z","size":562,"stargazers_count":21,"open_issues_count":12,"forks_count":11,"subscribers_count":11,"default_branch":"master","last_synced_at":"2025-07-31T10:39:14.501Z","etag":null,"topics":["atac-seq","bioinformatics","chip-seq","looper","ngs-pipeline","pipeline","pypiper","rna-seq"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/epigen.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2016-05-06T08:28:41.000Z","updated_at":"2025-04-08T06:28:28.000Z","dependencies_parsed_at":"2022-09-23T21:10:47.008Z","dependency_job_id":null,"html_url":"https://github.com/epigen/open_pipelines","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/epigen/open_pipelines","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/epigen%2Fopen_pipelines","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/epigen%2Fopen_pipelines/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/epigen%2Fopen_pipelines/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/epigen%2Fopen_pipelines/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/epigen","download_url":"https://codeload.github.com/epigen/open_pipelines/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/epigen%2Fopen_pipelines/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":269915422,"owners_count":24495704,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-11T02:00:10.019Z","response_time":75,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["atac-seq","bioinformatics","chip-seq","looper","ngs-pipeline","pipeline","pypiper","rna-seq"],"created_at":"2025-04-15T11:42:02.348Z","updated_at":"2025-08-11T16:05:35.480Z","avatar_url":"https://github.com/epigen.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003e :warning: **The repository is undergoing updating**: Currently, only Amplicon and ATAC-seq pipeline are PEP2.0 compatible!\n\n\n# Pipelines\n\nThese are some of the NGS pipelines developed and used in the [Bock lab](http://medical-epigenomics.org) at [CeMM](http://cemm.at).\n\nThis repository contains pipelines for processing NGS data and associated scripts used by them (in the [/pipelines/tools](pipelines/tools) subdirectory).\nPipelines here are configured to work with [`looper`](http://looper.readthedocs.io) and use [`pypiper`](http://pypiper.readthedocs.io) (see the corresponding repositories). These pipelines work with metadata formatted as a [portable encapsulated project](http://pepkit.github.io).\n\n# Installation and usage\n:warning: Before installing, make sure your pip installation is up to date!\n\n1. Install [`looper`](http://looper.readthedocs.io) and [`pypiper`](http://pypiper.readthedocs.io): \n  - `pip install \"peppy\u003e=0.30.2,\u003c1.0.0\" \"looper\u003e=1.2.0,\u003c2.0.0\" \"piper\u003e=0.12.1,\u003c1.0.0\"`\n2. Clone this repository:\n  - `git clone git@github.com:epigen/open_pipelines.git`\n3. Produce a configuration file for your project ([see here how to do it](http://looper.databio.org/en/latest/defining-a-project/)).\n4. If your project has more than one protocol (data type), link each protocol to the respective pipeine in the following manner:\n\n      ```yaml\n        sample_modifiers:\n          imply:\n            - if:\n                protocol: \"Amplicon\"\n              then:\n                pipeline_interfaces: \"${CODE}/open_pipelines/amplicon_simple.interface.yaml\"\n            - if:\n                protocol: \"ATAC-seq\"\n              then:\n                pipeline_interfaces: \"${CODE}/open_pipelines/atacseq.interface.yaml\"\n      ```\n5. Run all jobs using looper:\n  - `looper run project/metadata/project_config.yaml`\n\n\nIf you are just _using a pipeline_ in a project, and you are not _developing the pipeline_, you should treat this cloned repo as read-only, frozen code, which should reside in a shared project workspace. There should be only one clone for the project, to avoid running data under changing pipeline versions (you should not pull any pipeline updates unless you plan to re-run the whole thing).\n\n\n## Test data for pipelines\n\nSmall example data for several pipeline types is available in the [microtest repository](https://github.com/epigen/microtest/)\n\n\n# Pipeline documentation\nSee the dedicated pages detailing the steps and output of each of the pipelines:\n - [ATAC-seq and DNAse-seq](pipelines/atacseq.md)\n - [ChIP-seq and ChIPmentation](pipelines/chipseq.md)\n - [HiC and HiChIP](pipelines/hic.md)\n - [RNA-seq](pipelines/rnaseq.md)\n\nThe remaining pipelines will get their dedicated documentation soon.\n\n\n# Contributing\n\nWe appreciate and encourage contributions to existing pipelines or submitions of new ones.\n\nSimply clone the repository, make your changes and create a pull request with the changes in the `dev` branch.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fepigen%2Fopen_pipelines","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fepigen%2Fopen_pipelines","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fepigen%2Fopen_pipelines/lists"}