Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/leipzig/awesome-reproducible-research
A curated list of reproducible research case studies, projects, tutorials, and media
List: awesome-reproducible-research
awesome awesome-list reproducibility reproducible-analysis reproducible-research reproducible-science reproducible-workflows
Last synced: 2 months ago
- Host: GitHub
- URL: https://github.com/leipzig/awesome-reproducible-research
- Owner: leipzig
- Created: 2017-11-15T18:33:19.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2024-05-21T05:25:30.000Z (8 months ago)
- Last Synced: 2024-05-23T09:27:14.283Z (8 months ago)
- Topics: awesome, awesome-list, reproducibility, reproducible-analysis, reproducible-research, reproducible-science, reproducible-workflows
- Language: Python
- Homepage:
- Size: 1.22 MB
- Stars: 297
- Watchers: 19
- Forks: 34
- Open Issues: 57
Metadata Files:
- Readme: readme.md
- Contributing: contributing.md
- Code of conduct: code-of-conduct.md
Awesome Lists containing this project
- more-awesome - Reproducible Research - Go from raw data to results in a repeatable and transparent manner. (Computer Science)
- awesome-security-collection - **60** stars
- ultimate-awesome - awesome-reproducible-research - A curated list of reproducible research case studies, projects, tutorials, and media. (Other Lists / Monkey C Lists)
- jimsghstars - leipzig/awesome-reproducible-research - A curated list of reproducible research case studies, projects, tutorials, and media (Python)
- 100-AI-Machine-learning-Deep-learning-Computer-vision-NLP - 👆
README
# Awesome Reproducible Research [![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome) [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.3564746.svg)](https://doi.org/10.5281/zenodo.3564746)
> A curated list of reproducible research case studies, projects, tutorials, and media
## Contents
- [Case studies](#case-studies)
- [Ad-hoc reproductions](#ad-hoc-reproductions)
- [Theory papers](#theory-papers)
- [Theses and dissertations](#theses-and-dissertations)
- [Tool reviews](#tool-reviews)
- [Courses](#courses)
- [Development Resources](#development-resources)
- [Literature tools](#literature-tools)
- [Scientific Data Management Systems](#scientific-data-management-systems)
- [Books](#books)
- [Databases](#databases)
- [Data Repositories](#data-repositories)
- [Exemplar Portals](#exemplar-portals)
- [Runnable Papers](#runnable-papers)
- [Journals](#journals)
- [Ontologies](#ontologies)
- [Minimal Standards](#minimal-standards)
- [Organizations](#organizations)
- [Awesome Lists](#awesome-lists)
## Case studies
The term "case studies" is used here in a general sense to describe any study of reproducibility. A _reproduction_ is an attempt to arrive at comparable results with identical data using computational methods described in a paper. A _refactor_ involves refactoring existing code into frameworks and other reproducibility best practices while preserving the original data. A _replication_ involves generating new data and applying existing methods to achieve comparable results. A _robustness test_ applies various protocols, workflows, statistical models or parameters to a given data set to study their effect on results, either as a follow-up to an existing study or as a "bake-off". A _census_ is a high-level tabulation conducted by a third party. A _survey_ is a questionnaire sent to practitioners. A _case narrative_ is an in-depth first-person account. An _independent discussion_ utilizes a secondary independent author to interpret the results of a study as a means to improve inferential reproducibility.
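The computational approaches defined above differ mainly in whether the data are new and whether the methods are varied. The following is a minimal sketch of that distinction, not part of the list's tooling: the `Approach` enum, the `classify` helper, and the order of the checks are all illustrative assumptions. Census, survey, case narrative, and independent discussion describe how studies are observed or discussed rather than how an analysis is rerun, so they are left out, as are combined labels such as "Refactor and test of robustness".

```python
from enum import Enum


class Approach(Enum):
    """Hypothetical labels for four of the approaches defined above."""
    REPRODUCTION = "reproduction"        # identical data, methods as described in the paper
    REFACTOR = "refactor"                # original data, existing code restructured
    REPLICATION = "replication"          # new data, existing methods
    ROBUSTNESS_TEST = "robustness test"  # given data set, varied methods or parameters


def classify(new_data: bool, varied_methods: bool, restructured_code: bool = False) -> Approach:
    """Map three yes/no questions about a study onto the definitions above.

    The precedence (new data first, then varied methods, then restructured
    code) is an assumption made for this sketch.
    """
    if new_data:
        return Approach.REPLICATION
    if varied_methods:
        return Approach.ROBUSTNESS_TEST
    if restructured_code:
        return Approach.REFACTOR
    return Approach.REPRODUCTION


if __name__ == "__main__":
    # Same data, same methods, code untouched: a reproduction.
    print(classify(new_data=False, varied_methods=False).value)
```

A study that generates new data is treated as a replication regardless of the other answers, which is one possible reading of the definitions rather than a rule stated by the list.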

| Study | Field | Approach | Size |
| --- | --- | --- | --- |
| | Medicine | Census | 80 studies |
| | Cancer biology | Refactor | 8 studies |
| | Biostatistics | Census | 56 studies |
| | Genetics | Reproduction | 18 studies |
| | Software engineering | Replication | 4 companies |
| | Signal processing | Census | 134 papers |
| | Biomedical sciences | Survey | 23 PIs |
| | Bioinformatics | Census | 100 studies |
| | Cancer biology | Replication | 53 studies |
| Collberg et al 2014, Collberg & Proebsting 2016 | Computer science | Census | 613 papers |
| | Psychology | Replication | 100 studies |
| | Biomedical sciences | Census | 100 papers |
| | Epidemiology | Robustness test | 417 variables |
| | Economics | Reproduction | 67 papers |
| | Biomedical sciences | Census | 441 papers |
| | Science | Survey | 1,576 researchers |
| | NLP | Replication | 3 studies |
| | Cancer biology | Replication | 9 studies |
| | Biomedical sciences | Census | 318 journals |
| | Science | Case narrative | 31 PIs |
| | Biological sciences | Survey | 704 PIs |
| | Bioinformatics | Refactor | 1 study |
| | Economics | Replication | 18 studies |
| | Machine learning | Census | 30 studies |
| | Archaeology | Case narrative | 1 survey |
| | Comparative toxicogenomics | Census | 51,292 claims in 3,363 papers |
| | Artificial intelligence | Census | 400 papers |
| | Economics | Census | 203 papers |
| | Computational science | Reproduction | 204 papers, 180 authors |
| | Genomics | Case narrative | 1 study |
| | Social sciences | Replication | 21 papers |
| | Psychology | Robustness test | One data set, 29 analyst teams |
| | Medicine and health sciences | Census | 30 papers |
| | Microbiome immuno oncology | Replication | 1 paper |
| | Bioinformatics | Refactor and test of robustness | 1 paper |
| | Biomedical Sciences | Census | 149 papers |
| | Bioinformatics | Synthetic replication & refactor | 1 paper |
| | Geosciences | Survey, Reproduction | 146 scientists, 41 papers |
| | Reinforcement Learning | Reproduction, case narrative | 1 paper |
| | Computational physics | Census | 306 papers |
| | Science & Engineering | Survey | 215 participants |
| | Nephrology | Robustness test | 1 paper |
| | Social sciences & other | Census | 810 Dataverse studies |
| | Social sciences & other | Census, Survey | 2109 replication datasets |
| | GIScience/Geoinformatics | Census, Survey | 32 papers, 22 participants |
| | Genomics | Robustness test | 8 studies |
| | Geosciences | Survey | 360 papers |
| | Deep learning | Robustness test | 1 analysis |
| | Genomics | Case narrative | 1 analysis |
| | Pharmacogenomics | Case narrative | 2 analyses |
| | Biomedical sciences and Psychology | Census | 127 registered reports |
| | All | Census | 1,159,166 Jupyter notebooks |
| | Virology | Census | 236 papers |
| Vlisides et al 2019, Sieber et al 2019 | Anaesthesia | Independent discussion | 1 study |
| | Psychology | Replication | 1 paper |
| | Cell pharmacology | Robustness test | 5 labs |
| | Machine learning | Reproduction | 18 conference papers |
| | Experimental archaeology | Replication | 1 theory |
| | Neurology | Census | 202 papers |
| | Psychology | Replication | 2 experiments |
| | Ecology and Evolution | Census | 163 papers |
| | Neuroimaging | Robustness test | 1 data set, 70 teams |
| | Psychology | Replication | 1 experiment, 21 labs, 2,220 participants |
| | Psychology | Census | 62 papers |
| | Oncology | Census | 154 meta-analyses |
| | Bioinformatics | Robustness test | 1 data set |
| | Neurobiology | Census | 41 papers |
| | Genetics | Census | 1799 papers |
| | Psychology | Reproduction | 33 meta-analyses |
| | Biomedical science | Census | 792 papers |
| | Ecology | Census | 346 papers |
| | Physics | Replication | 2 papers |
| | Reproductive endocrinology | Census | 222 papers |
| | Biomedical sciences | Census | 240 papers |
| | Environmental Modelling | Census | 7500 papers |
| | Cardiology | Census | 532 papers |
| | GIS | Census | 75 papers |
| | Life Sciences | Survey | 251 researchers |
| | Genetics | Robustness test | 1 paper |
| | Life Sciences | Census | 3377 articles |
| | Computational Biology | Census | 622 papers |
| | Computational Biology | Robustness test | 6 studies |
| | Computational Biology | Survey | 214 researchers |
| | Differential expression | Census | 2109 GEO submissions |
| | Computational biology | Census | 186 papers |
| | Psychology | Robustness test | 1 study |
| | Proteomics | Refactor | 1 analysis |
| | Social science | Robustness test | 73 teams |
| | Radiology | Census | 62 studies |
| | Clinical ML | Census | 511 papers |
| | 9 Fields | Census | 875 articles |
| | Life Sciences | Census | 3556 papers |
| | Biomedical Sciences | Census | 9625 Jupyter notebooks |
| | Radiation oncology | Robustness test | 300k models |
| | Behavioral sciences | Refactor | |
| | Cancer biology | Census | 306 papers |
| | Marketing | Replication | 10 papers |
| | Bioinformatics | Refactor | 1 paper |
| | Ecology | Robustness test | 2 datasets, 174 teams |
| | Psychology | Replication | 16 findings |
| | Linguistics | Census | 600 articles |
| | Ecology | Census | 177 papers |
| | Biology | Census | 750 papers |
| | Economics | Robustness test | 110 papers |
| | Research | Survey | 2500 researchers |
## Ad-hoc reproductions
These are one-off unpublished attempts to reproduce individual studies.

| Reproduction | Original study |
| --- | --- |
| https://rdoodles.rbind.io/2019/06/reanalyzing-data-from-human-gut-microbiota-from-autism-spectrum-disorder-promote-behavioral-symptoms-in-mice/ and https://notstatschat.rbind.io/2019/06/16/analysing-the-mouse-autism-data/ | Sharon, G. et al. Human Gut Microbiota from Autism Spectrum Disorder Promote Behavioral Symptoms in Mice. Cell 2019, 177 (6), 1600–1618.e17. |
| https://github.com/sean-harrison-bristol/CCR5_replication | Wei, X.; Nielsen, R. CCR5-∆32 Is Deleterious in the Homozygous State in Humans. Nat. Med. 2019 DOI: 10.1038/s41591-019-0459-6. (retracted) |
| https://github.com/leipzig/placenta | Leiby et al "Lack of detection of a human placenta microbiome in samples from preterm and term deliveries" |
| https://doi.org/10.1186/s40168-018-0575-4 | Heilbut et al "Rigor and Replication in Alzheimer’s Therapeutic Development: A Case Study" |
## Theory papers
| Authors/Date | Title | Field | Type |
| --- | --- | --- | --- |
| | Why most published research findings are false | Science | Statistical reproducibility |
| | A Quick Guide to Organizing Computational Biology Projects | Bioinformatics | Best practices |
| | Ten Simple Rules for Reproducible Computational Research | Computational science | Best practices |
| | The Economics of Reproducibility in Preclinical Research | Preclinical research | Best practices |
| | The Generalizability Crisis | Psychology | Statistical reproducibility |
| | Unreproducible Research is Reproducible | Machine Learning | Methodology |
| | Trustworthy data underpin reproducible research | Physics | Scientific philosophy |
| | Scientific discovery in a model-centric framework: Reproducibility, innovation, and epistemic diversity | Science | Statistical reproducibility |
| | A Realistic Guide to Making Data Available Alongside Code to Improve Reproducibility | Science | Best practices |
| | The importance of transparency and reproducibility in artificial intelligence research | Artificial Intelligence | Critique |
| | What is replication? | Science | Scientific philosophy |
| | A Beginner’s Guide to Conducting Reproducible Research | Ecology | Best practices |
| | Realistic and Robust Reproducible Research for Biostatistics | Biostatistics | Best practices |
| | A Link is not Enough – Reproducibility of Data | Databases | Best practices |
| | COVID-19 pandemic reveals the peril of ignoring metadata standards | Virology | Critique |
| | Principles for data analysis workflows | Data science | Best practices |
| | Reproducible Research: A Retrospective | Public health | Review |
| | Streamlining Data-Intensive Biology With Workflow Systems | Biology | Best practices |
| | Meta Research: Questionable research practices may have little effect on replicability | Science | Statistical reproducibility |
| | We need to keep a reproducible trace of facts, predictions, and hypotheses from gene to function in the era of big data | Functional genomics | Critique |
| | A research parasite's perspective on establishing a baseline to avoid errors in secondary analyses | Science | Best practices |
| | The multiplicity of analysis strategies jeopardizes replicability: lessons learned across disciplines | Science | Critique |
| | Reproducible Results Policy | Water Resources | Policy |
| | 10 Things for Curating Reproducible and FAIR Research | Social sciences | Best practices |
| | Checklist for Artificial Intelligence in Medical Imaging (CLAIM): A Guide for Authors and Reviewers | Medical imaging | Best practices |
| | Generative and reproducible benchmarks for comprehensive evaluation of machine learning classifiers | Machine Learning | Best practices |
| | The five pillars of computational reproducibility: Bioinformatics and beyond | Bioinformatics | Best practices |
| | Big little lies: a compendium and simulation of p-hacking strategies | Research | Statistical reproducibility |
| | Is N-Hacking Ever OK? The consequences of collecting more data in pursuit of statistical significance | Biology | Statistical reproducibility |
| | A how-to guide for code-sharing in biology | Biology | Best practices |
| | Characterising Reproducibility Debt in Scientific Software: A Systematic Literature Review | Reproducible Research | Review of reviews |
| | The evolution of computational research in a data-centric world | Computational biology | Review |
| | Genomic reproducibility in the bioinformatics era | Genomics | Review |
| | Continuous Analysis: Evolution of Software Engineering and Reproducibility for Science | Science | Best practices |
## Theses and dissertations
| Authors/Date | Title | Institution |
| --- | --- | --- |
| | A Framework for Reproducible Computational Research | University of Chicago |
| | Reproducible Research Practices, Scientific Transparency, and Subgroup Claims: A Meta-Research Dissertation | Stanford University |
| | Publishing Reproducible Geoscientific Papers: Status quo, benefits, and opportunities | University of Münster |
| | Data preservation and reproducibility at the LHCb experiment at CERN | University of Cambridge |
| | Interactive Tools for Reproducible Science -- Understanding, Supporting, and Motivating Reproducible Science Practices | University of Munich |
| | Tests of Robustness in Peer Review | Drexel University |
| | Infrastructures and Practices for Reproducible Research in Geography, Geosciences, and GIScience | University of Münster |
| | Free will in psychological research: considerations on methodic procedure and reproducibility of results | Leuphana University |
| | The exploitation of provenance and versioning in the reproduction of e-experiments | University of Newcastle Upon Tyne |
| | Reproducibility and Reusability in Deep Reinforcement Learning | McGill University |
| | Reproducibility Crisis in Science: Causes and Possible Solutions | Boston University |
| | Reliability, Replicability and Reproducibility in PET Imaging | Karolinska Institutet |
| | Assessing reproducibility and value in genomic signatures | The Johns Hopkins University |
| | Scientific workflow execution reproducibility using cloud-aware provenance | University of the West of England, Bristol |
| | A provenance-based semantic approach to support understandability, reproducibility, and reuse of scientific experiments | Friedrich-Schiller-Universität Jena |
| | Rigor and reproducibility of cancer medicine evidence | Oklahoma State University |
| | Long-term impact of an editorial intervention to improve paper transparency and reproducibility | Universitat Politècnica de Catalunya |
| | Advancing Genomic and Transcriptomic Knowledge Through Reproducible Bioinformatics Workflows | Penn State University |
## Tool reviews
| Authors/Date | Title | Tools |
| --- | --- | --- |
| | Out-of-the-box Reproducibility: A Survey of Machine Learning Platforms | MLflow, Polyaxon, StudioML, Kubeflow, CometML, Sagemaker, GCPML, AzureML, Floydhub, BEAT, Codalab, Kaggle |
| | A Survey on Collecting, Managing, and Analyzing Provenance from Scripts | Astro-Wise, CPL, CXXR, Datatrack, ES3, ESSW, IncPy, Lancet, Magni, noWorkflow, Provenance Curios, pypet, RDataTracker, Sacred, SisGExp, SPADE, StarFlow, Sumatra, Variolite, VCR, versuchung, WISE, YesWorkflow |
| Leipzig et al 2021 (supplemental) | The Role of Metadata in Reproducible Computational Research | CellML, CIF2, DATS, DICOM, EML, FAANG, GBIF, GO, ISO/TC 276, MIAME, NetCDF, OGC, ThermoML, CRAN, Conda, pip setup.cfg, EDAM, CodeMeta, Biotoolsxsd, DOAP, ontosoft, SWO, OBCS, STATO, SDMX, DDI, MEX, MLSchema, MLFlow, Rmd, CWL, CWLProv, RO-Crate, RO, WICUS, OPM, PROV-O, ReproZip, ProvOne, WES, BagIt, BCO, ERC, BEL, DC, JATS, ONIX, MeSH, LCSH, MP, Open PHACTS, SWAN, SPAR, PWO, PAV, Manubot, ReScience, PandocScholar |
| Konkol, Markus, Nüst, Daniel, Goulier, Laura 2020 | Publishing computational research - a review of infrastructures for reproducible and transparent scholarly communication | Authorea, Binder, CodeOcean, eLife RDS, Galaxy Project, Gigantum, Manuscript, o2r, REANA, ReproZip, Whole tale |
## Courses
- MOOCs
- [Coursera Reproducible Research](https://www.coursera.org/learn/reproducible-research) - Roger Peng et al JHU. Very popular course.
- [edX Principles, Statistical and Computational Tools for Reproducible Science](https://www.edx.org/course/principles-statistical-computational-harvardx-ph527x) - John Quackenbush et al Harvard
- [Reproducible research: methodological principles for transparent science](https://www.fun-mooc.fr/en/courses/reproducible-research-methodological-principles-transparent-scie/) - Beginner level. Note taking, version control, notebooks, reproducible data analysis. Bilingual English/French.
- Online course content
- [Tools for Reproducible Research](http://kbroman.org/Tools4RR/) - Karl Broman UW, includes resources page
- [R for Reproducible Scientific Analysis](https://swcarpentry.github.io/r-novice-gapminder/) - Software Carpentry workshop primer using Gapminder data
- [R-DAVIS](https://gge-ucd.github.io/R-DAVIS/syllabus.html) - Student-developed computer literacy and data course in R
- [AMIA2019](https://github.com/StatTag/amia-2019-spring-rr/) - Pragmatic RR for Analysis, Dissemination and Publication
- [PSU-PSY525](https://github.com/psu-psychology/psy-525-reproducible-research-2020) - Transparent, Open, and Reproducible Research Practices in the Social and Behavioral Sciences
- [Monash-RRR](https://monashdatafluency.github.io/r-rep-res/) - Reproducible Research in R workshop tutorial
- [OSU-OSRR](https://github.com/cbahlai/OSRR_course) - An open science and reproducible research course targeted at organismal ecologists
- [Reproducible-Science-Curriculum](https://github.com/Reproducible-Science-Curriculum) - A curriculum for teaching reproducible computational science bootcamps
## Development Resources
- R
- [CRAN Task View - Reproducible Research](https://cran.r-project.org/web/views/ReproducibleResearch.html) - packages relevant to RCR in R
- [liftr](https://liftr.me/) - persistent reproducible reporting through containerized R Markdown documents
- [repo](https://github.com/franapoli/repo) - provenance framework package
- [orderly](https://vimc.github.io/orderly/articles/orderly.html) - R package that automates writing reproducible analyses
- Linux-related (polyglot)
- [Reproducible Builds](https://reproducible-builds.org/) - a set of software development practices that create an independently-verifiable path from source to binary code
- Python
- [mlf-core](https://mlf-core.com) - Framework to develop GPU deterministic machine learning models with PyTorch, TensorFlow and XGBoost
## Literature tools
- [Scite](https://scite.ai/) - Citation statement AI for discovering and evaluating scientific articles
- [SciScore](https://www.sciscore.com/) - SciScore evaluates methods sections against a variety of rigor criteria, analyzes sentences that mention research resources (antibodies, cell lines, plasmids and software tools), and determines how uniquely identifiable each resource is from the provided metadata.
- [Ripeta](https://www.ripeta.com/) - Ripeta quickly scans research manuscripts or articles to identify and record key reproducibility variables, such as data availability, code acknowledgements, and research analysis methods.
## Scientific Data Management Systems
- [DVC](https://dvc.org/) - DVC tracks machine learning models and data sets
- [DataLad](https://www.datalad.org/) - Git-based versioning for data and provenance
- [Overture](https://www.overture.bio/) - Portal, query interface, visualization and schema framework that powers ICGC, KFDC, GDC
- [Fairly Toolset](https://fairly.readthedocs.io) - Tools for preparing, publishing and downloading datasets from research data repositories directly into computing environments. It provides integration with [Zenodo](https://fairly.readthedocs.io) and [Figshare](https://figshare.com/).
## Books
- [Reproducible Research with R and R Studio 2013](https://g.co/kgs/RxcFNm)
- [Implementing Reproducible Research 2014](https://osf.io/s9tya/) - Describes projects: Sumatra, Vistrails, CDE, SOLE, JUMBO, CML, knitr. Content available on OSF.
- [The Practice of Reproducible Research 2017](https://g.co/kgs/jZiMR7) - 31 first person case narratives and intro chapters
- [Dynamic Documents with R and knitr 2015](https://g.co/kgs/dpzkF4)
- [The Turing Way: A Handbook for Reproducible Data Science 2020](https://the-turing-way.netlify.com/introduction/introduction)
- [Reproducibility and Replicability in Science](https://www.nap.edu/catalog/25303/reproducibility-and-replicability-in-science)
- [Reproducibility: Principles, Problems, Practices, and Prospects](https://www.wiley.com/en-ec/Reproducibility:+Principles,+Problems,+Practices,+and+Prospects+-p-9781118864975)
## Databases
- [ReplicationWiki](http://replication.uni-goettingen.de/wiki/index.php) - Database for empirical studies with information about methods, data and software used, availability of replication material and whether replications, corrections or retractions are known. Mostly focused on social sciences.
- [ReproCrawl](https://crawl.reproduciblescience.org/)
- [ReplicationDatabase](https://metaanalyses.shinyapps.io/replicationdatabase/) - 1211 replication findings on 333 psychology studies
## Data Repositories
All these repositories assign Digital Object Identifiers (DOIs) to data
- [DataCite](https://datacite.org) - 12M+ DOIs registered for 46 allocators. Offers APIs and a metadata schema.
- [Data Dryad](https://datadryad.org) - curated, metadata-centric, focused on data associated with published articles, $120 submission fee (various waivers available)
- [Figshare](https://figshare.com) - 20 GB of free private space, unlimited public space, >2M articles, >5k projects
- [OSF](https://osf.io) - Project-oriented system with access control and integration with popular tools. Unlimited storage for projects, but individual files are limited to 5 gigabytes (GB) each.
- [Zenodo](https://zenodo.org/) - Allows embargoed, restricted access, metadata support. 50GB limit.
## Exemplar Portals
Places to find papers with code or portals to host them
- [Jupyter Gallery](https://github.com/jupyter/jupyter/wiki/A-gallery-of-interesting-Jupyter-Notebooks) - Gallery of interesting Jupyter notebooks
- [Papers With Code](https://paperswithcode.com/) - ML papers with code
- [NARPS](https://github.com/poldrack/narps) - Code related to Neuroimaging Analysis Replication and Prediction Study
- [Codeocean](https://codeocean.com/explore) - A gallery of cloud-based containers with reproducible analyses
## Runnable Papers
Experimental papers that have associated notebooks
### Haibe-Kains lab
| Publication | CodeOcean link |
| --- | --- |
| Mer AS et al. Integrative Pharmacogenomics Analysis of Patient Derived Xenografts | codeocean.com/capsule/056639 |
| Gendoo, Zon et al. MetaGxData: Clinically Annotated Breast, Ovarian and Pancreatic Cancer Datasets and their Use in Generating a Multi-Cancer Gene Signature | codeocean.com/capsule/643863 |
| Yao et al. Tissue specificity of in vitro drug sensitivity | codeocean.com/capsule/550275 |
| Safikhani Z et al. Gene isoforms as expression-based biomarkers predictive of drug response in vitro | codeocean.com/capsule/000290 |
| El-Hachem et al. Integrative cancer pharmacogenomics to infer large-scale drug taxonomy | codeocean.com/capsule/425224 |
| Safikhani Z et al. Revisiting inconsistency in large pharmacogenomic studies | codeocean.com/capsule/627606 |
| Sandhu V et al. Meta-analysis of 1,200 transcriptomic profiles identifies a prognostic model for pancreatic ductal adenocarcinoma | codeocean.com/capsule/269362 |
| Sharifi-Noghabi et al Drug sensitivity prediction from cell line-based pharmacogenomics data: guidelines for developing machine learning models | codeocean.com/capsule/7358839 |
| Arrowsmith et al Automated detection of dental artifacts for large-scale radiomic analysis in radiation oncology | codeocean.com/capsule/2097894 |
| Mer et al Biological and therapeutic implications of a unique subtype of NPM1 mutated AML | codeocean.com/capsule/8791617 |
| Ortmann et al KuLGaP: A Selective Measure for Assessing Therapy Response in Patient-Derived Xenografts | codeocean.com/capsule/2817911 |
| Madani Tonekaboni et al Large organized chromatin lysine domains help distinguish primitive from differentiated cell populations | codeocean.com/capsule/6911149 |
| Seo et al SYNERGxDB: an integrative pharmacogenomic portal to identify synergistic drug combinations for precision oncology | codeocean.com/capsule/6322807 |
| Mammoliti et al Creating reproducible pharmacogenomic analysis pipelines | codeocean.com/capsule/6718332 |
| Manem et al Modeling Cellular Response in Large-Scale Radiogenomic Databases to Advance Precision Radiotherapy | codeocean.com/capsule/1166221 |
| Tonekaboni et al CREAM: Clustering of genomic REgions Analysis Method | codeocean.com/capsule/0002901 |
| Madani Tonekaboni et al SIGN: similarity identification in gene expression | codeocean.com/capsule/0544852 |
| Mer et al Integrative Pharmacogenomics Analysis of Patient-Derived Xenografts | codeocean.com/capsule/0566399 |
| Sandhu et al Applications of Computational Systems Biology in Cancer Signaling Pathways | codeocean.com/capsule/0795540 |
| Sandhu et al Meta-Analysis of 1,200 Transcriptomic Profiles Identifies a Prognostic Model for Pancreatic Ductal Adenocarcinoma | codeocean.com/capsule/7402260 |
| Gendoo et al MetaGxData: Clinically Annotated Breast, Ovarian and Pancreatic Cancer Datasets and their Use in Generating a Multi-Cancer Gene Signature | codeocean.com/capsule/6438633 |
| Yao et al Tissue specificity of in vitro drug sensitivity | codeocean.com/capsule/5502756 |
| Safikhani et al Gene isoforms as expression-based biomarkers predictive of drug response in vitro | codeocean.com/capsule/0002901 |
| El-Hachem et al Integrative Cancer Pharmacogenomics to Infer Large-Scale Drug Taxonomy | codeocean.com/capsule/4252248 |
| Safikhani et al Revisiting inconsistency in large pharmacogenomic studies | codeocean.com/capsule/6276064 |
### Pachter lab
| Publication | GitHub link |
| --- | --- |
| Pimentel et al 2017. Differential analysis of RNA-seq incorporating quantification uncertainty | sleuth_paper_analysis |
| Melsted et al 2019. Modular and efficient pre-processing of single-cell RNA-seq | MBGBLHGP_2019 |
| Chari et al 2021. Whole Animal Multiplexed Single-Cell RNA-Seq Reveals Plasticity of Clytia Medusa Cell Types | CWGFLHGCCHAP_2021 |
### Siepel lab
Blumberg et al 2021. Characterizing RNA stability genome-wide through combined analysis of PRO-seq and RNA-seq data
https://codeocean.com/capsule/7351682
## Journals
- [ReScience](http://rescience.github.io/) - Journal dedicated to in silico reproductions and tests of robustness, lives on GitHub.
- [eLife](https://elifesciences.org/for-the-press/eb096af1/elife-launches-executable-research-articles-for-publishing-computationally-reproducible-results) - Executable Research Articles (ERA) with inline executable blocks
## Ontologies
- [FAIRsharing](https://fairsharing.org) - standards, databases, and policies
- [BioPortal](https://bioportal.bioontology.org/) - 660 biomedical ontologies
## Minimal Standards
- [STORMS](https://www.stormsmicrobiome.org/) - Strengthening The Organization and Reporting of Microbiome Studies (STORMS) is a checklist for reporting on human microbiome studies. [Preprint](https://doi.org/10.1101/2020.06.24.167353)
## Organizations
- [ResearchObject.org](http://www.researchobject.org/) - RO specifications and publications
- [BioCompute](https://osf.io/zm97b/) - BCO specs
- [rOpenSci](https://ropensci.org) - Tools, conferences, and education
- [Open Science Framework](https://osf.io) - Open source project management
- [pyOpenSci](https://www.pyopensci.org/) - Promotes open and reproducible research through peer-review of scientific Python packages
- [Replication Network](https://replicationnetwork.com/) - Furthering the practice of replication in economics. Econ replication database.
- [repliCATS project](https://replicats.research.unimelb.edu.au/) - Estimating the replicability of research in the social sciences. [Paper](https://osf.io/preprints/metaarxiv/2pczv/)
- [ReproHack](https://reprohack.github.io/reprohack-hq/) - 1-day reproducibility hackathons held worldwide
- [CODECHECK](https://codecheck.org.uk/) - community for checking executability of scientific preprints and papers
- [CASCaD](https://www.cascad.tech/) - Certification Agency for Scientific Code and Data. Issues reproducibility certificates.
- [Reproducibility for Everyone](https://www.repro4everyone.org/) - Community-led reproducibility workshops
- [CUrating for REproducibility](https://curating4reproducibility.org/) - curation of research and code for digital preservation
- [Michigan Institute for Data Science Reproducibility Hub](https://midas.umich.edu/reproducibility-resources/) - [reproducibility challenge](https://hdsr.mitpress.mit.edu/pub/mlconlea/release/1) manuscripts & presentations
- [OpenMKT](https://openmkt.org/) - transparency and quality of marketing research published in academic journals
- [Many Co-Authors](https://manycoauthors.org/) - online platform designed to collect and share information on the provenance and availability of the data for all articles co-authored by Francesca Gino
- [FORRT](https://forrt.org/) - Framework for Open and Reproducible Research Training advancing research transparency, reproducibility, rigor, and ethics through pedagogical reform and meta-scientific research
## Awesome Lists
- [Awesome Pipeline](https://github.com/pditommaso/awesome-pipeline) - So many pipeline frameworks
- [Awesome Docker](https://github.com/veggiemonk/awesome-docker) - Everything related to the Docker containerization system
- [Awesome R](https://github.com/qinwf/awesome-R#reproducible-research) - Section on RR tools
- [Awesome Reproducible R](https://github.com/datasnakes/awesome-reproducible-R) - RRR tools
- [Awesome Jupyter](https://github.com/adebar/awesome-jupyter) - Jupyter projects, libraries and resources
- [Awesome Bioinformatics Benchmarks](https://github.com/j-andrews7/Awesome-Bioinformatics-Benchmarks) - Benchmarks are a related aspect of robustness testing
- [Awesome Open Science](https://github.com/ZoranPandovski/awesome-open-science) - Resources, data, tools, and scholarship
- [Awesome Public Datasets](https://github.com/awesomedata/awesome-public-datasets) - A topic-centric list of HQ open datasets
- [Awesome Semantic Web](https://github.com/semantalytics/awesome-semantic-web) - Semantic web and linked data resources.
## Contribute
Contributions welcome! Read the [contribution guidelines](contributing.md) first. You may find my `src/doi2md.py` script useful for quickly generating entries from a DOI.
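For contributors who want to see what generating an entry from a DOI can involve, here is a minimal sketch. It is not the `src/doi2md.py` script and makes no claim about how that script works. It assumes metadata is fetched as CSL JSON through doi.org content negotiation, and the function names `fetch_csl` and `to_markdown`, as well as the entry layout, are hypothetical.

```python
import json
import urllib.request


def fetch_csl(doi: str) -> dict:
    """Fetch CSL JSON metadata for a DOI via doi.org content negotiation (needs network)."""
    req = urllib.request.Request(
        f"https://doi.org/{doi}",
        headers={"Accept": "application/vnd.citationstyles.csl+json"},
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def to_markdown(meta: dict) -> str:
    """Format one CSL JSON record as a markdown list entry (hypothetical layout)."""
    authors = meta.get("author", [])
    first = authors[0].get("family", "Unknown") if authors else "Unknown"
    byline = f"{first} et al" if len(authors) > 1 else first
    # CSL JSON stores dates as {"date-parts": [[year, month, day]]}.
    date_parts = meta.get("issued", {}).get("date-parts", [[None]])
    year = date_parts[0][0] if date_parts and date_parts[0] else None
    # Collapse line breaks and repeated spaces that sometimes appear in titles.
    title = " ".join(str(meta.get("title", "")).split())
    label = f"{byline} {year}" if year else byline
    return f"- [{label}](https://doi.org/{meta['DOI']}) - {title}"


if __name__ == "__main__":
    # Uses the DOI from this list's own Zenodo badge; requires network access.
    print(to_markdown(fetch_csl("10.5281/zenodo.3564746")))
```

Keeping the formatting step separate from the network call means the output layout can be adjusted and checked offline with a hand-written metadata dictionary.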
## License
[![CC0](http://mirrors.creativecommons.org/presskit/buttons/88x31/svg/cc-zero.svg)](https://creativecommons.org/publicdomain/zero/1.0/)
To the extent possible under law, Jeremy Leipzig has waived all copyright and
related or neighboring rights to this work.