https://github.com/szcf-weiya/splitclustertest.jl

Julia package for "FDR Control via Data Splitting for Testing-after-Clustering (arXiv: 2410.06451)"
data-splitting fdr post-selection


# SplitClusterTest.jl

[![](https://img.shields.io/badge/docs-latest-blue.svg)](https://szcf-weiya.github.io/SplitClusterTest.jl/dev) [![codecov](https://codecov.io/gh/szcf-weiya/SplitClusterTest.jl/graph/badge.svg?token=dsRMZFM1q5)](https://codecov.io/gh/szcf-weiya/SplitClusterTest.jl) [![CI](https://github.com/szcf-weiya/SplitClusterTest.jl/actions/workflows/ci.yml/badge.svg)](https://github.com/szcf-weiya/SplitClusterTest.jl/actions/workflows/ci.yml)

Julia package for the paper

> Wang, L., Lin, Y., & Zhao, H. (2024). False Discovery Rate Control via Data Splitting for Testing-after-Clustering (arXiv:2410.06451). arXiv.

The proposed approach addresses the double-dipping issue in testing-after-clustering tasks. The issue is particularly common in single-cell data analysis, where the same data are used both for clustering (to identify cell types) and for testing (to select differentially expressed genes). Reusing the data in this way can inflate the number of false positives.
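To make the double-dipping effect concrete, here is a minimal simulation sketch that uses only the Julia standard library. It does not use this package's API and does not implement the paper's procedure. The helper names `welch_t` and `rejection_rate` are hypothetical, and a median split of a single feature is assumed as a crude stand-in for a clustering step. The data are pure noise, so every rejection is a false positive.

```julia
using Random, Statistics

# Welch two-sample t-statistic (hypothetical helper; no p-value is computed,
# so that the sketch stays within the standard library).
function welch_t(x::AbstractVector, y::AbstractVector)
    return (mean(x) - mean(y)) / sqrt(var(x) / length(x) + var(y) / length(y))
end

# Fraction of pure-noise features whose |t| exceeds `threshold`.
# double_dip = true : groups come from the very feature being tested
#                     (a median split standing in for clustering).
# double_dip = false: groups are assigned at random, independently of the data.
function rejection_rate(rng::AbstractRNG; n = 100, nrep = 1000,
                        threshold = 1.96, double_dip = true)
    rejections = 0
    for _ in 1:nrep
        x = randn(rng, n)  # no real groups exist in this feature
        labels = double_dip ? (x .> median(x)) : (shuffle(rng, 1:n) .<= n ÷ 2)
        t = welch_t(x[labels], x[.!labels])
        rejections += abs(t) > threshold
    end
    return rejections / nrep
end

rng = MersenneTwister(2024)
naive = rejection_rate(rng; double_dip = true)
honest = rejection_rate(rng; double_dip = false)
println("double dipping:     ", naive)
println("independent labels: ", honest)
```

When the groups are derived from the tested feature, the group means are separated by construction, so the test rejects far more often than its nominal level. With labels that are independent of the data, the rejection rate stays close to the nominal level.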

![dd](https://github.com/user-attachments/assets/e5383503-2e4d-45d0-adff-77f3a0f82899)

![xkcd](https://github.com/user-attachments/assets/8de07b78-8346-4316-ae8c-855c305d625f)

> The xkcd-style cartoon was drawn with the help of the R package [xkcd](https://xkcd.r-forge.r-project.org/).

## :arrow_right: See also

- R package:
- For the comparison between data splitting and data fission, check .