Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/li-pearl/gene-count-normalizer

First step of data wrangling in MERFISH data project
https://github.com/li-pearl/gene-count-normalizer

data-analysis merfish merscope python

Last synced: about 2 months ago
JSON representation

First step of data wrangling in MERFISH data project

Host: GitHub
URL: https://github.com/li-pearl/gene-count-normalizer
Owner: li-pearl
License: mit
Created: 2024-06-03T17:33:48.000Z (7 months ago)
Default Branch: main
Last Pushed: 2024-06-12T22:38:35.000Z (7 months ago)
Last Synced: 2024-06-13T22:03:18.211Z (7 months ago)
Topics: data-analysis, merfish, merscope, python
Language: Python
Homepage:
Size: 85.9 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.md

Awesome Lists containing this project

README

# Gene Count CSV Normalizer Tool
## Overview
![tool after being run](./images/used_file.png)

This is a specialized tool with a simple user-friendly application designed to prepare a large dataset for further analysis. The tool can "normalize" gene counts for each cell in a large MERSCOPE-generated CSV file containing cell by gene data using cell volume data from another large CSV with cell metadata values to account for differences in imaged cell volumes. This tool provides a simple interface for selecting files, performing normalization, and downloading the results. The results are formatted in the same manner as the original cell by gene CSV.

## Features
- File Selection Interface
- Normalization:
- Normalizes gene count fields to account for differences in imaged cell volumes, preparing the dataset for further data analysis
- Automatically skips specified columns (e.g., Brain Region ID, Brain Region Name).
- Renames the gene count fields with a _normalized suffix.
- Notifies users of any cells in the count file that do not have corresponding volume data.
- Downloading Normalized Data:
- Download the newly generated CSV file.
- The new file name is prefixed with normalized_.

## Installation
1. Clone the Repository:
```
git clone https://github.com/li-pearl/gene-count-normalizer.git
cd gene-count-normalizer
```

2. Install Dependencies:
```
pip install pandas
```

3. Run the Tool:
```
python normalizer.py
```

## Usage
![tool without being run](./images/unused_file.png)

1. Select Files: Choose cell by gene and cell metadata files using the buttons.
2. Normalize Data: Click the "Create and Download Normalized Data" button to perform the normalization process and download the new CSV.