Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/li-pearl/gene-count-normalizer
First step of data wrangling in MERFISH data project
https://github.com/li-pearl/gene-count-normalizer
data-analysis merfish merscope python
Last synced: about 2 months ago
JSON representation
First step of data wrangling in MERFISH data project
- Host: GitHub
- URL: https://github.com/li-pearl/gene-count-normalizer
- Owner: li-pearl
- License: mit
- Created: 2024-06-03T17:33:48.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2024-06-12T22:38:35.000Z (7 months ago)
- Last Synced: 2024-06-13T22:03:18.211Z (7 months ago)
- Topics: data-analysis, merfish, merscope, python
- Language: Python
- Homepage:
- Size: 85.9 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
README
# Gene Count CSV Normalizer Tool
## Overview
![tool after being run](./images/used_file.png)This is a specialized tool with a simple user-friendly application designed to prepare a large dataset for further analysis. The tool can "normalize" gene counts for each cell in a large MERSCOPE-generated CSV file containing cell by gene data using cell volume data from another large CSV with cell metadata values to account for differences in imaged cell volumes. This tool provides a simple interface for selecting files, performing normalization, and downloading the results. The results are formatted in the same manner as the original cell by gene CSV.
## Features
- File Selection Interface
- Normalization:
- Normalizes gene count fields to account for differences in imaged cell volumes, preparing the dataset for further data analysis
- Automatically skips specified columns (e.g., Brain Region ID, Brain Region Name).
- Renames the gene count fields with a _normalized suffix.
- Notifies users of any cells in the count file that do not have corresponding volume data.
- Downloading Normalized Data:
- Download the newly generated CSV file.
- The new file name is prefixed with normalized_.## Installation
1. Clone the Repository:
```
git clone https://github.com/li-pearl/gene-count-normalizer.git
cd gene-count-normalizer
```2. Install Dependencies:
```
pip install pandas
```3. Run the Tool:
```
python normalizer.py
```## Usage
![tool without being run](./images/unused_file.png)1. Select Files: Choose cell by gene and cell metadata files using the buttons.
2. Normalize Data: Click the "Create and Download Normalized Data" button to perform the normalization process and download the new CSV.