https://github.com/simhayn/genomics-cannabis-bigquery

BigQuery's Cannabis_Genomics Dataset Exploration using SQL in a Python Environment

big-data bigquery bioinformatics exploratory-data-analysis genomics python sql

## Cannabis Genomics Data Analysis

For the interactive plots, open the
[notebook](https://www.kaggle.com/code/natalyyakobov/bigquery-sql-cannabis-genomics)
on Kaggle.

This project explores and analyzes the Cannabis genomics dataset from Google BigQuery's public datasets, using SQL for data mining within a Python environment.
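
As a rough illustration of running SQL from Python against a BigQuery public dataset, the sketch below builds a column-filtered preview query and hands it to the `google-cloud-bigquery` client. The dataset ID `bigquery-public-data.genomics_cannabis` and the helper names `build_preview_query` and `run_query` are assumptions made for this example, not taken from the notebook, and any table or column name passed in should be checked against the actual schema.

```python
from typing import Sequence

# Assumed dataset ID; verify it in the BigQuery console before use.
DATASET = "bigquery-public-data.genomics_cannabis"


def build_preview_query(table: str, columns: Sequence[str], limit: int = 10) -> str:
    """Build a SELECT that keeps only the named columns of one table."""
    cols = ", ".join(f"`{c}`" for c in columns)
    return f"SELECT {cols} FROM `{DATASET}.{table}` LIMIT {int(limit)}"


def run_query(sql: str):
    """Run SQL on BigQuery and return the result as a pandas DataFrame.

    Needs the google-cloud-bigquery package (with pandas support) and
    configured Google Cloud credentials, so it is imported lazily here.
    """
    from google.cloud import bigquery

    client = bigquery.Client()
    return client.query(sql).to_dataframe()
```

Building the query string separately from executing it makes it easy to print and inspect the SQL before spending any query quota.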

The analysis draws on techniques from statistics, exploratory data analysis (EDA), and database systems. Plotly Express is used for the interactive visualizations.

By filtering relevant columns, joining tables, and visualizing data, this project demonstrates how to use SQL and Python to uncover valuable insights into the genetics of Cannabis.
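
The filter, join, and visualize pattern described above can be sketched locally. This example uses an in-memory SQLite database as a stand-in for BigQuery, with a hypothetical two-table schema (`samples`, `variants`) and placeholder rows that are not drawn from the real dataset. The helper names are likewise invented for illustration. Only the SQL shape is meant to carry over.

```python
import sqlite3

# Stand-in for BigQuery: hypothetical schema with placeholder rows,
# NOT values from the real Cannabis genomics dataset.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE samples (sample_id TEXT, library TEXT, notes TEXT);
CREATE TABLE variants (sample_id TEXT, contig TEXT, quality REAL);
INSERT INTO samples VALUES ('S1', 'libA', 'x'), ('S2', 'libB', 'y');
INSERT INTO variants VALUES
    ('S1', 'c1', 50.0), ('S1', 'c2', 10.0), ('S2', 'c1', 40.0);
""")

# Select only the relevant columns, join the two tables on sample_id,
# filter by a quality threshold, and count variants per sample.
JOIN_SQL = """
SELECT s.sample_id, s.library, COUNT(*) AS n_variants
FROM samples AS s
JOIN variants AS v ON v.sample_id = s.sample_id
WHERE v.quality >= ?
GROUP BY s.sample_id, s.library
ORDER BY s.sample_id
"""


def variants_per_sample(min_quality: float):
    """Return (sample_id, library, n_variants) rows above the threshold."""
    return conn.execute(JOIN_SQL, (min_quality,)).fetchall()


def plot_counts(rows):
    """Build an interactive bar chart of variant counts (needs plotly)."""
    import plotly.express as px

    return px.bar(
        x=[r[0] for r in rows],
        y=[r[2] for r in rows],
        labels={"x": "sample", "y": "variants"},
    )
```

Calling `plot_counts(variants_per_sample(30.0)).show()` would render the chart in a notebook. On BigQuery the same query shape applies, with fully qualified table names and named query parameters in place of the `?` placeholder.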