https://github.com/simhayn/genomics-cannabis-bigquery
BigQuery's Cannabis_Genomics Dataset Exploration using SQL in a Python Environment
- Host: GitHub
- URL: https://github.com/simhayn/genomics-cannabis-bigquery
- Owner: simhayn
- Created: 2024-07-25T13:43:55.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-07-25T17:13:49.000Z (4 months ago)
- Last Synced: 2024-10-13T01:22:25.526Z (26 days ago)
- Topics: big-data, bigquery, bioinformatics, exploratory-data-analysis, genomics, python, sql
- Language: Jupyter Notebook
- Homepage:
- Size: 584 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
## Cannabis Genomics Data Analysis
For the interactive plots, open the [notebook](https://www.kaggle.com/code/natalyyakobov/bigquery-sql-cannabis-genomics) on Kaggle.

This project explores and analyzes the Cannabis genomics dataset available in Google BigQuery's public datasets, using SQL for data mining within a Python environment.
The analysis draws on statistics, exploratory data analysis (EDA), and database techniques, with Plotly Express for the interactive visualizations.
By filtering relevant columns, joining tables, and visualizing data, this project demonstrates how to use SQL and Python to uncover valuable insights into the genetics of Cannabis.
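
The filter-and-join pattern described above can be sketched as follows. This is a minimal illustration, not the notebook's actual code: it uses an in-memory SQLite database from the Python standard library as a stand-in for BigQuery so that it runs without credentials, and the `samples` and `variants` tables, their columns, and the `variants_per_strain` helper are all hypothetical rather than taken from the real dataset schema.

```python
import sqlite3

# Hypothetical stand-in tables; the real BigQuery dataset has its own schema.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE samples (sample_id TEXT PRIMARY KEY, strain TEXT);
CREATE TABLE variants (sample_id TEXT, contig TEXT, quality REAL);
INSERT INTO samples VALUES ('S1', 'strain_a'), ('S2', 'strain_b');
INSERT INTO variants VALUES
    ('S1', 'contig_1', 55.0),
    ('S1', 'contig_2', 12.0),
    ('S2', 'contig_1', 80.0),
    ('S2', 'contig_3', 91.0);
""")

# Select only the needed columns, join the two tables on the sample key,
# filter out low-quality rows, and count what remains per strain.
QUERY = """
SELECT s.strain, COUNT(*) AS n_variants
FROM variants AS v
JOIN samples AS s ON s.sample_id = v.sample_id
WHERE v.quality >= ?
GROUP BY s.strain
ORDER BY s.strain
"""


def variants_per_strain(connection, min_quality):
    """Return (strain, n_variants) rows for variants at or above min_quality."""
    return connection.execute(QUERY, (min_quality,)).fetchall()


rows = variants_per_strain(conn, 50.0)
for strain, n_variants in rows:
    print(strain, n_variants)
```

Against BigQuery itself, a query of the same shape would typically be submitted through the `google-cloud-bigquery` client and loaded into a pandas DataFrame (for example with `Client.query(...).to_dataframe()`), which can then be passed to Plotly Express for plotting. Whether the notebook follows exactly this route is an assumption.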