https://github.com/salvatoreamaddio/pipeline-assessment
Pipeline Simulation
anaconda jupiter-notebook mysql mysql-database numpy pandas pandas-python python sparql sparql-query
Last synced: 2 months ago
- Host: GitHub
- URL: https://github.com/salvatoreamaddio/pipeline-assessment
- Owner: SalvatoreAmaddio
- Created: 2024-04-03T12:47:35.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-04-07T23:35:49.000Z (about 1 year ago)
- Last Synced: 2025-01-26T12:42:20.503Z (4 months ago)
- Topics: anaconda, jupiter-notebook, mysql, mysql-database, numpy, pandas, pandas-python, python, sparql, sparql-query
- Language: Jupyter Notebook
- Homepage: https://salvatoreamaddio.co.uk/
- Size: 2.13 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# PIPELINE ASSESSMENT
# INTRODUCTION
This assessment simulates a data pipeline. It is written in Python in a Jupyter Notebook, and the project was developed using Anaconda.
## The Data
The data covers mental health and its trends over the years. It is taken from two different sources:
- The first source is the Scottish Government. Its data is retrieved through an API based on **SPARQL**.
- The second source is a CSV file from **Kaggle**. The file can be downloaded at https://www.kaggle.com/datasets/thedevastator/uncover-global-trends-in-mental-health-disorder.
### Before executing this file
This project needs a few packages to run, which are listed in the **requirements.txt** file. The project also creates a **MySQL** database, so change the UserName and Password variables before running the file. They are at the top of the file, under a visible heading called **MYSQL USER AND PASSWORD**. Make sure your MySQL Server is up and running.
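
As a rough illustration of the credentials step, the sketch below shows one way the user name and password could feed into the database creation. It is not taken from the notebook: the names `MYSQL_USER`, `MYSQL_PASSWORD`, `DATABASE_NAME`, `build_connection_args`, `create_database_sql` and `create_database` are hypothetical, and the use of the `mysql-connector-python` driver is an assumption (check **requirements.txt** for the driver the project actually uses).

```python
# Hypothetical settings: the notebook's real variable names may differ.
MYSQL_USER = "root"               # replace with your MySQL user name
MYSQL_PASSWORD = "change-me"      # replace with your MySQL password
DATABASE_NAME = "mental_health"   # assumed database name, for illustration only


def build_connection_args(user, password, host="localhost"):
    """Collect the keyword arguments passed to the MySQL driver."""
    return {"host": host, "user": user, "password": password}


def create_database_sql(name):
    """Return a CREATE DATABASE statement for a simple identifier-like name."""
    # Database names cannot be bound as query parameters, so validate the
    # name before interpolating it into the statement.
    if not name.isidentifier():
        raise ValueError(f"unsafe database name: {name!r}")
    return f"CREATE DATABASE IF NOT EXISTS `{name}`"


def create_database(name, **connection_args):
    """Connect to a running MySQL server and create the database if missing."""
    # Assumption: the mysql-connector-python package is installed.
    import mysql.connector

    connection = mysql.connector.connect(**connection_args)
    try:
        cursor = connection.cursor()
        cursor.execute(create_database_sql(name))
        cursor.close()
    finally:
        connection.close()
```

With the server running, calling `create_database(DATABASE_NAME, **build_connection_args(MYSQL_USER, MYSQL_PASSWORD))` would create the database once. The driver import sits inside `create_database` so the helper functions can be checked without a MySQL installation.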