Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/themuhd/clinical-trials-analysis
Repository contains analysis performed on clinical trials and pharmaceutical violations datasets
https://github.com/themuhd/clinical-trials-analysis
Last synced: about 22 hours ago
JSON representation
Repository contains analysis performed on clinical trials and pharmaceutical violations datasets
- Host: GitHub
- URL: https://github.com/themuhd/clinical-trials-analysis
- Owner: themuhd
- Created: 2023-09-13T21:52:11.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-09-13T23:57:13.000Z (about 1 year ago)
- Last Synced: 2023-09-14T12:57:57.572Z (about 1 year ago)
- Language: Jupyter Notebook
- Size: 30.5 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Clinical-Trials-Analysis
The repository contains analysis performed on clinical trials and pharmaceutical violation datasets
Three different implementations were used per assessment requirements to showcase different styles of working with data in Spark —DataFrame for a structured approach, Spark SQL for SQL-based analysis, and RDD for more control over data manipulation.## The Data
1.Clinicaltrial_.csv:
Each row represents an individual clinical trial, identified by an Id, listing the sponsor
(Sponsor), the status of the study at the time of the file’s download (Status), the start
and completion dates (Start and Completion respectively), the type of study (Type),
when the trial was first submitted (Submission), and the lists of conditions the trial
concerns (Conditions) and the interventions explored (Interventions). Individual
commas separate conditions and interventions.
(Source: https://ClinicalTrials.gov)2. pharma.csv:
The file contains a small number of publicly available lists of pharmaceutical
violations. For the purposes of this work, we are interested in the second column,
Parent Company, which contains the name of the pharmaceutical company in
question.
(Source: https://violationtracker.goodjobsfirst.org/industry/pharmaceuticals)