{"id":20831957,"url":"https://github.com/themuhd/clinical-trials-analysis","last_synced_at":"2025-03-12T08:25:26.750Z","repository":{"id":194651192,"uuid":"691286585","full_name":"themuhd/Clinical-Trials-Analysis","owner":"themuhd","description":"Repository contains analysis performed on clinical trials and pharmaceutical violations datasets","archived":false,"fork":false,"pushed_at":"2023-09-13T23:57:13.000Z","size":31986,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-18T20:37:14.461Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/themuhd.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2023-09-13T21:52:11.000Z","updated_at":"2024-01-19T15:16:35.000Z","dependencies_parsed_at":"2023-09-14T13:08:48.515Z","dependency_job_id":null,"html_url":"https://github.com/themuhd/Clinical-Trials-Analysis","commit_stats":null,"previous_names":["themuhd/clinical-trials-analysis"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/themuhd%2FClinical-Trials-Analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/themuhd%2FClinical-Trials-Analysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/themuhd%2FClinical-Trials-Analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/themuhd%2FClinical-Trials-Analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/themuhd","download_url":"https://codeload.github.com/themuhd/Clinical-Trials-Analysis/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243182291,"owners_count":20249604,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-18T00:09:39.603Z","updated_at":"2025-03-12T08:25:26.730Z","avatar_url":"https://github.com/themuhd.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Clinical-Trials-Analysis\nThe repository contains analysis performed on clinical trials and pharmaceutical violation datasets\nThree different implementations were used per assessment requirements to showcase different styles of working with data in Spark —DataFrame for a structured approach, Spark SQL for SQL-based analysis, and RDD for more control over data manipulation.\n\n## The Data\n1.Clinicaltrial_\u003cyear\u003e.csv:\nEach row represents an individual clinical trial, identified by an Id, listing the sponsor \n(Sponsor), the status of the study at the time of the file’s download (Status), the start \nand completion dates (Start and Completion respectively), the type of study (Type), \nwhen the trial was first submitted (Submission), and the lists of conditions the trial \nconcerns (Conditions) and the interventions explored (Interventions). Individual \ncommas separate conditions and interventions. \n(Source: https://ClinicalTrials.gov)\n\u003c/br\u003e\n2. pharma.csv:\nThe file contains a small number of publicly available lists of pharmaceutical \nviolations. For the purposes of this work, we are interested in the second column, \nParent Company, which contains the name of the pharmaceutical company in \nquestion. \n(Source: https://violationtracker.goodjobsfirst.org/industry/pharmaceuticals)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fthemuhd%2Fclinical-trials-analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fthemuhd%2Fclinical-trials-analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fthemuhd%2Fclinical-trials-analysis/lists"}