{"id":22665513,"url":"https://github.com/edochiari/layoffs-data_cleaning","last_synced_at":"2026-01-27T22:02:56.178Z","repository":{"id":252214619,"uuid":"839771001","full_name":"EdoChiari/Layoffs-Data_cleaning","owner":"EdoChiari","description":"SQL script for cleaning a dataset related to work layoffs. It removes duplicates, standardizes data, handles null values, and eliminates irrelevant columns and rows, ensuring data integrity","archived":false,"fork":false,"pushed_at":"2024-08-09T15:36:21.000Z","size":58,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-04T00:39:10.562Z","etag":null,"topics":["datacleaning","layoffdata","sql"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/EdoChiari.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-08-08T09:43:24.000Z","updated_at":"2024-09-03T10:25:02.000Z","dependencies_parsed_at":"2024-12-09T13:40:48.465Z","dependency_job_id":"346bde23-f697-4242-ae1b-ec1776762c4d","html_url":"https://github.com/EdoChiari/Layoffs-Data_cleaning","commit_stats":null,"previous_names":["edochiari/sql-layoffs-data_cleaning","edochiari/layoffs-data_cleaning"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/EdoChiari/Layoffs-Data_cleaning","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/EdoChiari%2FLayoffs-Data_cleaning","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/EdoChiari%2FLayoffs-Data_cleaning/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/EdoChiari%2FLayoffs-Data_cleaning/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/EdoChiari%2FLayoffs-Data_cleaning/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/EdoChiari","download_url":"https://codeload.github.com/EdoChiari/Layoffs-Data_cleaning/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/EdoChiari%2FLayoffs-Data_cleaning/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28823922,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-27T18:44:20.126Z","status":"ssl_error","status_checked_at":"2026-01-27T18:44:09.161Z","response_time":168,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["datacleaning","layoffdata","sql"],"created_at":"2024-12-09T13:32:10.173Z","updated_at":"2026-01-27T22:02:56.151Z","avatar_url":"https://github.com/EdoChiari.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# SQL-layoffs-data_cleaning\n\nWhen dealing with a dataset related to work layoffs, it’s crucial to maintain data integrity and ensure that the information is clean and ready for analysis. This SQL script is designed to prepare the dataset by performing several essential data cleaning tasks:\n\n1.\tRemoving Duplicates:\n\t•\tDuplicate records can lead to inaccurate analyses, such as overestimating the number of layoffs. The script identifies and removes any duplicate rows to ensure that each record in the dataset is unique.\n2.\tStandardizing Data:\n\t•\tInconsistent formatting (e.g., varying date formats, inconsistent case in text fields) can create confusion and errors in subsequent analyses. This step standardizes key fields to ensure uniformity, making the data more reliable and easier to query.\n4.\tHandling Null or Blank Values:\n\t•\tNull or blank values can disrupt analysis and lead to misleading results. The script identifies fields with null or blank values and either fills them with appropriate default values or flags them for further review.\n5.\tRemoving Unnecessary Columns and Rows:\n\t•\tNot all data in the dataset may be relevant to the analysis of work layoffs. The script removes columns that don’t contribute to the analysis and rows that are irrelevant or incomplete, resulting in a streamlined and more focused dataset.\n\nThis code provides a robust foundation for further exploration and analysis of the layoffs dataset, ensuring that the data is clean, consistent, and ready for any subsequent operations.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fedochiari%2Flayoffs-data_cleaning","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fedochiari%2Flayoffs-data_cleaning","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fedochiari%2Flayoffs-data_cleaning/lists"}