{"id":26234565,"url":"https://github.com/sadia-khan13/data-preprocessing","last_synced_at":"2026-04-11T03:32:29.917Z","repository":{"id":281690503,"uuid":"946070768","full_name":"Sadia-Khan13/Data-preprocessing","owner":"Sadia-Khan13","description":"Welcome to the Data preprocessing Repository! This repository is dedicated to showcase the comprehensive resources and implementations related to Data Preprocessing using Python and Jupyter Notebook.","archived":false,"fork":false,"pushed_at":"2025-03-10T16:13:13.000Z","size":354,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"my-new-branch","last_synced_at":"2025-03-10T17:27:52.076Z","etag":null,"topics":["artificial-intelligence","data-analysis","data-mining","data-preprocessing","data-science","jupyter-notebook","matplotlib","numpy","pandas","python","seaborn-python","sklearn"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Sadia-Khan13.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2025-03-10T15:05:56.000Z","updated_at":"2025-03-10T16:13:18.000Z","dependencies_parsed_at":"2025-03-10T17:38:08.469Z","dependency_job_id":null,"html_url":"https://github.com/Sadia-Khan13/Data-preprocessing","commit_stats":null,"previous_names":["sadia-khan13/data-preprocessing"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Sadia-Khan13%2FData-preprocessing","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Sadia-Khan13%2FData-preprocessing/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Sadia-Khan13%2FData-preprocessing/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Sadia-Khan13%2FData-preprocessing/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Sadia-Khan13","download_url":"https://codeload.github.com/Sadia-Khan13/Data-preprocessing/tar.gz/refs/heads/my-new-branch","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243324617,"owners_count":20273135,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["artificial-intelligence","data-analysis","data-mining","data-preprocessing","data-science","jupyter-notebook","matplotlib","numpy","pandas","python","seaborn-python","sklearn"],"created_at":"2025-03-13T02:19:13.282Z","updated_at":"2025-12-30T22:09:11.225Z","avatar_url":"https://github.com/Sadia-Khan13.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# 🛠 Data Preprocessing with Python\n\nThis repository contains essential techniques and implementations for Data Preprocessing using Python and Jupyter Notebook. Data preprocessing is a critical step in any data science or machine learning workflow, ensuring raw data is clean, structured, and ready for analysis.\n\n📂 Repository Contents\n\n🧹 Data Cleaning – Handling missing values, duplicates, and inconsistencies\n \n 🔄 Data Transformation – Scaling, normalization, and encoding categorical data\n \n 🏗️ Feature Engineering – Creating, modifying, and selecting important features\n \n 🔻 Dimensionality Reduction – PCA, LDA, and other techniques\n \n 🚨 Outlier Detection \u0026 Handling – Identifying and dealing with anomalies\n \n 📊 Real-world Case Studies – Applying preprocessing techniques on real datasets\n\n 🛠 Tools \u0026 Technologies Used\n \nProgramming Language: Python 🐍\n\nNotebook Environment: Jupyter Notebook 📒\n\nKey Libraries: NumPy, Pandas, Scikit-learn, Matplotlib, Seaborn, etc.\n\nThis repository serves as a valuable reference for anyone working with data, from beginners to experienced data scientists\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsadia-khan13%2Fdata-preprocessing","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsadia-khan13%2Fdata-preprocessing","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsadia-khan13%2Fdata-preprocessing/lists"}