{"id":23903004,"url":"https://github.com/elfgk/kc-house-data-analysis","last_synced_at":"2026-04-22T23:35:29.729Z","repository":{"id":269354383,"uuid":"907149473","full_name":"elfgk/KC-House-Data-Analysis","owner":"elfgk","description":"KC House Data Analysis","archived":false,"fork":false,"pushed_at":"2024-12-23T00:15:50.000Z","size":137,"stargazers_count":2,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-24T06:08:06.467Z","etag":null,"topics":["classification","dataanalysis","dataanalysis-projects","dataanalysisusingpython","dataanalystportfolio","jupyter-notebook","machine-learning","python","regression-models","xgbregressor"],"latest_commit_sha":null,"homepage":"https://www.kaggle.com/code/elfgkk/kc-house/notebook","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/elfgk.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-12-23T00:02:46.000Z","updated_at":"2025-01-05T15:26:44.000Z","dependencies_parsed_at":"2024-12-23T01:19:44.011Z","dependency_job_id":"5333110b-0218-4802-af9f-ba5ca475ac5f","html_url":"https://github.com/elfgk/KC-House-Data-Analysis","commit_stats":null,"previous_names":["elfgk/kc-house-data-analysis"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/elfgk/KC-House-Data-Analysis","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/elfgk%2FKC-House-Data-Analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/elfgk%2FKC-House-Data-Analysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/elfgk%2FKC-House-Data-Analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/elfgk%2FKC-House-Data-Analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/elfgk","download_url":"https://codeload.github.com/elfgk/KC-House-Data-Analysis/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/elfgk%2FKC-House-Data-Analysis/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32159959,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-22T17:06:48.269Z","status":"ssl_error","status_checked_at":"2026-04-22T17:06:19.037Z","response_time":58,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["classification","dataanalysis","dataanalysis-projects","dataanalysisusingpython","dataanalystportfolio","jupyter-notebook","machine-learning","python","regression-models","xgbregressor"],"created_at":"2025-01-04T22:51:55.663Z","updated_at":"2026-04-22T23:35:29.701Z","avatar_url":"https://github.com/elfgk.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# KC House Data Analysis\n\nThis project focuses on analyzing house price data for King County, Washington. The goal is to explore the dataset, perform data cleaning, feature engineering, and build a machine learning model to predict house prices.\n\nDataset: https://www.kaggle.com/datasets/harlfoxem/housesalesprediction\n\n## Project Overview\n\nThe dataset used in this project contains information about homes sold in King County, including house features such as square footage, number of bedrooms, location, and more. This project includes the following steps:\n\n1. **Exploratory Data Analysis (EDA):**\n   - Visualizing the data to understand trends, distributions, and correlations.\n   - Identifying missing values and outliers in the data.\n\n2. **Data Preprocessing:**\n   - Cleaning the data by handling missing values, encoding categorical variables, and scaling numerical features.\n\n3. **Feature Engineering:**\n   - Creating new features that can improve model performance.\n\n4. **Modeling:**\n   - Building machine learning models (such as linear regression, decision trees, etc.) to predict house prices.\n\n5. **Model Evaluation:**\n   - Evaluating models using various metrics such as Mean Absolute Error (MAE), Mean Squared Error (MSE), and R-squared.\n\n## Dataset\n\nThe dataset used in this project contains information about houses sold in King County, WA. It includes the following features:\n\n- `id`: Unique ID for each house\n- `date`: Date when the house was sold\n- `price`: Price of the house\n- `bedrooms`: Number of bedrooms\n- `bathrooms`: Number of bathrooms\n- `sqft_living`: Square footage of the house\n- `sqft_lot`: Square footage of the lot\n- `floors`: Number of floors in the house\n- `waterfront`: Whether the house has a waterfront view (1 if true, 0 if false)\n- `view`: Quality of the view (a rating from 0 to 4)\n- `condition`: Condition of the house (a rating from 1 to 5)\n- `grade`: Grade of the house (a rating from 1 to 13)\n- `sqft_above`: Square footage of the house above ground\n- `sqft_basement`: Square footage of the basement\n- `yr_built`: Year the house was built\n- `yr_renovated`: Year the house was renovated\n- `zipcode`: ZIP code of the house's location\n\n## Libraries Used\n\n- `pandas`: For data manipulation and analysis.\n- `numpy`: For numerical operations.\n- `matplotlib` and `seaborn`: For data visualization.\n- `scikit-learn`: For machine learning models and evaluation.\n\n## Getting Started\n\nTo run the analysis on your local machine, follow these steps:\n\n1. Clone or download the repository:\n\n   ```bash\n   git clone https://github.com/elfgk/KC-House-Data-Analysis.git\n   ```\n\n2. Install the required Python libraries.\n3. Open the kc-house.ipynb Jupyter notebook and follow the steps.\n\n\u003ch1 align=\"center\"\u003e 𓍢ִ໋☕️✧˚ ༘ ⋆ \u003c/h1\u003e\n\n\u003ch1\u003e Contact Me🧑‍💻: \u003c/h1\u003e\n\n[![LinkedIn](https://img.shields.io/badge/LinkedIn-0A66C2?style=for-the-badge\u0026logo=linkedin\u0026logoColor=white)](https://www.linkedin.com/in/elfgk/)\n[![Stack Overflow](https://img.shields.io/badge/StackOverflow-FE7A16?style=for-the-badge\u0026logo=stackoverflow\u0026logoColor=white)](https://stackoverflow.com/users/27559679/elfgk)\n[![Hugging Face](https://img.shields.io/badge/HuggingFace-9C30FF?style=for-the-badge\u0026logo=huggingface\u0026logoColor=white)](https://huggingface.co/elfgk)\n[![Kaggle](https://img.shields.io/badge/Kaggle-20BEFF?style=for-the-badge\u0026logo=kaggle\u0026logoColor=white)](https://www.kaggle.com/elfgkk)\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Felfgk%2Fkc-house-data-analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Felfgk%2Fkc-house-data-analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Felfgk%2Fkc-house-data-analysis/lists"}