https://github.com/mikaremi/simple-data-cleaning
https://github.com/mikaremi/simple-data-cleaning
Last synced: 5 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/mikaremi/simple-data-cleaning
- Owner: Mikaremi
- Created: 2024-07-29T11:27:09.000Z (12 months ago)
- Default Branch: main
- Last Pushed: 2024-07-29T11:34:23.000Z (12 months ago)
- Last Synced: 2024-12-30T22:17:16.711Z (7 months ago)
- Language: Jupyter Notebook
- Size: 37.1 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Simple-Data-Cleaning
The steps included:
* Stripping trailing and leading spaces form the columns in the dataset
* Addressing the NaN values by replacing them with np.nan
* Standardizing the date colum (vist date)
* filling the missing values in the numeric columns using the mean strategy
* filling the missing values in the categorical columns using a place holder or a most frquent value '(mode()[0])'
* Saving the cleaned data to new csv file