An open API service indexing awesome lists of open source software.

https://github.com/jenderal92/data-cleaning-tools

This tool is simple and effective for cleaning datasets in CSV format. With its features, you can improve data quality automatically.
https://github.com/jenderal92/data-cleaning-tools

data-cleaing-tools datacleaning python python27 remove-duplicates remove-empty-rows

Last synced: 8 months ago
JSON representation

This tool is simple and effective for cleaning datasets in CSV format. With its features, you can improve data quality automatically.

Awesome Lists containing this project

README

          

# data-cleaning-tools

This tool is simple and effective for cleaning datasets in CSV format. With its features, you can improve data quality automatically.

![DATA CLEANING TOOLS Jenderal92](https://github.com/user-attachments/assets/201bfe8b-9cb7-4c7c-af16-2db3a52cdf25)

## Features

  • Delete Blank Lines

  • Removing Duplicates

  • Text Normalization
  • ## How To Use

  • Download and install Python from the official Python website: https://www.python.org.

  • Create a raw dataset file in CSV format (example: test.csv).

  • This file must contain data with columns separated by commas.

  • Run the script with: python file.py.

  • Next follow the instructions in the terminal