https://github.com/thepycoach/data_preprocessing
Data cleaning, Tokenization, Regular Expressions and Pandas guide.
https://github.com/thepycoach/data_preprocessing
Last synced: 4 months ago
JSON representation
Data cleaning, Tokenization, Regular Expressions and Pandas guide.
- Host: GitHub
- URL: https://github.com/thepycoach/data_preprocessing
- Owner: thepycoach
- Created: 2021-03-06T18:49:16.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2022-12-07T21:38:33.000Z (over 2 years ago)
- Last Synced: 2024-08-13T07:03:29.145Z (8 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 1.2 MB
- Stars: 63
- Watchers: 5
- Forks: 70
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- jimsghstars - thepycoach/data_preprocessing - Data cleaning, Tokenization, Regular Expressions and Pandas guide. (Jupyter Notebook)
README
# Data Preprocessing in Python
This repository contains all the articles I published related to data preprocessing techniques in Python- Guide to Cleaning and Preparing Data in Python: https://towardsdatascience.com/a-straightforward-guide-to-cleaning-and-preparing-data-in-python-8c82f209ae33
- 5 Simple Ways to Tokenize Text in Python: https://towardsdatascience.com/5-simple-ways-to-tokenize-text-in-python-92c6804edfc4
- Guide to Move from Excel to Python (Pandas/Numpy Guide): https://towardsdatascience.com/a-complete-yet-simple-guide-to-move-from-excel-to-python-d664e5683039
- Guide to Regular Expressions https://towardsdatascience.com/a-simple-and-intuitive-guide-to-regular-expressions-404b057b1081