https://github.com/cuadernin/class_tel_email
Email and phone clasification
https://github.com/cuadernin/class_tel_email
classification classification-algorithm emails pandas python regex telephone-number
Last synced: about 1 month ago
JSON representation
Email and phone clasification
- Host: GitHub
- URL: https://github.com/cuadernin/class_tel_email
- Owner: Cuadernin
- Created: 2022-01-11T19:00:54.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2022-11-08T18:02:47.000Z (over 3 years ago)
- Last Synced: 2024-05-17T16:59:19.633Z (about 2 years ago)
- Topics: classification, classification-algorithm, emails, pandas, python, regex, telephone-number
- Language: Python
- Homepage:
- Size: 22.5 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Classification of emails and telephones 📖
This algorithm classifies emails and telephones by levels according to different aspects.
## Use 📝
The use is simple. Just call the function and declare the parameters.
```
clas_emails(df,['EMAIL_1','EMAIL_2'])
```
Where *df* is the dataframe, *EMAIL_1* and *EMAIL_2* are the **columns** to be classified.
## Example 🔍
A new column is created with the levels that correspond to each email. \
**In the case of telephones, it works in a similar way.**
### **Note:**
PARTE 3 and PARTE 7 contain different commented lines. This is because there are different regular expression patterns capable of detecting an email as "invalid". For example, with a regular expression __brian-95smith@yahoo.com__ is a invalid email but in others it's valid. The same thing happens with david88/_s1@company.org which in some cases is invalid and in others valid. It could even depend on the domain as well. That's why I put two more expressions, to show how it identifies one email as invalido and other doesn't.