Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mariooohzc/gender-prediction-datacamp-project


https://github.com/mariooohzc/gender-prediction-datacamp-project

Last synced: 26 days ago
JSON representation

Awesome Lists containing this project

README

        

# Gender Prediction using sounds
This project identifies the gender of best selling authors from 2008 to 2017 listed out by the New York Times.
The datasets used in this project is directly obtained from DataCamp.

Given the potential for names to share similar pronunciations despite variations in spelling, the fuzzy library is utilised to check if two names sound the same. The babynames_nysiis.csv, which contains the unique NYSIIS versions of baby names and the corresponding genders, is cross-referenced with the NYSIIS equivalent of the author's first name.

The result of this project show that there are more female authors than male authors on the New York Times best seller's list throughout the years, 2008 - 2017.