An open API service indexing awesome lists of open source software.

https://github.com/fusi3/natural_language_coursework

Assessing the impact of different pre-processing techniques for classifying the sentiment of movie reviews
https://github.com/fusi3/natural_language_coursework

bag-of-words latent-semantic-analysis lemmatization multilayer-perceptron nlp sentiment-analysis stemming support-vector-machines tfidf

Last synced: 2 months ago
JSON representation

Assessing the impact of different pre-processing techniques for classifying the sentiment of movie reviews

Awesome Lists containing this project

README

        

# natural_language_coursework

This worked focused on attempting various kinds of preprocessing to see the impact that they may have on the classification of sentiment for movie reviews. The preprocessing was tested with Multilayer Perceptrons and Support Vector Machines. This included various n-gram levels through BoW and TF-IDF for stemming and lemming. The effect of Latent Semantic Analysis was also assessed, however it seemed that the best performance came through using stemming and uni gram tf-idf. Please read the other read me to run the notebooks.