An open API service indexing awesome lists of open source software.

https://github.com/brianruizy/web-scraping-nlp

Machine Learning - Personal Python project demonstrating a data analytic technique known as Web Scraping + Natural Language Processing performed on a real webpage.
https://github.com/brianruizy/web-scraping-nlp

Last synced: 2 months ago
JSON representation

Machine Learning - Personal Python project demonstrating a data analytic technique known as Web Scraping + Natural Language Processing performed on a real webpage.

Awesome Lists containing this project

README

          


Lumen
Web Scraping



Personal portfolio project demonstrating a data analytic technique known as web scraping, on this particular project the analysis will be performed onto Barnes & Noble data.

## About
Web scraping is a technique used to extract certain elements from web pages to develop quantitative resolutions from data. With the use of Sentiment Analysis, a useful method for detecting expression in text, web scraping can become a valuable application for many companies or brands who desire a deeper insight into their clients/ customers.

For this particular project, I extracted data from a *Barnes & Noble* webpage depicting a particular used-book seller feedback. I managed to gather the sites .html through *Chrome's Developer Tools*.

## Dependencies
* textblob
* bs4 (BeautifulSoup)
* termcolor

## Result
The output of some of the "reviews" with the respective polarity preceeding.

![screen shot 2019-03-01 at 2 43 09 am](https://user-images.githubusercontent.com/23439187/53626620-d5084680-3bcb-11e9-89e8-36488b1ba566.png)

The output of the count of both positive and negative reviews.

![screen shot 2019-03-01 at 2 36 48 am](https://user-images.githubusercontent.com/23439187/53626312-ff0d3900-3bca-11e9-99bf-d930d65d4ac7.png)

------
#### © [Brian Ruiz](https://github.com/BrianRuizy)