https://github.com/brianruizy/web-scraping-nlp

Machine Learning - Personal Python project demonstrating a data analytic technique known as Web Scraping + Natural Language Processing performed on a real webpage.
https://github.com/brianruizy/web-scraping-nlp

Last synced: 2 months ago
JSON representation

Machine Learning - Personal Python project demonstrating a data analytic technique known as Web Scraping + Natural Language Processing performed on a real webpage.

Host: GitHub
URL: https://github.com/brianruizy/web-scraping-nlp
Owner: BrianRuizy
Created: 2019-03-01T07:52:59.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2021-06-07T20:00:53.000Z (over 4 years ago)
Last Synced: 2025-04-07T21:51:24.606Z (6 months ago)
Language: HTML
Homepage:
Size: 88.9 KB
Stars: 3
Watchers: 0
Forks: 1
Open Issues: 1
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

Web Scraping

Personal portfolio project demonstrating a data analytic technique known as web scraping, on this particular project the analysis will be performed onto Barnes & Noble data.

## About
Web scraping is a technique used to extract certain elements from web pages to develop quantitative resolutions from data. With the use of Sentiment Analysis, a useful method for detecting expression in text, web scraping can become a valuable application for many companies or brands who desire a deeper insight into their clients/ customers.

For this particular project, I extracted data from a *Barnes & Noble* webpage depicting a particular used-book seller feedback. I managed to gather the sites .html through *Chrome's Developer Tools*.

## Dependencies
* textblob
* bs4 (BeautifulSoup)
* termcolor

## Result
The output of some of the "reviews" with the respective polarity preceeding.

![screen shot 2019-03-01 at 2 43 09 am](https://user-images.githubusercontent.com/23439187/53626620-d5084680-3bcb-11e9-89e8-36488b1ba566.png)

The output of the count of both positive and negative reviews.

![screen shot 2019-03-01 at 2 36 48 am](https://user-images.githubusercontent.com/23439187/53626312-ff0d3900-3bca-11e9-99bf-d930d65d4ac7.png)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/brianruizy/web-scraping-nlp

Awesome Lists containing this project

README

Web Scraping

Personal portfolio project demonstrating a data analytic technique known as web scraping, on this particular project the analysis will be performed onto Barnes & Noble data.