Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/easonlai/news-scraping-and-text-analytics


https://github.com/easonlai/news-scraping-and-text-analytics

azure azure-cognitive-services azure-text-analysis beautifulsoup beautifulsoup4 data-scraping microsoft-azure news-scraper pandas pandas-dataframe python python3 web-scraping

Last synced: about 2 months ago
JSON representation

Awesome Lists containing this project

README

        

# News Scraping and Text Analytics (Sentiment Analysis)

This is demo repo to demostrate how to scrape [news headlines data from Yahoo Hong Kong](https://hk.news.yahoo.com/) by Python with library [BeautifulSoup 4](https://beautiful-soup-4.readthedocs.io/en/latest/). And then use [Azure Text Analytics](https://docs.microsoft.com/en-us/azure/cognitive-services/text-analytics/overview) to perform [sentiment analysis](https://docs.microsoft.com/en-us/azure/cognitive-services/text-analytics/overview#sentiment-analysis) for news headlines.

News headlines are collected from Yahoo Hong Kong, content language would be in Tranditional Chinese (Cantonese).

Pie Chart to visualize sentiment distribution of news headlines.
![alt text](https://github.com/easonlai/news-scraping-and-text-analytics/blob/main/git-images/git-image-1.png)