Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/easonlai/news-scraping-and-text-analytics
https://github.com/easonlai/news-scraping-and-text-analytics
azure azure-cognitive-services azure-text-analysis beautifulsoup beautifulsoup4 data-scraping microsoft-azure news-scraper pandas pandas-dataframe python python3 web-scraping
Last synced: about 2 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/easonlai/news-scraping-and-text-analytics
- Owner: easonlai
- Created: 2021-08-01T16:13:23.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2021-08-02T09:38:53.000Z (over 3 years ago)
- Last Synced: 2024-04-28T05:09:40.378Z (8 months ago)
- Topics: azure, azure-cognitive-services, azure-text-analysis, beautifulsoup, beautifulsoup4, data-scraping, microsoft-azure, news-scraper, pandas, pandas-dataframe, python, python3, web-scraping
- Language: Jupyter Notebook
- Homepage:
- Size: 58.6 KB
- Stars: 1
- Watchers: 1
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# News Scraping and Text Analytics (Sentiment Analysis)
This is demo repo to demostrate how to scrape [news headlines data from Yahoo Hong Kong](https://hk.news.yahoo.com/) by Python with library [BeautifulSoup 4](https://beautiful-soup-4.readthedocs.io/en/latest/). And then use [Azure Text Analytics](https://docs.microsoft.com/en-us/azure/cognitive-services/text-analytics/overview) to perform [sentiment analysis](https://docs.microsoft.com/en-us/azure/cognitive-services/text-analytics/overview#sentiment-analysis) for news headlines.
News headlines are collected from Yahoo Hong Kong, content language would be in Tranditional Chinese (Cantonese).
Pie Chart to visualize sentiment distribution of news headlines.
![alt text](https://github.com/easonlai/news-scraping-and-text-analytics/blob/main/git-images/git-image-1.png)