https://github.com/am-ankitgit/webscrapping_nlp_sentiment_analyisis
To Collect Data From Various Website To Perform Sentiment Analyisis and Various possible Variable
https://github.com/am-ankitgit/webscrapping_nlp_sentiment_analyisis
sentiment-analysis webscraping
Last synced: about 1 year ago
JSON representation
To Collect Data From Various Website To Perform Sentiment Analyisis and Various possible Variable
- Host: GitHub
- URL: https://github.com/am-ankitgit/webscrapping_nlp_sentiment_analyisis
- Owner: AM-Ankitgit
- Created: 2023-02-24T02:24:52.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-04-10T14:41:29.000Z (about 3 years ago)
- Last Synced: 2025-02-15T20:52:15.905Z (over 1 year ago)
- Topics: sentiment-analysis, webscraping
- Language: Jupyter Notebook
- Homepage:
- Size: 20.5 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# WebScrapping_NLP_Sentiment_Analysis
## To Collect Data From Various Website To Perform Sentiment Analyisis and Calculate possible Variable
## 1)Import All Required Dependencies
## 2)Data Collection
## 3)Create Function to Perform Text Analysis
➢ Count Syllables Word
➢ Count Complex Words
➢ Count Words
➢ Average Word Length
➢ Fog Index
➢ Personal Pronouns
## 4)Data Cleaning Function
➢ Remove Space
➢ Get Expanded
➢ Get Handle Accented
➢ Remove HTML Tags
➢ Remove Stopword
## 5)Data Extraction and Text Analysis
➢ Create Empty List To append the data
➢ Request the HTML page
➢ Extract the article title and title
➢ Calling function and passing and perform Data cleaning
➢ Save the Clean article to text file
➢ for text processing used TextBlob Library and perform sentiment analysis
## 6)Calculate all the Variable
## 7)Converted into the DataFrame
## 8)Convert DataFrame into excel file