https://github.com/anurima-saha/yelp_review_classification_with_roberta
The project begins by web scraping Yelp reviews and ratings with 'BeautifulSoup' in Python, followed by natural language processing (text cleaning, stopword removal, tokenization, and lemmatization) using 'NLTK'. RoBERTa from 'HuggingFace' is then fine-tuned for text classification with early stopping and regularization using PyTorch.
- Host: GitHub
- URL: https://github.com/anurima-saha/yelp_review_classification_with_roberta
- Owner: anurima-saha
- License: mit
- Created: 2024-12-22T05:23:43.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-12-22T05:39:49.000Z (10 months ago)
- Last Synced: 2025-02-19T21:14:18.754Z (8 months ago)
- Topics: beautifulsoup4, deep-learning, early-stopping, fine-tuning, huggingface-transformers, large-language-model, natural-language-processing, nltk-python, pytorch, roberta-model, roberta-tokenizer, webscapping
- Language: Jupyter Notebook
- Homepage:
- Size: 227 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Yelp_Review_Classification_with_RoBERTa
The project begins by web scraping Yelp reviews and ratings with 'BeautifulSoup' in Python, followed by natural language processing (text cleaning, stopword removal, tokenization, and lemmatization) using 'NLTK'. RoBERTa from 'HuggingFace' is then fine-tuned for text classification with early stopping and regularization using PyTorch.
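The description mentions early stopping during fine-tuning, but the notebook's own implementation is not shown here. As a rough illustration only, the idea can be sketched in plain Python: track the best validation loss seen so far and stop once it has failed to improve for a fixed number of epochs. The class name `EarlyStopping`, the helper `run_with_early_stopping`, the `patience` and `min_delta` parameters, and the loss values are all hypothetical and are not taken from the repository.

```python
class EarlyStopping:
    """Signal a stop when validation loss stops improving (illustrative sketch)."""

    def __init__(self, patience=2, min_delta=0.0):
        self.patience = patience      # epochs to wait without improvement
        self.min_delta = min_delta    # smallest decrease that counts as improvement
        self.best_loss = float("inf")
        self.bad_epochs = 0

    def step(self, val_loss):
        """Record one epoch's validation loss; return True if training should stop."""
        if val_loss < self.best_loss - self.min_delta:
            self.best_loss = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience


def run_with_early_stopping(val_losses, patience=2):
    """Replay a list of per-epoch validation losses; return (stop_epoch, best_loss)."""
    stopper = EarlyStopping(patience=patience)
    for epoch, loss in enumerate(val_losses, start=1):
        if stopper.step(loss):
            return epoch, stopper.best_loss
    return len(val_losses), stopper.best_loss


# Made-up validation losses, used only to exercise the logic.
stop_epoch, best = run_with_early_stopping([0.9, 0.7, 0.65, 0.66, 0.70, 0.60])
print(stop_epoch, best)
```

In an actual PyTorch fine-tuning loop, `step` would typically be called once per epoch after the validation pass, and the model weights would usually be checkpointed whenever the loss improves so the best epoch can be restored after stopping.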