https://github.com/chatterjeesaurabh/natural-language-processing
Text Preprocessing, Embedding Methods such as BoW, TF-IDF and Word2Vec, Text Classification using LSTM, Topic Modeling with LDA and BERTopic.
https://github.com/chatterjeesaurabh/natural-language-processing
embedding-models natural-language-processing nlp-machine-learning text-classification topic-modeling
Last synced: 3 days ago
JSON representation
Text Preprocessing, Embedding Methods such as BoW, TF-IDF and Word2Vec, Text Classification using LSTM, Topic Modeling with LDA and BERTopic.
- Host: GitHub
- URL: https://github.com/chatterjeesaurabh/natural-language-processing
- Owner: chatterjeesaurabh
- License: mit
- Created: 2024-07-05T15:16:04.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-12-23T19:09:40.000Z (over 1 year ago)
- Last Synced: 2025-02-23T11:44:45.438Z (over 1 year ago)
- Topics: embedding-models, natural-language-processing, nlp-machine-learning, text-classification, topic-modeling
- Language: Jupyter Notebook
- Homepage:
- Size: 222 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Natural Language Processing
| | Title | Packages | Code |
| ------| ------------ |-----------| -------------------|
| 1 | Cleaning, Stop Words Removal, Stemming, Lemmatization, Bag of Words, TF-IDF | NLTK, SkLearn | [NLP 1](https://github.com/chatterjeesaurabh/Natural-Language-Processing/blob/main/NLP_1_BoW_TFIDF.ipynb) |
| 2 | Word2Vec | Gensim | [NLP 2](https://github.com/chatterjeesaurabh/Natural-Language-Processing/blob/main/NLP_2_Word2Vec.ipynb) |
| 3 | Spam Classification : Bow, TF-IDF, Word2Vec | NLTK, Gensim, SkLearn, Pandas | [NLP 3](https://github.com/chatterjeesaurabh/Natural-Language-Processing/blob/main/NLP_3_Spam_Classifier_BoW_TFIDF_Word2Vec.ipynb) |
| 4 | Word Embedding Techniques using Embedding Layer in Keras | Keras | [NLP 4](https://github.com/chatterjeesaurabh/Natural-Language-Processing/blob/main/NLP_4_Word_Embedding_Techniques.ipynb) |
| 5 | Fake News Classifier using LSTM | NLTK, Keras | [NLP 5](https://github.com/chatterjeesaurabh/Natural-Language-Processing/blob/main/NLP_5_Fake_News_Classifier_LSTM.ipynb) |
| 6 | [Topic Modeling with LDA](https://github.com/chatterjeesaurabh/Amazon-Reviews-Topic-Modelling-with-LDA) | NLTK, spaCy, Gensim, Pandas | [Repo](https://github.com/chatterjeesaurabh/Amazon-Reviews-Topic-Modelling-with-LDA) |
| 7 | [Topic Modeling with BERTopic](https://github.com/chatterjeesaurabh/Topic-Modeling-arXiv-Abstracts-using-BERTopic) | BERTopic | [Repo](https://github.com/chatterjeesaurabh/Topic-Modeling-arXiv-Abstracts-using-BERTopic) |
| 8 | [Dialogue Summarization with Large Language Model](https://github.com/chatterjeesaurabh/Dialogue-Summarization-with-Large-Language-Model) | Hugging Face Transformers | [Repo](https://github.com/chatterjeesaurabh/Dialogue-Summarization-with-Large-Language-Model) |
| 9 | [Contextual RAG System with Hybrid Search and Reranking](https://github.com/chatterjeesaurabh/Contextual-RAG-System-with-Hybrid-Search-and-Reranking) | LangChain | [Repo](https://github.com/chatterjeesaurabh/Contextual-RAG-System-with-Hybrid-Search-and-Reranking) |