An open API service indexing awesome lists of open source software.

https://github.com/md-emon-hasan/nlp-codebasics

Collection of basic Natural Language Processing examples that cover essential techniques like tokenization, text representation, and text classification.
https://github.com/md-emon-hasan/nlp-codebasics

bag-of-words bow gensim gensim-word2vec lematization nlp nlp-library nlp-machine-learning nltk nltk-python python3 spacy text-classification text-processing tokenization

Last synced: 6 months ago
JSON representation

Collection of basic Natural Language Processing examples that cover essential techniques like tokenization, text representation, and text classification.

Awesome Lists containing this project

README

          

# NLP CodeBasics

This repository provides fundamental code examples and techniques for Natural Language Processing (NLP). It covers essential concepts, tools, and methods such as tokenization, part-of-speech tagging, named entity recognition, and text classification. The project employs popular Python libraries like **spaCy**, **NLTK**, and **Gensim** to implement various NLP tasks.

Key Features:
- Tokenization with **spaCy**
- **Part-of-Speech Tagging** and **Named Entity Recognition** (NER)
- Text representation with **TF-IDF**, **Word Vectors**, and **Word2Vec**
- **Text Classification** using **spaCy** and **Gensim**
- Comparative studies: **spaCy vs NLTK**
- Advanced NLP techniques for feature extraction and preprocessing