https://github.com/niteshchawla/nlp-content-classification
The goal of this project is to use a bunch of news articles extracted from the companies’ internal database and categorize them into several categories like politics, technology, sports, business and entertainment based on their content.
https://github.com/niteshchawla/nlp-content-classification
bag-of-words lemmatization multiclass-classification natural-language-processing stopwords text-classification text-processing tf-idf tokenization
Last synced: 8 months ago
JSON representation
The goal of this project is to use a bunch of news articles extracted from the companies’ internal database and categorize them into several categories like politics, technology, sports, business and entertainment based on their content.
- Host: GitHub
- URL: https://github.com/niteshchawla/nlp-content-classification
- Owner: Niteshchawla
- Created: 2025-09-21T10:43:53.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2025-09-21T10:47:09.000Z (9 months ago)
- Last Synced: 2025-09-21T12:27:12.777Z (9 months ago)
- Topics: bag-of-words, lemmatization, multiclass-classification, natural-language-processing, stopwords, text-classification, text-processing, tf-idf, tokenization
- Language: Jupyter Notebook
- Homepage:
- Size: 1.29 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
The company aims to revolutionize the way Indians perceive finance, business, and capital market investment, by giving it a boost through artificial intelligence (AI) and machine learning (ML). They’re on a mission to reinvent financial literacy for Indians, where financial awareness is driven by smart information discovery and engagement with peers. Through their smart content discovery and contextual engagement, the company is simplifying business, finance, and investment for millennials and first-time investors
Objective:
The goal of this project is to use a bunch of news articles extracted from the companies’ internal database and categorize them into several categories like politics, technology, sports, business and entertainment based on their content. Use natural language processing and create & compare at least three different models.