https://github.com/joaoassalim/class-by-description-classifier-with-nlp

Enhancing Item Classification through Natural Language Processing: Leveraging Text Descriptions for Precise Categorization
bert fine-tuning nlp nlp-machine-learning scikit-learn sklearn tensorflow

# Text Classification Project

## Overview

This project classifies items from their text descriptions using Natural Language Processing (NLP) techniques. Machine learning classifiers automatically assign each item to a predefined class, including real estate, machinery, services, information technology, and furniture.

## Introduction

Classifying diverse items into distinct categories is a common challenge across many industries. This project addresses it with NLP-based text classification. The primary objective is to develop a model that accurately assigns items to predefined classes based on their textual descriptions.
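To make the task concrete, here is a minimal standard-library sketch of description-based classification using a multinomial Naive Bayes model over word counts. It is an illustration of the general idea only, not the implementation used in this repository. The class `NaiveBayesTextClassifier`, the helper `tokenize`, and the toy English descriptions are all hypothetical.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesTextClassifier:
    """Hypothetical multinomial Naive Bayes with add-one smoothing."""

    def fit(self, descriptions, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # word frequencies per class
        self.vocabulary = set()
        for text, label in zip(descriptions, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocabulary.update(tokens)
        return self

    def predict(self, description):
        tokens = tokenize(description)
        total_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocabulary)
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_counts.items():
            # Log prior plus smoothed log likelihood of each token.
            score = math.log(doc_count / total_docs)
            total_words = sum(self.word_counts[label].values())
            for token in tokens:
                count = self.word_counts[label][token]
                score += math.log((count + 1) / (total_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training data, invented for illustration (not the project's dataset).
train_descriptions = [
    "two bedroom apartment with garage",
    "commercial land plot near downtown",
    "hydraulic excavator with diesel engine",
    "industrial lathe machine for metal",
    "office cleaning and maintenance service",
    "legal consulting service contract",
    "laptop computer with software license",
    "network server and router equipment",
    "wooden office desk and chair",
    "leather sofa and dining table",
]
train_labels = [
    "real estate", "real estate",
    "machinery", "machinery",
    "services", "services",
    "information technology", "information technology",
    "furniture", "furniture",
]

classifier = NaiveBayesTextClassifier().fit(train_descriptions, train_labels)
prediction = classifier.predict("wooden dining chair")
print(prediction)  # furniture
```

The model scores each class by how often the words of a description appeared in that class during training, which is why a handful of distinctive words is enough to separate these toy categories.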

## Project Structure

- **`data/`**: Stores the dataset used for training and evaluation.
- **`notebooks/`**: Jupyter notebooks for experimentation and analysis.

## Requirements

- Python 3.9
- Required Python packages can be installed using: `pip install -r requirements.txt`

## Installation

1. Clone the repository:

```bash
git clone https://github.com/JoaoAssalim/Class-by-Description-Classifier-with-NLP.git
```

## Available Implementations

This repository includes three different approaches for implementing the model:

- **`Sklearn`**: Implementation using scikit-learn for traditional machine learning classifiers.
- **`TensorFlow`**: Implementation using TensorFlow for building and training neural network models.
- **`Fine-tuning BERT`**: Implementation that fine-tunes the pre-trained model `neuralmind/bert-base-portuguese-cased` on Portuguese text.
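This README does not show how the scikit-learn variant is built. A common baseline for this kind of task is a TF-IDF vectorizer feeding a linear classifier, sketched below under that assumption. The choice of `TfidfVectorizer` with `LogisticRegression`, the hyperparameters, and the toy English data are illustrative and may differ from what the notebooks actually use.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy data invented for illustration (not the project's dataset).
descriptions = [
    "two bedroom apartment with garage",
    "commercial land plot near downtown",
    "hydraulic excavator with diesel engine",
    "industrial lathe machine for metal",
    "office cleaning and maintenance service",
    "legal consulting service contract",
    "laptop computer with software license",
    "network server and router equipment",
    "wooden office desk and chair",
    "leather sofa and dining table",
]
labels = [
    "real estate", "real estate",
    "machinery", "machinery",
    "services", "services",
    "information technology", "information technology",
    "furniture", "furniture",
]

# Assumed baseline: unigram and bigram TF-IDF features, then a linear model.
model = make_pipeline(
    TfidfVectorizer(lowercase=True, ngram_range=(1, 2)),
    LogisticRegression(max_iter=1000),
)
model.fit(descriptions, labels)

# Classify a new, unseen description.
predicted = model.predict(["leather sofa with wooden table"])
print(predicted[0])
```

Wrapping both steps in one pipeline keeps the vectorizer fitted only on training text, so the same object can be reused for cross-validation or for prediction on raw strings.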

## Contributing

Contributions are welcome! If you'd like to contribute to this project, please fork the repository and submit a pull request.