https://github.com/alro10/ds_nlp_challenge
- Host: GitHub
- URL: https://github.com/alro10/ds_nlp_challenge
- Owner: Alro10
- Created: 2021-12-27T19:43:16.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2021-12-27T21:00:35.000Z (over 4 years ago)
- Last Synced: 2025-03-13T18:12:47.844Z (over 1 year ago)
- Size: 1000 Bytes
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# NLP Challenge
This repo contains the data science NLP challenge. We use it to evaluate DS candidates at IBM CIO Brazil.
## Datasets:
Choose one of the following datasets:
- https://www.kaggle.com/leandrodoze/sentiment-analysis-in-portuguese/data
- https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge/data
- https://www.kaggle.com/datatattle/covid-19-nlp-text-classification
## Steps:
To better evaluate your skills and knowledge, we consider the following steps:
1. Understand and clean the dataset. After that, create and evaluate the model.
2. Expose the model using a simple API (you could use Flask, FastAPI, etc.).
   - The app receives a request in JSON format and responds with JSON containing the model inference.
3. Create a basic UI that a non-technical team can use.
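
As a rough illustration of step 2, the sketch below shows one possible JSON request/response contract using only the Python standard library. Everything in it is an assumption made for illustration: the `text` field name, the response shape, the `clean_text`, `predict`, and `handle_request` helpers, and the keyword-counting `predict`, which is only a stand-in for the model you train in step 1. In an actual submission, a Flask or FastAPI route would likely wrap a function similar to `handle_request`.

```python
import json
import re
from typing import Tuple

# Hypothetical keyword lists standing in for a trained model.
POSITIVE_WORDS = {"good", "great", "love", "excellent"}
NEGATIVE_WORDS = {"bad", "terrible", "hate", "awful"}


def clean_text(text: str) -> str:
    """Lowercase, drop URLs and punctuation, collapse whitespace."""
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)  # remove URLs
    text = re.sub(r"[^\w\s]", " ", text)       # remove punctuation, keep accented letters
    return re.sub(r"\s+", " ", text).strip()


def predict(text: str) -> dict:
    """Placeholder inference: counts keyword hits instead of calling a real model."""
    tokens = clean_text(text).split()
    score = sum(t in POSITIVE_WORDS for t in tokens) - sum(t in NEGATIVE_WORDS for t in tokens)
    if score > 0:
        label = "positive"
    elif score < 0:
        label = "negative"
    else:
        label = "neutral"
    return {"label": label, "score": score}


def handle_request(body: str) -> Tuple[int, str]:
    """Map a JSON request body to (HTTP status code, JSON response body)."""
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        return 400, json.dumps({"error": "body is not valid JSON"})
    # Assumed contract: a JSON object with a non-empty string under "text".
    text = payload.get("text") if isinstance(payload, dict) else None
    if not isinstance(text, str) or not text.strip():
        return 400, json.dumps({"error": "expected a non-empty 'text' field"})
    return 200, json.dumps({"text": text, "prediction": predict(text)})


if __name__ == "__main__":
    status, response = handle_request('{"text": "I love this, great job!"}')
    print(status, response)
```

Keeping the request handling in a plain function, separate from the web framework, makes the contract easy to unit test and lets the UI in step 3 rely on a stable response shape.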
Please consider following the CRISP-DM methodology. If you have any questions, do not hesitate to contact me.