https://github.com/jersongb22/tokenclassification-tensorflow
https://github.com/jersongb22/tokenclassification-tensorflow
bert-large hugging-face named-entity-recognition part-of-speech-tagging plotly python sickit-learn tensorflow token-classification
Last synced: 11 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/jersongb22/tokenclassification-tensorflow
- Owner: JersonGB22
- Created: 2024-06-09T00:41:03.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-06-14T00:28:34.000Z (over 1 year ago)
- Last Synced: 2025-01-25T11:25:40.504Z (about 1 year ago)
- Topics: bert-large, hugging-face, named-entity-recognition, part-of-speech-tagging, plotly, python, sickit-learn, tensorflow, token-classification
- Language: Jupyter Notebook
- Homepage:
- Size: 586 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
Awesome Lists containing this project
README
#
**Token Classification**
This repository implements Token Classification models, a Natural Language Processing (NLP) task that assigns labels to individual tokens in a sentence. These models are built using TensorFlow and the Hugging Face Transformers library. The architectures are based on [LSTM](https://www.tensorflow.org/api_docs/python/tf/keras/layers/LSTM) networks and the pretrained [BERT](https://huggingface.co/docs/transformers/model_doc/bert) model.
Key applications of token classification include Named Entity Recognition (NER) and Part-of-Speech (PoS) tagging. In real-world scenarios, these tasks are crucial for various applications such as information extraction, text analysis, and language understanding.
## **Use Cases So Far:**
- **Named Entity Recognition:** This model identifies and classifies named entities in a text, such as names of persons, dates, locations, organizations, etc. It has been trained using the [NER dataset from Kaggle](https://www.kaggle.com/datasets/namanj27/ner-dataset), which provides 17 different labels for this task.
- **Part-of-Speech Tagging:** This model recognizes and tags parts of speech, such as nouns, pronouns, adjectives, or verbs, in a given text. It has been trained using a dataset containing 42 labels specifically for this task, also sourced from Kaggle.
## **Some Results of the Predictions**
- **Named Entity Recognition**
---
- **Part-of-Speech Tagging**
---
#### *Further results from the predictions can be found in their respective notebooks.*
## **Technological Stack**
[](https://docs.python.org/3/)
[](https://www.tensorflow.org/api_docs)
[](https://huggingface.co/)
[](https://scikit-learn.org/stable/)
[](https://plotly.com/)
## **Contact**
[](mailto:jerson.gimenesbeltran@gmail.com)
[](https://www.linkedin.com/in/jerson-gimenes-beltran/)
[](https://github.com/JersonGB22/)