{"id":21681702,"url":"https://github.com/jersongb22/tokenclassification-tensorflow","last_synced_at":"2026-04-13T23:03:13.349Z","repository":{"id":244147831,"uuid":"812460895","full_name":"JersonGB22/TokenClassification-TensorFlow","owner":"JersonGB22","description":null,"archived":false,"fork":false,"pushed_at":"2024-06-14T00:28:34.000Z","size":600,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-20T10:51:20.450Z","etag":null,"topics":["bert-large","hugging-face","named-entity-recognition","part-of-speech-tagging","plotly","python","sickit-learn","tensorflow","token-classification"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/JersonGB22.png","metadata":{"files":{"readme":"readme.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-06-09T00:41:03.000Z","updated_at":"2024-06-26T18:05:39.000Z","dependencies_parsed_at":"2024-06-13T05:30:17.875Z","dependency_job_id":"86707d31-7775-411e-ba91-dfd443a64642","html_url":"https://github.com/JersonGB22/TokenClassification-TensorFlow","commit_stats":null,"previous_names":["jersongb22/tokenclassification-tensorflow"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/JersonGB22/TokenClassification-TensorFlow","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JersonGB22%2FTokenClassification-TensorFlow","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JersonGB22%2FTokenClassification-TensorFlow/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JersonGB22%2FTokenClassification-TensorFlow/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JersonGB22%2FTokenClassification-TensorFlow/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/JersonGB22","download_url":"https://codeload.github.com/JersonGB22/TokenClassification-TensorFlow/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JersonGB22%2FTokenClassification-TensorFlow/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31774547,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-13T20:17:16.280Z","status":"ssl_error","status_checked_at":"2026-04-13T20:17:08.216Z","response_time":93,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bert-large","hugging-face","named-entity-recognition","part-of-speech-tagging","plotly","python","sickit-learn","tensorflow","token-classification"],"created_at":"2024-11-25T15:30:51.770Z","updated_at":"2026-04-13T23:03:13.321Z","avatar_url":"https://github.com/JersonGB22.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# \u003ch1 align=\"center\"\u003e**Token Classification**\u003c/h1\u003e\n\n\u003cp align=\"center\"\u003e\n\u003cimg src=\"images/image_readme.jpeg\"\u003e \n\u003c/p\u003e\n\nThis repository implements Token Classification models, a Natural Language Processing (NLP) task that assigns labels to individual tokens in a sentence. These models are built using TensorFlow and the Hugging Face Transformers library. The architectures are based on [LSTM](https://www.tensorflow.org/api_docs/python/tf/keras/layers/LSTM) networks and the pretrained [BERT](https://huggingface.co/docs/transformers/model_doc/bert) model.\n\nKey applications of token classification include Named Entity Recognition (NER) and Part-of-Speech (PoS) tagging. In real-world scenarios, these tasks are crucial for various applications such as information extraction, text analysis, and language understanding.\n\n## **Use Cases So Far:**\n\n- **Named Entity Recognition:** This model identifies and classifies named entities in a text, such as names of persons, dates, locations, organizations, etc. It has been trained using the [NER dataset from Kaggle](https://www.kaggle.com/datasets/namanj27/ner-dataset), which provides 17 different labels for this task.\n\n- **Part-of-Speech Tagging:** This model recognizes and tags parts of speech, such as nouns, pronouns, adjectives, or verbs, in a given text. It has been trained using a dataset containing 42 labels specifically for this task, also sourced from Kaggle.\n\n## **Some Results of the Predictions**\n\n- **Named Entity Recognition**\n\n\u003cp align=\"left\"\u003e\n\u003cimg src=\"images/ner/ner_prediction_1.png\" style=\"width: 649px;\"\u003e \n\u003c/p\u003e\n\n---\n\u003cp align=\"left\"\u003e\n\u003cimg src=\"images/ner/ner_prediction_2.png\" style=\"width: 878px;\"\u003e \n\u003c/p\u003e\n\n- **Part-of-Speech Tagging**\n\n\u003cp align=\"left\"\u003e\n\u003cimg src=\"images/pos/pos_prediction_1.png\" style=\"width: 705px;\"\u003e \n\u003c/p\u003e\n\n---\n\u003cp align=\"left\"\u003e\n\u003cimg src=\"images/pos/pos_prediction_2.png\" style=\"width: 535px;\"\u003e \n\u003c/p\u003e\n\n#### *Further results from the predictions can be found in their respective notebooks.*\n\n## **Technological Stack**\n[![Python](https://img.shields.io/badge/Python-3776AB?style=for-the-badge\u0026logo=python\u0026logoColor=white\u0026labelColor=101010)](https://docs.python.org/3/) \n[![TensorFlow](https://img.shields.io/badge/TensorFlow-FF6F00?style=for-the-badge\u0026logo=tensorflow\u0026logoColor=white\u0026labelColor=101010)](https://www.tensorflow.org/api_docs)\n[![Hugging Face](https://img.shields.io/badge/Hugging%20Face-FFD21E?style=for-the-badge\u0026logo=huggingface\u0026logoColor=white\u0026labelColor=101010)](https://huggingface.co/)\n[![Scikit-learn](https://img.shields.io/badge/Scikit--learn-F7931E?style=for-the-badge\u0026logo=scikit-learn\u0026logoColor=white\u0026labelColor=101010)](https://scikit-learn.org/stable/)\n[![Plotly](https://img.shields.io/badge/Plotly-3F4F75?style=for-the-badge\u0026logo=plotly\u0026logoColor=white\u0026labelColor=101010)](https://plotly.com/)\n\n## **Contact**\n[![Gmail](https://img.shields.io/badge/Gmail-D14836?style=for-the-badge\u0026logo=gmail\u0026logoColor=white\u0026labelColor=101010)](mailto:jerson.gimenesbeltran@gmail.com)\n[![LinkedIn](https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge\u0026logo=linkedin\u0026logoColor=white\u0026labelColor=101010)](https://www.linkedin.com/in/jerson-gimenes-beltran/)\n[![GitHub](https://img.shields.io/badge/GitHub-181717?style=for-the-badge\u0026logo=github\u0026logoColor=white\u0026labelColor=101010)](https://github.com/JersonGB22/)","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjersongb22%2Ftokenclassification-tensorflow","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjersongb22%2Ftokenclassification-tensorflow","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjersongb22%2Ftokenclassification-tensorflow/lists"}