Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/data-pioneer/mlops-name-entity-recognition-end-to-end-main
Implementation of MLops pipeline for Named Entity Recognition (NER) using pretrained Huggingface BERT transformer based model, further use CircleCI CI/CD tool for deployment on google cloud platform by using Docker image, Flask front end interface.
https://github.com/data-pioneer/mlops-name-entity-recognition-end-to-end-main
artifact-registry bert-fine-tuning circleci deep-neural-networks docker-image flask-application google-cloud huggingface-transformers pythin3
Last synced: about 2 months ago
JSON representation
Implementation of MLops pipeline for Named Entity Recognition (NER) using pretrained Huggingface BERT transformer based model, further use CircleCI CI/CD tool for deployment on google cloud platform by using Docker image, Flask front end interface.
- Host: GitHub
- URL: https://github.com/data-pioneer/mlops-name-entity-recognition-end-to-end-main
- Owner: data-pioneer
- License: mit
- Created: 2024-05-16T03:07:31.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2024-05-17T09:32:49.000Z (8 months ago)
- Last Synced: 2024-08-24T16:35:23.330Z (4 months ago)
- Topics: artifact-registry, bert-fine-tuning, circleci, deep-neural-networks, docker-image, flask-application, google-cloud, huggingface-transformers, pythin3
- Language: Python
- Homepage:
- Size: 936 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# End-to-End Named Entity Recognition (NER) with BERT Transformed learning on GCP (Docker & CircleCI)
This repository implements a complete Named Entity Recognition (NER) pipeline using a pre-trained Hugging Face Transformers model (BERT). It enables to identify and classify named entities (e.g. people, organizations, locations) within text data. The pipeline leverages the power of Google Cloud Platform (GCP) for deployment and scalability, containerized with Docker for portability, and streamlined with CircleCI for continuous integration and continuous delivery (CI/CD).
# Features
- Leverages pre-trained BERT model from Hugging Face Transformers for efficient and accurate NER.
- Provides a user-friendly interface to process text data and extract named entities.
- Scales seamlessly on GCP for handling large text datasets.
- Encapsulated in Docker containers for easy deployment across various environments.
- Automated CI/CD pipeline through CircleCI for streamlined development and deployment.# Flow Diaglram of MLops pipeline:
![MLops_NER_Architexture_Flow_Diagram](https://github.com/data-pioneer/MLops-Name-Entity-Recognition-End-to-End-main/assets/33811437/49401d27-b51e-49a8-ae71-9bfbbdb15396)
## Workflows
- constants
- config_entity
- artifact_entity
- components
- pipeline
- app.py## Git commands
```bash
git add .git commit -m "Updated"
git push origin main
```## GCP Configuration
```bash
#Gcloud cli download link: https://cloud.google.com/sdk/docs/install#windowsgcloud init
```## How to run?
```bash
conda create -n nerproj python=3.8 -y
``````bash
conda activate nerproj
``````bash
pip install -r requirements.txt
``````bash
python app.py
```## GCP CICD Deployment with CircleCI:
- artifact registry --> create a repository
- change line 42,50,72,76,54 in circleci config
- Opne circleci --> create a project### Set Environment variables in CircleCI
```bash
GCLOUD_SERVICE_KEY --> service account
GOOGLE_COMPUTE_ZONE = asia-south1
GOOGLE_PROJECT_ID
```
## Create a VM instances & setup scripts