{"id":14964931,"url":"https://github.com/minggnim/nlp-models","last_synced_at":"2025-11-11T18:10:05.411Z","repository":{"id":54913735,"uuid":"522735315","full_name":"minggnim/nlp-models","owner":"minggnim","description":"A repository for training transformer based models","archived":false,"fork":false,"pushed_at":"2024-07-30T03:42:36.000Z","size":16719,"stargazers_count":2,"open_issues_count":1,"forks_count":1,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-09-30T07:14:39.358Z","etag":null,"topics":["chatbot","chatbots","ctransformers","deeplearning","falcon","fine-tuning","gpt-2","langchain","llama2","llms","multi-label-classification","multi-task-learning","nlp","pytorch","qdrant-vector-database","transformers"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/minggnim.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-08-08T23:15:11.000Z","updated_at":"2024-07-30T03:42:39.000Z","dependencies_parsed_at":"2023-01-25T22:01:59.024Z","dependency_job_id":"2e7d9713-abb7-4adb-93a9-392183596841","html_url":"https://github.com/minggnim/nlp-models","commit_stats":{"total_commits":158,"total_committers":3,"mean_commits":"52.666666666666664","dds":"0.025316455696202556","last_synced_commit":"254a7d071483fd0f7b8c62cf83a0abbf6b1011b2"},"previous_names":["minggnim/nlp-classification-model"],"tags_count":40,"template":false,"template_full_name":null,"purl":"pkg:github/minggnim/nlp-models","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minggnim%2Fnlp-models","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minggnim%2Fnlp-models/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minggnim%2Fnlp-models/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minggnim%2Fnlp-models/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/minggnim","download_url":"https://codeload.github.com/minggnim/nlp-models/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/minggnim%2Fnlp-models/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":278590830,"owners_count":26012020,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-06T02:00:05.630Z","response_time":65,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["chatbot","chatbots","ctransformers","deeplearning","falcon","fine-tuning","gpt-2","langchain","llama2","llms","multi-label-classification","multi-task-learning","nlp","pytorch","qdrant-vector-database","transformers"],"created_at":"2024-09-24T13:33:59.310Z","updated_at":"2025-10-08T16:54:15.061Z","avatar_url":"https://github.com/minggnim.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"![PyPI - Package Version](https://img.shields.io/pypi/v/nlp-models?logo=pypi\u0026style=flat\u0026color=blue)\n\u003ca href=\"https://github.com/minggnim/nlp-models/blob/master/LICENSE\"\u003e\u003cimg src=\"https://img.shields.io/badge/License-MIT-blue\" alt=\"MIT License\"\u003e\u003c/a\u003e\n[![PyPI pyversions](https://img.shields.io/pypi/pyversions/nlp-models.svg)](https://pypi.python.org/pypi/nlp-models/)\n[![Python package](https://github.com/minggnim/nlp-classification-model/actions/workflows/python-package.yml/badge.svg)](https://github.com/minggnim/nlp-classification-model/actions/workflows/python-package.yml)\n[![Dependency Review](https://github.com/minggnim/nlp-classification-model/actions/workflows/dependency-review.yml/badge.svg)](https://github.com/minggnim/nlp-classification-model/actions/workflows/dependency-review.yml)\n\n# NLP Models\n\nA repository for building transformer based nlp models\n\n## Installation\n\n### Install from PyPi\n\n```\npip install nlp-models\n```\n\n### Install from source\n\n```\ngit clone git@github.com:minggnim/nlp-models.git\npip install -r requirements\n```\n\n## Llama2 Quantization model on consumer CPU\n\n### Run Chat applications on CPU\n1. Streamlit UI\n\n    ```\n    cd apps\n    streamlit run chat.py\n    ```\n\n2. Command line\n\n    ```\n    llm_app chat -s 'hi there'\n    ```\n\n### Run Q\u0026A application on CPU\n1. Steamlit UI\n\n    ```\n    cd apps\n    streamlit run qa.py\n    ```\n\n## Models\n\n1. `bert_classifier`\n   A wrapper package around BERT-based classification models\n\n   - [Training example](https://github.com/minggnim/nlp-models/blob/master/notebooks/01_bert-classification-finetuning/01_a_classification_model_training_example.ipynb)\n   - [Inference example](https://github.com/minggnim/nlp-models/blob/master/notebooks/01_bert-classification-finetuning/01_b_classification_inference_example.ipynb)\n2. `multi_task_model`\n   An implementation of multi-tasking model built on encoder models\n\n   - [Zero-shot multi-task model](https://github.com/minggnim/nlp-models/blob/master/notebooks/02_multi-task-model/02_a_multitask_model_zeroshot_learning.ipynb)\n   - [Training example](https://github.com/minggnim/nlp-models/blob/master/notebooks/02_multi-task-model/02_b_multitask_model_training_example.ipynb)\n   - [Inference example](https://github.com/minggnim/nlp-models/blob/master/notebooks/02_multi-task-model/02_c_multitask_model_inference_example.ipynb)\n   - [Qqrant Vector DB](https://github.com/minggnim/nlp-models/blob/master/notebooks/02_multi-task-model/02_d_qdrant_vector_db.ipynb)\n3. `GPT-2`\n\n    - [Training GPT-2 model](https://github.com/minggnim/nlp-models/blob/master/notebooks/03_gpt-2-training/gpt-2-training/03_gpt2_training.ipynb)\n\n4. `Falcon 7B`\n\n    - [Running Falcon 7b model](https://github.com/minggnim/nlp-models/blob/master/notebooks/04_llms/05_falcon_7b.ipynb)\n\n5. Quantized Llama2 models\n\n    - [Run Llama2 chat on CPU](https://github.com/minggnim/nlp-models/blob/master/notebooks/04_llms/06_llama2_langchain_gglm_inference.ipynb)\n    - [Run Llama2 QA on a custom pdf document on CPU](https://github.com/minggnim/nlp-models/blob/master/notebooks/04_llms/07_llama2_doc_qa.ipynb)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fminggnim%2Fnlp-models","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fminggnim%2Fnlp-models","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fminggnim%2Fnlp-models/lists"}