https://github.com/jersongb22/causallanguagemodeling-tensorflow
- Host: GitHub
- URL: https://github.com/jersongb22/causallanguagemodeling-tensorflow
- Owner: JersonGB22
- Created: 2024-06-30T00:12:26.000Z (12 months ago)
- Default Branch: main
- Last Pushed: 2024-07-06T01:46:29.000Z (12 months ago)
- Last Synced: 2025-01-25T11:25:41.063Z (5 months ago)
- Topics: causal-language-modeling, distilgpt2, gpt-2, hugging-face, plotly, python, tensorflow
- Language: Jupyter Notebook
- Homepage:
- Size: 680 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: readme.md
README
# **Causal Language Modeling**
This repository implements Causal Language Modeling, a natural language processing (NLP) task in which a model predicts the next token in a sequence while attending only to the tokens to its left; that is, it cannot see future tokens, as is the case with GPT-2. The most common use of these models is text generation, which involves completing or paraphrasing an incomplete text. The models are built with the TensorFlow and Hugging Face Transformers libraries.
The most significant use cases include creative content generation, text autocompletion, assisted writing, and realistic dialogue creation for chatbot applications.
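As a concrete illustration of the task (a minimal sketch, not code from this repository; the prompt and decoding settings are assumptions), text can be generated with the pre-trained DistilGPT2 model through the TensorFlow classes of Hugging Face Transformers:

```python
# Illustrative sketch: generating text with a pre-trained causal LM using
# the TensorFlow classes from Hugging Face Transformers.
from transformers import AutoTokenizer, TFAutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("distilgpt2")
model = TFAutoModelForCausalLM.from_pretrained("distilgpt2")

prompt = "To be, or not to be"  # assumed example prompt
inputs = tokenizer(prompt, return_tensors="tf")

# Causal LMs attend only to tokens on the left, so generation proceeds
# strictly left to right, sampling one next token at a time.
output_ids = model.generate(
    **inputs,
    max_new_tokens=40,
    do_sample=True,
    top_k=50,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```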
## **Implemented Models:**
- **Shakespeare-Style Text Generation:** An [LSTM network](https://www.tensorflow.org/api_docs/python/tf/keras/layers/LSTM) model is implemented using a [dataset containing Shakespeare's writings](https://storage.googleapis.com/download.tensorflow.org/data/shakespeare.txt). To achieve better results, the pre-trained [DistilGPT2 (short for Distilled-GPT2)](https://huggingface.co/distilbert/distilgpt2) model is then fine-tuned on a larger dataset from the official [Project Gutenberg](https://www.gutenberg.org/cache/epub/100/pg100.txt) site containing Shakespeare's major works, which improves coherence and spelling and better captures Shakespeare's style (see the fine-tuning sketch after this list).
- **Text Generation with GPT-2:** In this case, the [GPT-2](https://huggingface.co/openai-community/gpt2) model is fine-tuned on the [WikiText-103](https://huggingface.co/datasets/Salesforce/wikitext) dataset, a collection of over 100 million tokens extracted from Wikipedia's verified Good and Featured articles, enabling the model to generate relevant text in fields such as art, history, philosophy, medicine, technology, economics, and more.
- **Text Generation with GPT-2 XL:** Here, the [GPT-2 XL](https://huggingface.co/openai-community/gpt2-xl) model is used without any additional fine-tuning. This is the largest model in the GPT-2 series, with over 1.5 billion parameters, and it has achieved the best results across various datasets.
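The first two models above are fine-tuned causal LMs. The following is a hedged sketch of what such a fine-tuning loop can look like in TensorFlow; the corpus file name, block size, batch size, epochs, and learning rate are illustrative assumptions, not the exact settings used in this repository's notebooks:

```python
# Hedged sketch of causal-LM fine-tuning in TensorFlow; hyperparameters and
# the corpus path are assumptions for illustration.
import tensorflow as tf
from transformers import AutoTokenizer, TFAutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("distilgpt2")
model = TFAutoModelForCausalLM.from_pretrained("distilgpt2")

block_size = 128
with open("shakespeare.txt", encoding="utf-8") as f:  # placeholder corpus file
    text = f.read()

# Tokenize the whole corpus and chop the token stream into fixed-length blocks.
ids = tokenizer(text, return_tensors="np")["input_ids"][0]
n_blocks = len(ids) // block_size
ids = ids[: n_blocks * block_size].reshape(n_blocks, block_size)

# For causal language modeling the labels are the inputs themselves;
# Hugging Face LM heads shift them internally to predict the next token.
dataset = tf.data.Dataset.from_tensor_slices((ids, ids)).shuffle(1024).batch(8)

# Compiling without an explicit loss makes the model use its built-in
# language-modeling (cross-entropy) loss.
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=5e-5))
model.fit(dataset, epochs=1)
```

Generation with the fine-tuned model then works exactly as in the sketch shown earlier.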
## **Some Results**
#### *Further results can be found in their respective notebooks.*
## **Technological Stack**
[Python](https://docs.python.org/3/)
[TensorFlow](https://www.tensorflow.org/api_docs)
[Hugging Face](https://huggingface.co/)
[Plotly](https://plotly.com/)

## **Contact**

[Email](mailto:[email protected])
[LinkedIn](https://www.linkedin.com/in/jerson-gimenes-beltran/)
[GitHub](https://github.com/JersonGB22/)