An open API service indexing awesome lists of open source software.

https://github.com/nutanixdev/nugpt

Codebase for developing a miniaturized GPT model, following GPT-2 architecture
https://github.com/nutanixdev/nugpt

Last synced: 11 days ago
JSON representation

Codebase for developing a miniaturized GPT model, following GPT-2 architecture

Awesome Lists containing this project

README

          

# nugpt

This repo includes pertinent codebase for developing a miniaturized GPT model. It follows GPT-2 architecture.

# Commands

## Data Engineering

* Include text data
* run: `python data/data/prepare.py`

## Model Training

* `python train.py --batch_size=32 --wandb_log=True`

## Inference

* `python sample.py --out_dir=out-wiki`