An open API service indexing awesome lists of open source software.

https://github.com/lbann/llama

Parallel implementation of the LLaMA models
https://github.com/lbann/llama

Last synced: 3 months ago
JSON representation

Parallel implementation of the LLaMA models

Awesome Lists containing this project

README

          

# LLaMA Repository

Distributed implementation of the LLaMA 3.x model. Optimzied to allow both
pipeline and tensor parallel inference execution using PyTorch.

```
torchrun-hpc -N1 -n2 --rdv tcp chat_server.py --model-dir
```
# LBANN: Livermore Big Artificial Neural Network Toolkit

The Livermore Big Artificial Neural Network toolkit (LBANN) is an
open-source, HPC-centric, deep learning training framework that is
optimized to compose multiple levels of parallelism.

LBANN provides model-parallel acceleration through domain
decomposition to optimize for strong scaling of network training. It
also allows for composition of model-parallelism with both data
parallelism and ensemble training methods for training large neural
networks with massive amounts of data. LBANN is able to advantage of
tightly-coupled accelerators, low-latency high-bandwidth networking,
and high-bandwidth parallel file systems.

## Publications

A list of publications, presentations and posters are shown
[here](https://lbann.readthedocs.io/en/latest/publications.html).

## Reporting issues
Issues, questions, and bugs can be raised on the [Github issue
tracker](https://github.com/LBANN/lbann/issues).