https://github.com/halcy/tpuddim
https://github.com/halcy/tpuddim
Last synced: about 1 year ago
JSON representation
- Host: GitHub
- URL: https://github.com/halcy/tpuddim
- Owner: halcy
- Created: 2021-08-03T09:14:19.000Z (almost 5 years ago)
- Default Branch: main
- Last Pushed: 2022-05-03T07:09:39.000Z (about 4 years ago)
- Last Synced: 2025-03-30T22:31:57.173Z (about 1 year ago)
- Language: Jupyter Notebook
- Size: 22.7 MB
- Stars: 22
- Watchers: 5
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Tpuddim
Denoising Diffusion Implicit Models JAX TPU implementation. Based on the network architecture in https://github.com/openai/guided-diffusion , pretrained weights compatible.
Still quite WIP, though less so now, there is code for inference and basic training, weights for the MNIST example, an initial training attempt on Danbooru2019Faces and a [convenient colab inference notebook](https://colab.research.google.com/github/halcy/tpuddim/blob/main/colab/Danbooru2019Portraits_Inference_Colab.ipynb) for the same.
There's also an attempt at making a mixer work as a diffusion model, that doesn't quite work yet. Patches extremely welcome, especially if they adress one of the things marked TODO in the code somewhere.
(Note that training on Colab will _not_ work - this code was developed on, and the models trained on, TRC TPUs)
MNIST example output:

Danbooru2019Faces example output:

# Acknowledgements
This work would not have been possible without a TPU access grant by the Google [TPU Research Cloud](https://sites.research.google/trc/about/).