Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/michaelistrofficus/diffuse_my_lyrics

A dummy project to generate images from song lyrics using Stable Diffusion
https://github.com/michaelistrofficus/diffuse_my_lyrics

Last synced: about 2 months ago
JSON representation

A dummy project to generate images from song lyrics using Stable Diffusion

Host: GitHub
URL: https://github.com/michaelistrofficus/diffuse_my_lyrics
Owner: MichaelisTrofficus
Created: 2022-09-01T17:56:03.000Z (over 2 years ago)
Default Branch: master
Last Pushed: 2023-03-15T12:13:46.000Z (almost 2 years ago)
Last Synced: 2024-10-11T00:07:40.221Z (3 months ago)
Language: Python
Homepage:
Size: 5.99 MB
Stars: 15
Watchers: 3
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

Diffuse My Lyrics!

🎶 ➡ 🧠 ➡ 🖼️

An easy way to generate images from lyrics

Description •
How To Use •
Arguments •
Cool Outputs •
Next Steps •

## Description

This is a simple application that uses the spectacular [Stable Diffusion](https://stability.ai/blog/stable-diffusion-public-release) model to generate images from song lyrics.
So just install the library on a Colab notebook, choose your favorite song and sit back and wait for the visual interpretations of each verse!

In the Cool Outputs section, I have shown some interpretations of verses that I found very cool 😏😏😏

## How To Use

First, simply install the python package in a colab notebook (right now, it's only for Colab,
but extending it to general use is trivial ... as long as you have a good GPU 😅 ).

```bash
# Install the latest version of the package
$ pip install -U diffuse-my-lyrics
```

Now, suppose we want to feed the model with the following verses, belonging to The End, a magnificent piece by The Doors.

```
Ride the King's highway, baby
Weird scenes inside the gold mine
Ride the highway west, baby
Ride the snake, ride the snake
To the lake, the ancient lake, baby
The snake is long, seven miles
Ride the snake, he's old, and his skin is cold
```

After uploading this lyrics to the colab notebook (I am using a .txt extension), we just need to run the following
commands.

```python
# Import the Lyrics2Images class
from diffuse_my_lyrics import Lyrics2Images

l2i = Lyrics2Images(num_inference_steps=100) # In this case, we are indicating the model to run for 100 steps
l2i.run(input_path="/content/my_favourite_song.txt", output_path="my_favourite_song_folder")
```

After running `Lyrics2Images`, a folder will be created in your colab current directory (`my_favourite_song_folder`),
where a series of images will be generated (one image for each verse of the lyrics).

One it's finished, simply zip the folder and download it!!

```python
import shutil
shutil.make_archive("zipped_folder", 'zip', "my_favourite_song_folder")
```

## Arguments

- **model_id** - The model id. By default `CompVis/stable-diffusion-v1-4`
- **revision** - The model revision. By default `fp16`
- **torch_dtype** - The Pytorch dtype. By default `torch.float16`
- **prompt** - This parameter is useful if you want to add additional information to the verse. For example, `digital art`,
`HQ`, etc. By default `digital art`
- **num_inference_steps** - The number of steps. By default `50`
- **use_auth_token** - This parameter determines whether to use an authentication token for Hugging Face. By default
`True`

## Cool Outputs

Let me show you now a selection of results I found interesting during my experiments.

### The Doors - The End

`Ride the King's highway, baby`

![screenshot](./images/the_end_1.png)

`Weird scenes inside the gold mine`

![screenshot](./images/the_end_2.png)

`To the lake, the ancient lake, baby`

![screenshot](./images/the_end_3.png)

### The Doors - The crystal ship

`The days are bright and filled with pain`

![screenshot](./images/crystal_1.png)

`The crystal ship is being filled`

![screenshot](./images/crystal_2.png)

### The Pixies - Monkey Gone to Heaven

`An underwater guy who controlled the sea`

![screenshot](./images/monkey_1.png)

`Got killed by ten million pounds of sludge from New York and New Jersey`

![screenshot](./images/monkey_2.png)

`This monkey's gone to heaven`

![screenshot](./images/monkey_3.png)

`The creature in the sky`

![screenshot](./images/monkey_4.png)

## Next Steps

- Add support for generating several images instead of just one.
- Make the library usable in another environments (not just Colab)
- Create argument for using a manual seed
- Add custom size of output images