Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with text-to-image

A curated list of projects in awesome lists tagged with text-to-image .

https://github.com/lucidrains/dalle2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

artificial-intelligence deep-learning text-to-image

Last synced: 03 Oct 2024

https://github.com/lucidrains/DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

artificial-intelligence deep-learning text-to-image

Last synced: 30 Jul 2024

https://github.com/lucidrains/imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

artificial-intelligence deep-learning imagination-machine text-to-image text-to-video

Last synced: 03 Oct 2024

https://github.com/xavierxiao/dreambooth-stable-diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

pytorch pytorch-lightning stable-diffusion text-to-image

Last synced: 30 Sep 2024

https://github.com/XavierXiao/Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

pytorch pytorch-lightning stable-diffusion text-to-image

Last synced: 30 Jul 2024

https://github.com/lucidrains/dalle-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

artificial-intelligence attention-mechanism deep-learning multi-modal text-to-image transformers

Last synced: 03 Oct 2024

https://github.com/lucidrains/DALLE-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

artificial-intelligence attention-mechanism deep-learning multi-modal text-to-image transformers

Last synced: 30 Jul 2024

https://github.com/lucidrains/deep-daze

Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun

artificial-intelligence deep-learning implicit-neural-representation multi-modality siren text-to-image transformers

Last synced: 03 Oct 2024

https://github.com/kuprel/min-dalle

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

artificial-intelligence deep-learning pytorch text-to-image

Last synced: 30 Sep 2024

https://github.com/saharmor/dalle-playground

A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini)

artificial artificial-intelligence dall-e dalle dalle-mini gan machine-learning openai stable-diffusion text-to-image transformers

Last synced: 30 Sep 2024

https://github.com/ai-forever/kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

diffusion image-generation image2image inpainting ipython-notebook kandinsky outpainting text-to-image text2image

Last synced: 27 Sep 2024

https://github.com/nerdyrodent/vqgan-clip

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

text-to-image text2image

Last synced: 30 Sep 2024

https://github.com/nerdyrodent/VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

text-to-image text2image

Last synced: 31 Jul 2024

https://github.com/lucidrains/big-sleep

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun

artificial-intelligence deep-learning generative-adversarial-networks multimodality text-to-image

Last synced: 03 Oct 2024

https://github.com/FurkanGozukara/Stable-Diffusion

Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney

ai-art coding computer-vision deep-learning deepfake deepfakes dreambooth education guide guides how-to learning machine-learning programming stable-diffusion text-to-image text-to-video tts tutorial tutorials

Last synced: 01 Aug 2024

https://github.com/furkangozukara/stable-diffusion

Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney

ai-art coding computer-vision deep-learning deepfake deepfakes dreambooth education guide guides how-to learning machine-learning programming stable-diffusion text-to-image text-to-video tts tutorial tutorials

Last synced: 30 Sep 2024

https://github.com/thudm/cogview

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".

pretrained-models pytorch text-to-image transformers

Last synced: 30 Sep 2024

https://github.com/yangling0818/rpg-diffusionmaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

image-editting large-language-models multimodal-large-language-models text-to-image

Last synced: 30 Sep 2024

https://github.com/THUDM/CogView

Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".

pretrained-models pytorch text-to-image transformers

Last synced: 01 Aug 2024

https://github.com/YangLing0818/RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

image-editting large-language-models multimodal-large-language-models text-to-image

Last synced: 31 Jul 2024

https://github.com/omerbt/tokenflow

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)

iclr2024 stable-diffusion text-to-image text-to-video tokenflow video-editing

Last synced: 01 Oct 2024

https://github.com/omerbt/TokenFlow

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)

iclr2024 stable-diffusion text-to-image text-to-video tokenflow video-editing

Last synced: 31 Jul 2024

https://github.com/tencentarc/brushnet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 30 Sep 2024

https://github.com/fofr/cog-face-to-many

Turn any face into a video game character, pixel art, claymation, 3D or toy

ai cog comfyui generative-ai replicate text-to-image

Last synced: 30 Sep 2024

https://github.com/TencentARC/BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

diffusion diffusion-models eccv eccv2024 image-inpainting text-to-image

Last synced: 31 Jul 2024

https://github.com/ddPn08/Radiata

Stable diffusion webui based on diffusers.

stable-diffusion stable-diffusion-webui tensorrt text-to-image

Last synced: 02 Aug 2024

https://github.com/lukasHoel/text2room

Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).

3d-generation diffusion-models mesh-generation text-to-image

Last synced: 31 Jul 2024

https://github.com/THUDM/CogView2

official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"

pretrained-models pytorch text-to-image transformer

Last synced: 01 Aug 2024

https://github.com/omerbt/MultiDiffusion

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

diffusion-models generative-model icml image-generation multidiffusion stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/lucidrains/muse-maskgit-pytorch

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

artificial-intelligence attention-mechanisms deep-learning text-to-image transformers

Last synced: 03 Oct 2024

https://github.com/Shilin-LU/TF-ICON

[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)

diffusion-model generative-ai image-composition image-inversion stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/Shilin-LU/TF-ICON?tab=readme-ov-file

[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)

diffusion-model generative-ai image-composition image-inversion stable-diffusion text-to-image

Last synced: 03 Aug 2024

https://github.com/eps696/aphantasia

CLIP + FFT/DWT/RGB = text to image/video

clip text-to-image text-to-video

Last synced: 01 Aug 2024

https://github.com/haofanwang/Lora-for-Diffusers

The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥

aigc colossalai diffusers fine-tuning guidebook lora stable-diffusion stable-diffusion-webui text-to-image

Last synced: 04 Aug 2024

https://github.com/fboulnois/stable-diffusion-docker

Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.

dall-e dalle diffusion docker generative-art huggingface image-generation inpainting midjourney pytorch stable-diffusion tensorflow text-to-image

Last synced: 01 Oct 2024

https://github.com/vicgalle/stable-diffusion-aesthetic-gradients

Personalization for Stable Diffusion via Aesthetic Gradients 🎨

diffusion-models laion stable-diffusion text-to-image text2image

Last synced: 01 Aug 2024

https://yuval-alaluf.github.io/Attend-and-Excite/

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

diffusion-models stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/yuval-alaluf/Attend-and-Excite

Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)

diffusion-models stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/zsdonghao/text-to-image

Generative Adversarial Text to Image Synthesis / Please Star -->

gan tensorflow tensorlayer text-to-image

Last synced: 02 Oct 2024

https://github.com/omriav/blended-diffusion

Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]

blended-diffusion deep-learning diffusion multimodal openai openai-clip text-guided-manipulation text-to-image

Last synced: 31 Jul 2024

https://github.com/lucidrains/parti-pytorch

Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch

artificial-intelligence attention-mechanism deep-learning text-to-image transformers

Last synced: 03 Oct 2024

https://github.com/ironjr/StreamMultiDiffusion

Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."

diffusion-models drawing huggingface-spaces image-generation stable-diffusion stablediffusion3 streaming text-to-image

Last synced: 31 Jul 2024

https://github.com/jaketae/storyteller

Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech

ddpm diffusion-models gpt image-generation natural-language-generation pytorch stable-diffusion text-to-image text-to-speech text-to-video video-generation

Last synced: 04 Aug 2024

https://github.com/google/break-a-scene

Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]

deep-learning diffusion-models generative-ai multimodal text-to-image

Last synced: 31 Jul 2024

https://github.com/afiaka87/clip-guided-diffusion

A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.

artificial-intelligence deep-learning diffusion image-generation multimodal multimodality openai openai-clip text-to-image text-to-image-synthesis

Last synced: 03 Aug 2024

https://github.com/EleutherAI/DALLE-mtf

Open-AI's DALL-E for large scale training in mesh-tensorflow.

artificial-intelligence autoregressive multimodal text-to-image transformers variational-autoencoder

Last synced: 07 Aug 2024

https://github.com/limuloo/MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

aigc computer-vision cvpr cvpr2024 stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/aelnouby/Text-to-Image-Synthesis

Pytorch implementation of Generative Adversarial Text-to-Image Synthesis paper

gans image-generation pytorch text-to-image zero-shot-learning

Last synced: 03 Aug 2024

https://github.com/nerdyrodent/CLIP-Guided-Diffusion

Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.

openai-clip text-to-image text2image

Last synced: 03 Aug 2024

https://github.com/TonyLianLong/LLM-groundedDiffusion

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD)

image-generation llm stable-diffusion stable-diffusion-webui text-to-image

Last synced: 31 Jul 2024

https://github.com/awekrx/ChatGPT-MidJourney-prompt

This is a ChatGPT based prompt generation model for MidJorney. The purpose of this model is to simplify the creation of images and increase their creativity. By introducing a partial hint, ChatGPT creates a follow-up that can be used to stimulate creativity and provide new ideas.

ai ai-art ai-painting chatgpt midjourney prompt prompt-engineering text-to-image text2img

Last synced: 31 Jul 2024

https://github.com/mkshing/e4t-diffusion

Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models

deep-learning diffusion-models stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/tobran/DF-GAN

A Simple and Effective Baseline for Text-to-Image Synthesis (CVPR2022 oral)

generative-adversarial-network text-to-image

Last synced: 01 Aug 2024

https://github.com/AssemblyAI-Community/MinImagen

MinImagen: A minimal implementation of the Imagen text-to-image model

deep-learning diffusion-models imagen pytorch super-resolution text-to-image

Last synced: 06 Aug 2024

https://github.com/samuraigpt/ai-youtube-shorts-generator

A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.

ai-video-generator artificial-intelligence image-to-video image-to-video-generation shorts shorts-maker sora-video sora-video-ai stable-diffusion text-to-image text-to-video text-to-video-generation video-diffusion video-editing video-generation video-generator youtube-shorts

Last synced: 17 Sep 2024

https://github.com/garibida/cross-image-attention

Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"

appearance-transfer diffusion-models stable-diffusion style-transfer text-to-image

Last synced: 31 Jul 2024

https://garibida.github.io/cross-image-attention/

Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"

appearance-transfer diffusion-models stable-diffusion style-transfer text-to-image

Last synced: 31 Jul 2024

https://github.com/OSU-NLP-Group/MagicBrush

[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

diffusion-models image-editing image-generation image-synthesis instruction-following text-to-image text-to-image-generation text-to-image-synthesis

Last synced: 01 Aug 2024

https://kfirgoldberg.github.io/ConceptLab/

Official Implementation for "ConceptLab: Creative Generation using Diffusion Prior Constraints"

creative-generation diffusion-models diffusion-prior personalization text-to-image

Last synced: 31 Jul 2024

https://github.com/kfirgoldberg/ConceptLab

Official Implementation for "ConceptLab: Creative Generation using Diffusion Prior Constraints"

creative-generation diffusion-models diffusion-prior personalization text-to-image

Last synced: 31 Jul 2024

https://github.com/gojasper/flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

diffusion-models distillation dit inpainting sdxl super-resolution text-to-image

Last synced: 31 Jul 2024

https://github.com/lucidrains/perfusion-pytorch

Implementation of Key-Locked Rank One Editing, from Nvidia AI

artificial-intelligence deep-learning memory-editing text-to-image

Last synced: 03 Oct 2024

https://github.com/tobran/GALIP

[CVPR2023] A faster, smaller, and better text-to-image model for large-scale training

pytorch text-to-image

Last synced: 31 Jul 2024

https://github.com/VinAIResearch/Anti-DreamBooth

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)

adversarial-attacks dreambooth personalization stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/mihirp1998/AlignProp

AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion

alignment diffusion-models reinforcement-learning stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/zhang-zx/SINE

This respository contains the code for the CVPR 2023 paper SINE: SINgle Image Editing with Text-to-Image Diffusion Models.

diffusion-models image-editing stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://zhang-zx.github.io/SINE/

This respository contains the code for the CVPR 2023 paper SINE: SINgle Image Editing with Text-to-Image Diffusion Models.

diffusion-models image-editing stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/sayakpaul/diffusers-torchao

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

architecture-optimization diffusion-models flux text-to-image torch torch-compile torchao

Last synced: 03 Oct 2024

https://garibida.github.io/ReNoise-Inversion/

Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"

diffusion-models image-editing inversion lcm lcm-lora sdxl-turbo stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/garibida/ReNoise-Inversion

Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"

diffusion-models image-editing inversion lcm lcm-lora sdxl-turbo stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/mehdidc/feed_forward_vqgan_clip

Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt

generative-model openai-clip text-to-image vqgan

Last synced: 03 Aug 2024

https://github.com/dvlab-research/RIVAL

[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain

diffusion-models image-variations style-transfer text-to-image

Last synced: 31 Jul 2024

https://github.com/HFAiLab/clip-gen

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

clip pytorch text-to-image text2image

Last synced: 01 Aug 2024

https://github.com/IBM/DiffuseKronA

DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models

diffusion-models efficiency personalization text-to-image

Last synced: 31 Jul 2024

https://github.com/TonyLianLong/LLM-groundedVideoDiffusion

[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper

diffusion diffusion-models large-language-models llm text-to-image text-to-video text-to-video-generation video-generation

Last synced: 31 Jul 2024

https://github.com/lucasepe/crumbs

Turn asterisk-indented text lines into mind maps

commandline golang graphviz mindmaps text-to-image

Last synced: 10 Aug 2024

https://github.com/RockeyCoss/SPO

Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step

diffusion-models dpo sdxl text-to-image text-to-image-generation

Last synced: 31 Jul 2024

https://github.com/bahjat-kawar/time-diffusion

Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"

diffusion diffusion-models knowledge-editing stable-diffusion text-to-image text2image

Last synced: 31 Jul 2024

https://github.com/customdiffusion360/custom-diffusion360

CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control

camera-pose-control computer-vision customization generative-model pytorch stable-diffusion text-to-image

Last synced: 31 Jul 2024

https://github.com/j-min/VPGen

Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)

pytorch step-by-step text-to-image text-to-image-evaluation text-to-image-generation

Last synced: 31 Jul 2024

https://github.com/sungnyun/diffblender

DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models

diffusion generative-model multimodal text-to-image

Last synced: 31 Jul 2024

https://sungnyun.github.io/diffblender/

DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models

diffusion generative-model multimodal text-to-image

Last synced: 31 Jul 2024