https://github.com/Franceshe/awesome-generative-models

A collection of awesome generative model papers, frameworks, libraries, software and resources for text, image, video, animation, code generation
https://github.com/Franceshe/awesome-generative-models

List: awesome-generative-models

biggan-models code-generation deep-learning generative-adversarial-network generative-art generative-model image-generation image-processing machine-learning music-generation question-answering question-generation static-analysis text-generation video-generation

Last synced: 7 months ago
JSON representation

A collection of awesome generative model papers, frameworks, libraries, software and resources for text, image, video, animation, code generation

Host: GitHub
URL: https://github.com/Franceshe/awesome-generative-models
Owner: Franceshe
Created: 2020-03-12T09:56:58.000Z (over 5 years ago)
Default Branch: master
Last Pushed: 2021-05-27T14:45:46.000Z (about 4 years ago)
Last Synced: 2024-12-03T00:02:12.408Z (7 months ago)
Topics: biggan-models, code-generation, deep-learning, generative-adversarial-network, generative-art, generative-model, image-generation, image-processing, machine-learning, music-generation, question-answering, question-generation, static-analysis, text-generation, video-generation
Homepage:
Size: 62.5 KB
Stars: 23
Watchers: 3
Forks: 2
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

ultimate-awesome - awesome-generative-models - A collection of awesome generative model papers, frameworks, libraries, software and resources for text, image, video, animation, code generation. (Other Lists / Julia Lists)

README

        # awesome-generative-models

A curated list of awesome generative model frameworks, libraries, software and resources for media production

inspired by [awesome-python](https://github.com/vinta/awesome-python)



	

	


	


	

	

		



			^{My open source work is supported by the community}





- [Awesome Generative Mdels](#awesome-generative-models)

	- [Code generation: ML algorithm + Static Analysis](#CodeGeration)

    - [Code to GUI: algorithm to user interface](#CodeToGUI)

	- [Image Synthesis](#imageSynthesis)

		- [Image Style Transfer](#ImageStyleTransfer)

    - [Text Generation/NLP](#NLP)

		- [Audio and music generation/processing](#AudioAndMusicGenerationProcessing)

		- [Music-Video Synthesis](#MusicVideoSynthesis)

    - [Video-Synth](#videoSynth)

	- [Procedural Generation for gaming](#ProceduralGenerationForGaming)

	- [Image-Generation](#imageGeneration)

    - [Forum](#forum)

## Code to GUI: algorithm to user interface:

* [Screenshoot to Code](https://github.com/emilwallner/Screenshot-to-code)

* [pix2code: Generating Code from a Graphical User Interface Screenshot](https://github.com/tonybeltramelli/pix2code)

* [Bred Victor: Machine learning in Engineering inference from enviroment](http://worrydream.com/#!/MagicInk)

## Code generation: ML algorithm + Static Analysis

* [Tabnine: Autocompletion with deep learning](https://www.tabnine.com/blog/deep/)

* [Kite: Algorithm assisted coder completion ](https://kite.com/)

* Insignt:

  Standard code completion tools often still use alphabetical sorting, while Kite uses ML algorithms to infer what a developer is likely trying to do

## Image Synthesis

* [SPADE by NVlabs: Synthesizing photorealistic images given an input semantic layout](https://nvlabs.github.io/SPADE/).

  [code](https://github.com/nvlabs/spade/)

## Image Style Transfer

* [Artbreeder based on BigGAN models](https://artbreeder.com/)

   [opensource version](https://github.com/joel-simon/ganbreeder)

   [BigGAN models](https://tfhub.dev/deepmind/biggan-512/2)

   [About](https://artbreeder.com/about)

* A cool artistic project: 

... DGSpitzer(Eddie Hu) a indie game maker Use [Artbreeder based on BigGAN models](https://artbreeder.com/), StyleGAN-Artm Realistic-Neural_Talking_Head_Modelsm First-Order_Modelsm DAIN

and Topaz Lab for some great work on digital video repair, which colorize black and while video.

* A cool project: [Visual Novel using ML Style transfer](https://medium.com/@pbriod/how-i-used-artificial-intelligence-to-make-my-first-graphic-novel-about-my-trip-to-india-although-ff4af46fd6a3)

## NLP: Text-Sentiment-Analysis:

* [DeepMoji:a sentiment analysis model](https://arxiv.org/pdf/1708.00524.pdf) **This model also used in [15.ai](https://15.ai/about), a text-to-speech tool for generating voices of various characters.**

## Text-to-Image:

* dataset: [Google’s “Quick Draw” open source dataset](https://opensource.google/projects/quickdrawdataset)

  [github](https://github.com/googlecreativelab/quickdraw-dataset)

## Text-to-Video:

* AllenNLP: [Imagine This! Scripts to Compositions to Videos](https://prior.allenai.org/projects/craft)

## Image Captioning

* [Tensorflow core: Image captioning with visual attention](https://www.tensorflow.org/tutorials/text/image_captioning)

## Text-to-Animation

* [Generating Animations from Screenplays](https://arxiv.org/pdf/1904.05440.pdf)

## Text Generation/NLP

### Question Generation:

* [Question Generation: generate multiple choice answers from text](https://github.com/KristiyanVachev/Question-Generation)

### Other

* [OpenAI 1.5 billion params GPT-2 release](https://openai.com/blog/gpt-2-1-5b-release/)

* [AIDungeon](https://github.com/AIDungeon/AIDungeon)

* [ctrl-gce: CTRL text-generating model on Google Compute Engine with just a few console commands.](https://github.com/minimaxir/ctrl-gce), why google compute engine: The CTRL model is so large (12 GB on disk, 15.5 GB GPU VRAM when loaded, even more system RAM during runtime) that it will currently not fit into a free Colaboratory or Kaggle Notebook. 

* [Writing with the machine](https://www.robinsloan.com/notes/writing-with-the-machine/)

  [scifi corpus txt dataset](https://www.kaggle.com/jannesklaas/scifi-stories-text-corpus)

  Could be useful for gpt2 model fine tune

* [GPT2-Chinese-wuxiao-novel](https://leemeng.tw/gpt2-language-model-generate-chinese-jing-yong-novels.html)

* [GPT-2 Chinese](https://github.com/Morizeyao/GPT2-Chinese)

* [Hugging face Transformer]-Pytorch hub 

### Text Corpus/dataset

* [Sci-fi-Script](http://www.scifiscripts.com/)

* [Detroit-Becoming-Human](https://github.com/detroitbecometext/detroitbecometext.github.io)

   Could be source to analyze dialogue tree and decision tree structure.

   

## Audio and music generation/processing

### Project:

* [Talking like your favorite character: Text-To-Speech audio generation ](https://fifteen.ai/app)

related research:

* [Tacotron2](https://github.com/NVIDIA/tacotron2)

* [ForwardTacotron: Tacotron2 without attention](https://github.com/as-ideas/ForwardTacotron)

* Voice clone: [Real-Time-Voice-Cloning](https://github.com/CorentinJ/Real-Time-Voice-Cloning)

* [Spleeter: sound track seperation](https://github.com/deezer/spleeter)

  sound seperation is under domain of music information retrival.

* [Ambient Generative Music by Alex Bainter](https://generative.fm/)

  Although this project is not generated by algorithm, gives much inspiration in the field of music generation.

  [Medium](https://medium.com/@metalex9)

  [code](https://github.com/generative-music/pieces-alex-bainter/blob/master/packages/piece-trees/src/piece.js)

*[NeuralFunk: Sound design with ML](https://towardsdatascience.com/neuralfunk-combining-deep-learning-with-sound-design-91935759d628)

### Music-Video Synthesis

* [Deep Music Visualizer using BigGan](https://towardsdatascience.com/the-deep-music-visualizer-using-sound-to-explore-the-latent-space-of-biggan-198cd37dac9a)

  [code](https://github.com/msieg/deep-music-visualizer)

## Video Synthesis and Generation:

* [pix2pix-tensorflow: poweed interative rendereed Virtual world](https://www.youtube.com/watch?v=ayPqjPekn7g)

  [code](https://github.com/affinelayer/pix2pix-tensorflow)

   update: pretrained model added

   [more about](https://affinelayer.com/pix2pix/)

   [colab](https://www.tensorflow.org/tutorials/generative/pix2pix)

   * Using this technique we can colorize black and white photos, convert google maps to google earth, etc.

* [CRAFT, which generates cartoons based on text descritpionsa](https://arxiv.org/abs/1804.03608)

  A very creative work involved text to video generation from allen nlp

  by [researcher](http://tanmaygupta.info/publications/)

  [Project page](https://prior.allenai.org/projects/craft)

  [Video](https://www.youtube.com/watch?v=688Vv86n0z8&feature=youtu.be)

## Audio Systhesis

* [nvidia's taco2 pytorch implementation](https://github.com/NVIDIA/tacotron2)

* [real-time-voice-clone](https://github.com/CorentinJ/Real-Time-Voice-Cloning)

## ProceduralGenerationForGaming

### Map generation

* [AI-Powered Procedural Fantasy Map Generator](https://80.lv/articles/ai-powered-procedural-fantasy-map-generator/), reference: [@linonetwo's blog](https://onetwo.ren/wiki/#:%E5%9C%B0%E5%9B%BE%E7%94%9F%E6%88%90%E7%9A%84%E4%BE%8B%E5%AD%90)

## Animation

* [deep learning for character animation and control](https://github.com/sebastianstarke/AI4Animation)

* [DeepMimic: Motion imitation with deep reinforcement learning](https://github.com/xbpeng/DeepMimic)

# Distributed Training:

* [MPI Reduce and Allreduce](https://mpitutorial.com/tutorials/mpi-reduce-and-allreduce/)

   Very useful tutorial to illustrate the concept of MPT.

   CHECK [CODE](https://github.com/wesleykendall/mpitutorial/tree/gh-pages)

* Also check on [Tensorflow-core](https://www.tensorflow.org/tutorials/distribute/keras)

* [Distributed Traning strategy](https://www.tensorflow.org/guide/distributed_training)

## Computer Vision application in gaming

* [E-Sports Talent Scouting Based on Multimodal Twitch Stream Data](https://arxiv.org/pdf/1907.01615.pdf)

  [data acquisition and modeling code](https://github.com/mug31416/E-sports-on-Twitch/tree/master/data)

  [Chat Log](https://github.com/bernardopires/twitch-chat-logger)

  [Twich Stream](https://www.godo.dev/tutorials/python-record-twitch/)

## Paper with code

* [Paper with code](https://paperswithcode.com/)

## Resources on related course:

* [UIUC CS598RK: HCI for ML](https://courses.grainger.illinois.edu/cs598rk/fa2019/)

* [coursera: Sequence Models](https://www.coursera.org/learn/nlp-sequence-models/)

* [Full Stack Deep Learning](https://fullstackdeeplearning.com/)

* fast.ai

* [UIUC: ECE420 Video processing lab](https://courses.grainger.illinois.edu/ece420/fa2019/lab7/lab/)

* GPU free resource:[Setting Up a Google Cloud Instance GPU for fast.ai for Free](https://medium.com/@jamsawamsa/running-a-google-cloud-gpu-for-fast-ai-for-free-5f89c707bae6)

## Cloud computing set up

* [AWS](https://sebastianraschka.com/pdf/books/dlb/appendix_cloud-computing.pdf)

* [Google colab](https://medium.com/@mslavescu/try-live-ssd-object-detection-mask-r-cnn-object-detection-and-instance-segmentation-sfmlearner-df62bdc97d52)

## Full Stack Deep Learning tools in data processing pipeline:

* [cortex: Deploy machine learning models in production possibly without docker and kubernetes](https://github.com/cortexlabs/cortex)

   [medium](https://towardsdatascience.com/@calebkaiser)

## Resources for dev tool:

* Lumen: Video syth software

* [Dialogue tree based node editor for unity](https://github.com/Seneral/Node_Editor_Framework)

  and [example](https://github.com/Seneral/Node_Editor_Framework/tree/Examples/Dialogue-System)

  [demo](https://nodeeditor.seneral.dev/Examples.html)

## More resources on pretrained model:

 * [Pytorch hub]https://pytorch.org/hub/research-models

 * Tensorflow hub

## MEET mind-linked people in forum:

* [reddit-MediaSynthesis](https://www.reddit.com/r/MediaSynthesis/)

## Application:

### Generating voices of various characters: Ideally podcast and gaming etc:

* [15.ai](https://15.ai/about)

### FILM MAKING:

* Black Mirror: Bandersnatch

  [Show case: dialogue tree](https://www.reddit.com/r/blackmirror/comments/aajk5r/full_bandersnatch_flowchart_all_branches_story/)

* [LATE-SHIFT](https://lateshift-movie.com/)

## Design

* [Algorithm Driven Design](https://algorithms.design/)

* [Algorithm generated Logo](https://app.brandmark.io/)

## Reference:

* For details about how to represent animation architecture from software perspective

and in math. Check Chapter11-Animation System of Game Engine Architecture,2nd edition

by Jason Gregory

* [Kite's new AI model](https://techcrunch.com/2019/01/28/kite-raises-17m-for-its-ai-driven-code-completion-tool/)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/Franceshe/awesome-generative-models

Awesome Lists containing this project

README