Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/natowi/deep-learning-applications-with-gui
List of FOSS Deep Learning Applications with GUI and/or CLI
https://github.com/natowi/deep-learning-applications-with-gui
ai ai-tools cnn colorization deep-learning foss-deep-learning gan generative-adversarial-network gtp gui gui-application list llm stable-diffusion ui upscaling
Last synced: about 1 month ago
JSON representation
List of FOSS Deep Learning Applications with GUI and/or CLI
- Host: GitHub
- URL: https://github.com/natowi/deep-learning-applications-with-gui
- Owner: natowi
- License: cc0-1.0
- Created: 2020-11-22T19:16:35.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2024-02-24T19:57:31.000Z (11 months ago)
- Last Synced: 2024-11-06T01:39:22.505Z (2 months ago)
- Topics: ai, ai-tools, cnn, colorization, deep-learning, foss-deep-learning, gan, generative-adversarial-network, gtp, gui, gui-application, list, llm, stable-diffusion, ui, upscaling
- Homepage:
- Size: 144 KB
- Stars: 75
- Watchers: 9
- Forks: 8
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Deep-Learning-Applications-with-GUI
List of FOSS Deep Learning (Desktop) Applications with GUI and/or CLI. Most Deep Learning models require a complicated setup and are not ready to use out of the box. At best there is a Colab or Jupiter notebook available, which is good for testing, but not for production use.
The software in this list is ready to use (CLI or GUI).
:camera_flash: NeRF π Text π¨ Image :notes: Audio ποΈ Video frame interpolation πΌοΈποΈ Video upscaler πΌοΈ Image/Video upscaling 𧩠Ready to use Plugins
# :camera_flash: 3D / NeRF: Neural Radiance Fields
https://github.com/NVlabs/instant-ngp#interactive-training-and-rendering
# π Text
## Jan
Jan is an open-source ChatGPT alternative that runs 100% offline on your computer.
https://github.com/janhq/jan
## text-generation-webui
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.
Its goal is to become the AUTOMATIC1111/stable-diffusion-webui of text generation.
https://github.com/oobabooga/text-generation-webui
## chatai
https://github.com/Capsize-Games/chatai
## Audio to text
https://github.com/Const-me/Whisperhttps://github.com/kaixxx/noScribe
https://github.com/CheshireCC/fatser-whisper-GUI (highly recommended)
https://github.com/Purfview/whisper-standalone-win (ready to use .exe for cli use)
https://github.com/FL33TW00D/whisper-turbo (wip)
## subtitle
https://github.com/URUWorks/TeroSubtitler (whisper support)## video-subtitle-generator
https://github.com/YaoFANGUK/video-subtitle-generator## Translation
https://github.com/argosopentech/argos-translate#installation
https://github.com/ymoslem/DesktopTranslator## Chat-Style LLM (Alpaca/Llama) (Chat GTP Clone)
https://github.com/antimatter15/alpaca.cpp
https://github.com/tatsu-lab/stanford_alpaca
https://github.com/tloen/alpaca-lora## dalai
Run LLaMA and Alpaca on your computer.
https://github.com/cocktailpeanut/dalai
## GTP4All
Installs a native chat-client with auto-update functionality that runs on your desktop with the GPT4All-J model baked into it.
https://github.com/nomic-ai/gpt4all
## Gpt4All Web UI
This is a Flask web application that provides a chat UI for interacting with llamacpp based chatbots such as GPT4all, vicuna etc...
https://github.com/nomic-ai/gpt4all-ui
## AutoGPT GUI
A graphical user interface to AutoGPT
https://github.com/thecookingsenpai/autogpt-gui
# π¨ Image
## Fooocus
Fooocus is an image generating software.
https://github.com/lllyasviel/Fooocus
## Style2Paints
Style2paints V4 is the current best AI driven lineart **colorization tool**.
https://github.com/lllyasviel/style2paints
Source code not available for the latest release.
JavaScript, Python, Tensorflow
## StableStudio
StableStudio, the open-source version of DreamStudio!
https://github.com/Stability-AI/StableStudio
## NMKD Stable Diffusion GUI - AI Image Generator
https://github.com/n00mkrad/text2image-gui
https://nmkd.itch.io/t2i-gui
## stable-diffusion-webui-forge
https://github.com/lllyasviel/stable-diffusion-webui-forge
## stable-diffusion-ui
Easiest 1-click way to install and use Stable Diffusion on your own computer. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image
https://github.com/cmdr2/stable-diffusion-ui
## MochiDiffusion
This app uses Apple's Core ML Stable Diffusion implementation to achieve maximum performance and speed on Apple Silicon based Macs while reducing memory requirements.
https://github.com/godly-devotion/MochiDiffusion
## ComfyUI
This ui will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart based interface. For some workflow examples and see what ComfyUI can do you can check out:
https://github.com/comfyanonymous/ComfyUI
## Stable Diffusion on AMD GPU
https://github.com/azuritecoin/OnnxDiffusersUI (deprecated)
https://github.com/lshqqytiger/stable-diffusion-webui-directml## Diffusion Bee - Stable Diffusion GUI App for MacOS
https://github.com/divamgupta/diffusionbee-stable-diffusion-ui
## WebUI
ready to use colab based notebooks with web ui
https://github.com/camenduru/stable-diffusion-webui-colab
## InvokeAI
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
https://github.com/invoke-ai/InvokeAI
## airunner
https://github.com/Capsize-Games/airunner $
## Lama Cleaner
A free and open-source inpainting tool powered by SOTA AI model.
https://github.com/Sanster/lama-cleaner
https://panicbyte.itch.io/lama-cleaner
## carefree-creator
AI magics meet Infinite draw board.
https://github.com/carefree0910/carefree-creator
## photoshot
An open-source AI avatar generator web app
https://github.com/shinework/photoshot
## DeOldify.NET
DeOldify.NET: cross-platform application for coloring black and white photos"
https://github.com/ColorfulSoft/DeOldify.NET
## Queryable
Offline photo search App for iOS
https://github.com/mazzzystar/Queryable
## X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
https://github.com/CVHub520/X-AnyLabeling
# :notes: Audio
## clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio
https://github.com/jianchang512/clone-voice
## tacotron2-tts-GUI
GUI wrapper for synthesize. Allows CPU-only synthesis via a toggleable switch. Portable exe file is available (that runs on CPU only).
https://github.com/lokkelvin2/tacotron2-tts-GUI
## riffusion
Riffusion is an app for real-time music generation with stable diffusion.
https://github.com/riffusion/riffusion-app
## whisper-ui
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.
This is a simple Streamlit UI for OpenAI's Whisper speech-to-text model. It let's you download and transcribe media from YouTube videos, playlists, or local files. You can then browse, filter, and search through your saved audio files.
https://github.com/hayabhay/whisper-ui for https://github.com/openai/whisper
https://grisk.itch.io/whisper-gui
## buzz
GUI for whisper
https://github.com/chidiwilliams/buzz
## Ultimate Vocal Remover GUI v5.5.1
This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).
https://github.com/Anjok07/ultimatevocalremovergui
## SpleeterGUI
SpleeterGUI - Music source separation desktop app
Windows Desktop Front end for Spleeter - AI source separation
https://github.com/boy1dr/SpleeterGui
## SoftVC VITS Singing Voice Conversion Fork
https://github.com/34j/so-vits-svc-fork
## subsync subtitle alignment
This is an automatic movie subtitle synchronization tool. Synchronization is done by listening to the audio track, translating it if necessary.
https://github.com/sc0ty/subsync/
# ποΈ Video frame interpolation
## Video Dubbing
https://github.com/jianchang512/pyvideotrans
## DAIN-App
The backbone of this project is a algorithm called DAIN. (https://github.com/baowenbo/DAIN)
https://github.com/BurguerJohn/Dain-App
https://grisk.itch.io/dain-app?download
## DAIN Vulkan GUI
AI-Powered video interpolater (eg. 30fps -> 60fps) for Vulkan devices. Based on dain-ncnn-vulkan and ffmpeg
https://github.com/Mar2ck/DAIN-Vulkan-GUI
## dain-ncnn-vulkan
ncnn implementation of DAIN, Depth-Aware Video Frame Interpolation.
https://github.com/nihui/dain-ncnn-vulkan
## Dain-App
https://github.com/BurguerJohn/Dain-App
## cain-ncnn-vulkan
Video Frame Interpolation
https://github.com/nihui/cain-ncnn-vulkan
## Flowframes
Flowframes is a simple but powerful app that utilizes AI frameworks to interpolate videos in order to increase their framerate with little to no noticable quality loss. (DAIN, CAIN, RIFE)
https://github.com/n00mkrad/flowframes
https://nmkd.itch.io/flowframes
# πΌοΈποΈ Video upscaler
## video2x
A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR. Started in Hack the Valley 2, 2018.
https://github.com/k4yt3x/video2x## https://github.com/Communist02/AnimePlayer
## ποΈ Video Deep Face
https://github.com/iperov/DeepFaceLive
# πΌοΈ Image/Video upscaling
## Final2x
This is a powerful tool that allows for image super-resolution to arbitrary sizes using multiple models, designed to enhance the resolution and quality of images, making them clearer and more detailed. Presently, it supports several models, such as RealCUGAN, RealESRGAN, Waifu2x, and SRMD.
https://github.com/Tohrusky/Final2x
## Upscayl
Upscayl is a cross-platform application built with the Linux-first philosophy. This means that we prioritize Linux builds over others but that doesn't mean we'll break things for other OSes :) Makes use of Real-ESRGANhttps://github.com/upscayl/upscayl
## Anime4kSharp
https://github.com/shadow578/Anime4kSharp
## dandere2x-tremx
video compression
https://github.com/Tremeschin/dandere2x-tremx## Cupscale
Image Upscaling GUI based on ESRGAN - WORK IN PROGRESS
https://github.com/n00mkrad/cupscale
## IEU.Winforms
Windows only GUI for ESRGAN with additional features
https://github.com/ptrsuder/IEU.Winforms## NVIDIA-Ansel-AI-Enhancer
NVIDIA Upscaler, utilizes NVIDIAΒ΄s Ansel RTX. Requires latest drivers and a RTX GPU. Free, but not open source.
https://github.com/dynamiquel/NVIDIA-Ansel-AI-Enhancer
## RealSR
Real-World Super-Resolution via Kernel Estimation and Noise Injection
https://github.com/jixiaozhong/RealSR
(https://github.com/nihui/RealSR)
## realsr-ncnn-vulkan
https://github.com/nihui/realsr-ncnn-vulkan
## Shell upscaler
https://github.com/n00mkrad/shell-upscaler
Acts as a GUI for esrgan-launcher.
## srmd-ncnn-vulkan
https://github.com/nihui/srmd-ncnn-vulkan
## VapourSynth-Super-Resolution-Helper
https://github.com/AlphaAtlas/VapourSynth-Super-Resolution-Helper
## Waifu2x-Extension-GUI
Photo/Video/GIF enlargement using machine learning
https://github.com/AaronFeng753/Waifu2x-Extension-GUI
C++, python, qt, caffe, ncnn_vulkan
AGPL-3.0 License
## waifu2x-ncnn-vulkan
https://github.com/nihui/waifu2x-ncnn-vulkan
# GAN / Style Transfer
## TorchRayLib
TorchRayLib++: A CMake based AI & generative art platform, integrating the rayib GUI and the PyTorch C++ Deep Learning Library.
https://github.com/QuantScientist/TorchRayLib
## more
https://github.com/Seprentes/ScaleUp
https://github.com/HolyWu/vs-realesrgan
https://github.com/tumuyan/RealSR-NCNN-Android
https://github.com/TransparentLC/realesrgan-gui/blob/master/README.en-US.md
https://github.com/Lucchetto/SuperImage
https://github.com/Tohrusky/Final2x# :toolbox: DL Training GUI
## AIDeveloper
GUI-based software for training, evaluating and applying deep neural nets for image classification
https://github.com/maikherbig/AIDeveloper
# 𧩠Ready to use Plugins
## vstSpleeter
audio source separation library vst plugin
https://github.com/gvne/vstSpleeter
---
## various apps (untested)
Paper2GUI is an AI desktop Application toolbox for everyone no matter your background with it being free and ready to use out of the box, already supporting 10+ AI models, covering the fields of speech synthesis, video frame filling, video overscoring, target detection, image stylization, OCR recognition, etc. Supporting Windows, Mac OS, Linux systems.
https://github.com/Baiyuetribe/paper2gui/blob/main/README_en.md
Free: https://www.nvidia.com/en-us/studio/canvas/