{"id":17052982,"url":"https://github.com/natowi/deep-learning-applications-with-gui","last_synced_at":"2025-04-30T22:45:51.938Z","repository":{"id":95535700,"uuid":"315113644","full_name":"natowi/Deep-Learning-Applications-with-GUI","owner":"natowi","description":"List of FOSS Deep Learning Applications with GUI and/or CLI","archived":false,"fork":false,"pushed_at":"2024-02-24T19:57:31.000Z","size":147,"stargazers_count":86,"open_issues_count":0,"forks_count":9,"subscribers_count":8,"default_branch":"main","last_synced_at":"2025-04-30T22:45:47.509Z","etag":null,"topics":["ai","ai-tools","cnn","colorization","deep-learning","foss-deep-learning","gan","generative-adversarial-network","gtp","gui","gui-application","list","llm","stable-diffusion","ui","upscaling"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"cc0-1.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/natowi.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-11-22T19:16:35.000Z","updated_at":"2025-04-23T19:59:53.000Z","dependencies_parsed_at":null,"dependency_job_id":"2e4235ed-35f9-4b46-9a48-422feacc72a7","html_url":"https://github.com/natowi/Deep-Learning-Applications-with-GUI","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/natowi%2FDeep-Learning-Applications-with-GUI","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/natowi%2FDeep-Learning-Applications-with-GUI/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/natowi%2FDeep-Learning-Applications-with-GUI/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/natowi%2FDeep-Learning-Applications-with-GUI/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/natowi","download_url":"https://codeload.github.com/natowi/Deep-Learning-Applications-with-GUI/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":251795391,"owners_count":21645019,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","ai-tools","cnn","colorization","deep-learning","foss-deep-learning","gan","generative-adversarial-network","gtp","gui","gui-application","list","llm","stable-diffusion","ui","upscaling"],"created_at":"2024-10-14T10:11:03.298Z","updated_at":"2025-04-30T22:45:51.910Z","avatar_url":"https://github.com/natowi.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# Deep-Learning-Applications-with-GUI\n\nList of FOSS Deep Learning (Desktop) Applications with GUI and/or CLI. Most Deep Learning models require a complicated setup and are not ready to use out of the box. At best there is a Colab or Jupiter notebook available, which is good for testing, but not for production use.\n\nThe software in this list is ready to use (CLI or GUI).\n\n:camera_flash: NeRF 📃 Text 🎨 Image :notes: Audio 🎞️ Video frame interpolation 🖼️🎞️ Video upscaler 🖼️ Image/Video upscaling 🧩 Ready to use Plugins\n\n# :camera_flash: 3D / NeRF: Neural Radiance Fields\n\nhttps://github.com/NVlabs/instant-ngp#interactive-training-and-rendering\n\n# 📃 Text\n\n## Jan\n\nJan is an open-source ChatGPT alternative that runs 100% offline on your computer.\n\nhttps://github.com/janhq/jan\n\n## text-generation-webui\n\nA gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.\n\nIts goal is to become the AUTOMATIC1111/stable-diffusion-webui of text generation.\n\nhttps://github.com/oobabooga/text-generation-webui\n\n## chatai\n\nhttps://github.com/Capsize-Games/chatai\n\n## Audio to text\nhttps://github.com/Const-me/Whisper\n\nhttps://github.com/kaixxx/noScribe\n\nhttps://github.com/CheshireCC/fatser-whisper-GUI  (highly recommended)\n\nhttps://github.com/Purfview/whisper-standalone-win (ready to use .exe for cli use)\n\nhttps://github.com/FL33TW00D/whisper-turbo (wip)\n\n## subtitle\nhttps://github.com/URUWorks/TeroSubtitler (whisper support)\n\n## video-subtitle-generator\nhttps://github.com/YaoFANGUK/video-subtitle-generator\n\n## Translation\nhttps://github.com/argosopentech/argos-translate#installation\nhttps://github.com/ymoslem/DesktopTranslator\n\n## Chat-Style LLM (Alpaca/Llama) (Chat GTP Clone)\nhttps://github.com/antimatter15/alpaca.cpp\nhttps://github.com/tatsu-lab/stanford_alpaca\nhttps://github.com/tloen/alpaca-lora\n\n## dalai\n\nRun LLaMA and Alpaca on your computer.\n\nhttps://github.com/cocktailpeanut/dalai\n\n## GTP4All\n\nInstalls a native chat-client with auto-update functionality that runs on your desktop with the GPT4All-J model baked into it.\n\nhttps://github.com/nomic-ai/gpt4all\n\n## Gpt4All Web UI\n\nThis is a Flask web application that provides a chat UI for interacting with llamacpp based chatbots such as GPT4all, vicuna etc...\n\nhttps://github.com/nomic-ai/gpt4all-ui\n\n## AutoGPT GUI\n\nA graphical user interface to AutoGPT\n\nhttps://github.com/thecookingsenpai/autogpt-gui\n\n# 🎨 Image\n\n## Fooocus\n\nFooocus is an image generating software. \n\nhttps://github.com/lllyasviel/Fooocus\n\n## Style2Paints\n\nStyle2paints V4 is the current best AI driven lineart **colorization tool**.\n\nhttps://github.com/lllyasviel/style2paints\n\nSource code not available for the latest release.\n\nJavaScript, Python, Tensorflow\n\n## StableStudio\n\nStableStudio, the open-source version of DreamStudio!\n\nhttps://github.com/Stability-AI/StableStudio\n\n## NMKD Stable Diffusion GUI - AI Image Generator\n\nhttps://github.com/n00mkrad/text2image-gui\n\nhttps://nmkd.itch.io/t2i-gui\n\n## stable-diffusion-webui-forge\n\nhttps://github.com/lllyasviel/stable-diffusion-webui-forge\n\n## stable-diffusion-ui\n\nEasiest 1-click way to install and use Stable Diffusion on your own computer. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image\n\nhttps://github.com/cmdr2/stable-diffusion-ui\n\n## MochiDiffusion\n\nThis app uses Apple's Core ML Stable Diffusion implementation to achieve maximum performance and speed on Apple Silicon based Macs while reducing memory requirements.\n\nhttps://github.com/godly-devotion/MochiDiffusion\n\n## ComfyUI\n\nThis ui will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart based interface. For some workflow examples and see what ComfyUI can do you can check out:\n\nhttps://github.com/comfyanonymous/ComfyUI\n\n## Stable Diffusion on AMD GPU\n\nhttps://github.com/azuritecoin/OnnxDiffusersUI (deprecated)\nhttps://github.com/lshqqytiger/stable-diffusion-webui-directml\n\n## Diffusion Bee - Stable Diffusion GUI App for MacOS\n\nhttps://github.com/divamgupta/diffusionbee-stable-diffusion-ui\n\n## WebUI \n\nready to use colab based notebooks with web ui\n\nhttps://github.com/camenduru/stable-diffusion-webui-colab\n\n## InvokeAI\n\n InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products. \n\nhttps://github.com/invoke-ai/InvokeAI\n\n## airunner\n\nhttps://github.com/Capsize-Games/airunner $\n\n\n## Lama Cleaner\n\nA free and open-source inpainting tool powered by SOTA AI model.\n\nhttps://github.com/Sanster/lama-cleaner\n\nhttps://panicbyte.itch.io/lama-cleaner\n\n## carefree-creator\n\n AI magics meet Infinite draw board. \n\nhttps://github.com/carefree0910/carefree-creator\n\n## photoshot\n\nAn open-source AI avatar generator web app\n\nhttps://github.com/shinework/photoshot\n\n## DeOldify.NET\n\nDeOldify.NET: cross-platform application for coloring black and white photos\"\n\nhttps://github.com/ColorfulSoft/DeOldify.NET\n\n## Queryable\n\nOffline photo search App for iOS\n\nhttps://github.com/mazzzystar/Queryable\n\n## X-AnyLabeling\n\nEffortless data labeling with AI support from Segment Anything and other awesome models. \n\nhttps://github.com/CVHub520/X-AnyLabeling\n\n# :notes: Audio\n\n## clone-voice\n\nA sound cloning tool with a web interface, using your voice or any sound to record audio\n\nhttps://github.com/jianchang512/clone-voice\n\n## tacotron2-tts-GUI\n\nGUI wrapper for synthesize. Allows CPU-only synthesis via a toggleable switch. Portable exe file is available (that runs on CPU only).\n\nhttps://github.com/lokkelvin2/tacotron2-tts-GUI\n\n## riffusion\n\nRiffusion is an app for real-time music generation with stable diffusion.\n\nhttps://github.com/riffusion/riffusion-app\n\n## whisper-ui\n\nWhisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.\n\nThis is a simple Streamlit UI for OpenAI's Whisper speech-to-text model. It let's you download and transcribe media from YouTube videos, playlists, or local files. You can then browse, filter, and search through your saved audio files.\n\nhttps://github.com/hayabhay/whisper-ui for https://github.com/openai/whisper\n\nhttps://grisk.itch.io/whisper-gui\n\n## buzz \n\nGUI for whisper\n\nhttps://github.com/chidiwilliams/buzz\n\n## Ultimate Vocal Remover GUI v5.5.1\n\nThis application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).\n\nhttps://github.com/Anjok07/ultimatevocalremovergui\n\n## SpleeterGUI\n\nSpleeterGUI - Music source separation desktop app\n\nWindows Desktop Front end for Spleeter - AI source separation\n\nhttps://github.com/boy1dr/SpleeterGui\n\n## SoftVC VITS Singing Voice Conversion Fork\n\nhttps://github.com/34j/so-vits-svc-fork\n\n## subsync subtitle alignment\n\nThis is an automatic movie subtitle synchronization tool. Synchronization is done by listening to the audio track, translating it if necessary.\n\nhttps://github.com/sc0ty/subsync/\n\n# 🎞️ Video frame interpolation\n\n## Video Dubbing\n\nhttps://github.com/jianchang512/pyvideotrans\n\n## DAIN-App\n\nThe backbone of this project is a algorithm called DAIN. (https://github.com/baowenbo/DAIN)\n\nhttps://github.com/BurguerJohn/Dain-App\n\nhttps://grisk.itch.io/dain-app?download\n\n## DAIN Vulkan GUI\n\nAI-Powered video interpolater (eg. 30fps -\u003e 60fps) for Vulkan devices. Based on dain-ncnn-vulkan and ffmpeg\n\nhttps://github.com/Mar2ck/DAIN-Vulkan-GUI\n\n## dain-ncnn-vulkan\n\nncnn implementation of DAIN, Depth-Aware Video Frame Interpolation.\n\nhttps://github.com/nihui/dain-ncnn-vulkan\n\n## Dain-App\n\nhttps://github.com/BurguerJohn/Dain-App\n\n## cain-ncnn-vulkan\n\nVideo Frame Interpolation\n\nhttps://github.com/nihui/cain-ncnn-vulkan\n\n## Flowframes\n\nFlowframes is a simple but powerful app that utilizes AI frameworks to interpolate videos in order to increase their framerate with little to no noticable quality loss. (DAIN, CAIN, RIFE)\n\nhttps://github.com/n00mkrad/flowframes\n\nhttps://nmkd.itch.io/flowframes\n\n# 🖼️🎞️ Video upscaler\n\n## video2x\n\nA lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR. Started in Hack the Valley 2, 2018. \nhttps://github.com/k4yt3x/video2x\n\n## https://github.com/Communist02/AnimePlayer\n\n\n## 🎞️ Video Deep Face\n\nhttps://github.com/iperov/DeepFaceLive\n\n# 🖼️ Image/Video upscaling\n\n## Final2x\n\nThis is a powerful tool that allows for image super-resolution to arbitrary sizes using multiple models, designed to enhance the resolution and quality of images, making them clearer and more detailed. Presently, it supports several models, such as RealCUGAN, RealESRGAN, Waifu2x, and SRMD.\n\nhttps://github.com/Tohrusky/Final2x\n\n## Upscayl\nUpscayl is a cross-platform application built with the Linux-first philosophy. This means that we prioritize Linux builds over others but that doesn't mean we'll break things for other OSes :) Makes use of Real-ESRGAN\n\nhttps://github.com/upscayl/upscayl\n\n## Anime4kSharp\n\nhttps://github.com/shadow578/Anime4kSharp\n\n## dandere2x-tremx\n\nvideo compression \nhttps://github.com/Tremeschin/dandere2x-tremx\n\n## Cupscale\n\nImage Upscaling GUI based on ESRGAN - WORK IN PROGRESS\n\nhttps://github.com/n00mkrad/cupscale\n\n## IEU.Winforms\n\nWindows only GUI for ESRGAN with additional features \nhttps://github.com/ptrsuder/IEU.Winforms\n\n## NVIDIA-Ansel-AI-Enhancer\n\nNVIDIA Upscaler, utilizes NVIDIA´s Ansel RTX. Requires latest drivers and a RTX GPU. Free, but not open source. \n\nhttps://github.com/dynamiquel/NVIDIA-Ansel-AI-Enhancer \n\n## RealSR\n\nReal-World Super-Resolution via Kernel Estimation and Noise Injection \n\nhttps://github.com/jixiaozhong/RealSR\n\n(https://github.com/nihui/RealSR)\n\n## realsr-ncnn-vulkan\n\nhttps://github.com/nihui/realsr-ncnn-vulkan\n\n## Shell upscaler\n\nhttps://github.com/n00mkrad/shell-upscaler\n\nActs as a GUI for esrgan-launcher.\n\n## srmd-ncnn-vulkan\n\nhttps://github.com/nihui/srmd-ncnn-vulkan\n\n## VapourSynth-Super-Resolution-Helper\n\nhttps://github.com/AlphaAtlas/VapourSynth-Super-Resolution-Helper\n\n## Waifu2x-Extension-GUI\n\nPhoto/Video/GIF enlargement using machine learning\n\nhttps://github.com/AaronFeng753/Waifu2x-Extension-GUI\n\nC++, python, qt, caffe, ncnn_vulkan\n\nAGPL-3.0 License\n\n## waifu2x-ncnn-vulkan\n\nhttps://github.com/nihui/waifu2x-ncnn-vulkan\n\n# GAN / Style Transfer\n\n## TorchRayLib\n\nTorchRayLib++: A CMake based AI \u0026 generative art platform, integrating the rayib GUI and the PyTorch C++ Deep Learning Library.\n\nhttps://github.com/QuantScientist/TorchRayLib\n\n## more\nhttps://github.com/Seprentes/ScaleUp\nhttps://github.com/HolyWu/vs-realesrgan\nhttps://github.com/tumuyan/RealSR-NCNN-Android\nhttps://github.com/TransparentLC/realesrgan-gui/blob/master/README.en-US.md\nhttps://github.com/Lucchetto/SuperImage\nhttps://github.com/Tohrusky/Final2x\n\n# :toolbox: DL Training GUI\n\n## AIDeveloper\n\nGUI-based software for training, evaluating and applying deep neural nets for image classification \n\nhttps://github.com/maikherbig/AIDeveloper\n\n# 🧩 Ready to use Plugins\n\n## vstSpleeter\n\naudio source separation library vst plugin\n\nhttps://github.com/gvne/vstSpleeter\n\n\n---\n\n\n## various apps (untested)\n\nPaper2GUI is an AI desktop Application toolbox for everyone no matter your background with it being free and ready to use out of the box, already supporting 10+ AI models, covering the fields of speech synthesis, video frame filling, video overscoring, target detection, image stylization, OCR recognition, etc. Supporting Windows, Mac OS, Linux systems.\n\nhttps://github.com/Baiyuetribe/paper2gui/blob/main/README_en.md\n\nFree: https://www.nvidia.com/en-us/studio/canvas/\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnatowi%2Fdeep-learning-applications-with-gui","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fnatowi%2Fdeep-learning-applications-with-gui","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnatowi%2Fdeep-learning-applications-with-gui/lists"}