Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2024-12-29 00:27:45 UTC
- JSON Representation
https://github.com/showlab/VLog
Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
chatgpt langchain large-language-model video-language whisper
Last synced: 06 Nov 2024
https://github.com/yinruiqing/pyannote-whisper
asr chatgpt meeting-summarization pyannote speaker-diarization whisper
Last synced: 27 Dec 2024
https://github.com/dsymbol/decipher
Effortlessly add AI-generated transcription subtitles to your videos
openai transcription translation whisper
Last synced: 28 Dec 2024
https://github.com/dadangdut33/speech-translate
A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
python speech-transcription speech-translation tkinter-python translate whisper
Last synced: 28 Dec 2024
https://github.com/yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
android asr chinese ctranslate2 huggingface lora pytorch speech-recognition transformers web whisper
Last synced: 09 Oct 2024
https://github.com/Dadangdut33/Speech-Translate
A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
python speech-transcription speech-translation tkinter-python translate whisper
Last synced: 20 Nov 2024
https://github.com/nyrahealth/crisperwhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
asr audio detection filler recognition speech speech-processing speech-recognition timestamps transcription verbatim whisper
Last synced: 27 Dec 2024
https://github.com/ai-ng/swift
Fast voice assistant powered by Groq, Cartesia, and Vercel.
artificial-intelligence cartesia groq llama nextjs react vercel whisper
Last synced: 28 Dec 2024
https://github.com/zh-plus/openlrc
Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。
auto-subtitle faster-whisper lyrics lyrics-generator openai-api openlrc python speech-to-text subtitle-translation transcribe voice-to-text whisper
Last synced: 28 Dec 2024
https://github.com/dicklesworthstone/bulk_transcribe_youtube_videos_from_playlist
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
playlists transcription transcripts whisper youtube
Last synced: 28 Dec 2024
https://github.com/woolverine94/biniou
a self-hosted webui for 30+ generative ai
animatediff audiocraft bark controlnet diffusers flux generative-ai gfpgan gradio huggingface insightface ip-adapter kandinsky llama-cpp-python photomaker real-esrgan stable-diffusion stable-diffusion-3 webui whisper
Last synced: 28 Dec 2024
https://github.com/Dicklesworthstone/bulk_transcribe_youtube_videos_from_playlist
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
playlists transcription transcripts whisper youtube
Last synced: 08 Nov 2024
https://github.com/macoron/whisper.unity
Running speech to text model (whisper.cpp) in Unity3d on your local machine.
asr openai speech-recognition speech-to-text stt unity3d whisper
Last synced: 28 Dec 2024
https://github.com/reriiasu/speech-to-text
Real-time transcription using faster-whisper
faster-whisper openai speech-recognition speech-to-text voice-recognition whisper
Last synced: 29 Dec 2024
https://github.com/mybigday/whisper.rn
React Native binding of whisper.cpp.
openai react-native speech-recognition whisper whisper-cpp
Last synced: 27 Dec 2024
https://github.com/substratusai/kubeai
Private Open AI on Kubernetes
ai autoscaler faster-whisper inference-operator k8s kubernetes llm ollama ollama-operator openai-api vllm vllm-operator whisper
Last synced: 28 Dec 2024
https://github.com/seanoliver/audioflare
An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and translate any audio file.
ai cloudflare distilbert llama2 m2m100 openai whisper
Last synced: 23 Dec 2024
https://github.com/toverainc/willow-inference-server
Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS
cuda deep-learning llama llm privacy speech-recognition speech-to-text text-to-speech vicuna webrtc whisper willow
Last synced: 28 Dec 2024
https://github.com/voxos-ai/bolna
End-to-end platform for building voice first multimodal agents
anyscale chatgpt-api claude-3-sonnet deepgram elevenlabs fastapi gpt-4o llama3 llm mistral openai perplexity-api polly telephony twilio voice-assistant websocket-chat websockets whisper xtts
Last synced: 28 Dec 2024
https://github.com/bolna-ai/bolna
End-to-end platform for building voice first multimodal agents
anyscale chatgpt-api claude-3-sonnet deepgram elevenlabs fastapi gpt-4o llama3 llm mistral openai perplexity-api polly telephony twilio voice-assistant websocket-chat websockets whisper xtts
Last synced: 27 Dec 2024
https://github.com/savbell/whisper-writer
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
dictation faster-whisper openai openai-api openai-whisper speech-recognition speech-to-text typing-assistant whisper
Last synced: 29 Dec 2024
https://github.com/chrislemke/ChatFred
Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting, image generation and more.
alfred-workflow alfredapp chatbot chatgpt dall-e2 gpt-3 gpt-4 image-generation openai stable-diffusion whisper
Last synced: 06 Nov 2024
https://github.com/chrislemke/chatfred
Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting, image generation and more.
alfred-workflow alfredapp chatbot chatgpt dall-e2 gpt-3 gpt-4 image-generation openai stable-diffusion whisper
Last synced: 26 Sep 2024
https://github.com/arthurfdlr/whisper-youtube
🔉 Youtube Videos Transcription with OpenAI's Whisper
automatic-speech-recognition colab-notebook speech-recognition speech-to-text transformer whisper youtube
Last synced: 29 Dec 2024
https://github.com/ArthurFDLR/whisper-youtube
🔉 Youtube Videos Transcription with OpenAI's Whisper
automatic-speech-recognition colab-notebook speech-recognition speech-to-text transformer whisper youtube
Last synced: 02 Nov 2024
https://github.com/azkadev/speech_to_text_telegram_bot_dart
Speech To Text Telegram Bot Dart
dart openai speech-to-text telegram whisper whisper-dart
Last synced: 23 Dec 2024
https://github.com/revdotcom/reverb
Open source inference code for Rev's model
asr asr-model canary deeplearning diarization docker huggingface neural-network open-source opensource pyannote rev revai speaker-diarization speech-recognition speech-to-text speechrecognition wenet whisper
Last synced: 29 Dec 2024
https://github.com/rayfernando1337/mlx-auto-subtitled-video-generator
Generate accurate transcripts using Apple's MLX framework
apple mlx transcribe translate whisper
Last synced: 22 Dec 2024
https://github.com/RayFernando1337/MLX-Auto-Subtitled-Video-Generator
Generate accurate transcripts using Apple's MLX framework
apple mlx transcribe translate whisper
Last synced: 27 Dec 2024
https://github.com/shashikg/whispers2t
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
asr deep-learning speech-recognition speech-to-text tensorrt tensorrt-llm vad voice-activity-detection whisper
Last synced: 27 Dec 2024
https://github.com/lspahija/aiui
AIUI is a platform enabling seamless two-way verbal communication with AI.
ai artificial-intelligence chatgpt chatgpt-api conversation conversational-ai gpt gpt-3 gpt-4 machine-learning speech whisper whisper-ai
Last synced: 29 Dec 2024
https://github.com/nikorasu/livewhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
ai assistant chatbot dictation numpy openai openai-whisper python sounddevice speech-recognition speech-to-text terminal text-to-speech transcription translation tts voice voice-assistant voice-recognition whisper
Last synced: 23 Dec 2024
https://github.com/lspahija/AIUI
AIUI is a platform enabling seamless two-way verbal communication with AI.
ai artificial-intelligence chatgpt chatgpt-api conversation conversational-ai gpt gpt-3 gpt-4 machine-learning speech whisper whisper-ai
Last synced: 06 Nov 2024
https://github.com/Nikorasu/LiveWhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
ai assistant chatbot dictation numpy openai openai-whisper python sounddevice speech-recognition speech-to-text terminal text-to-speech transcription translation tts voice voice-assistant voice-recognition whisper
Last synced: 11 Nov 2024
https://github.com/aarnphm/whispercpp
Pybind11 bindings for Whisper.cpp
audio-transcription bazel bentoml mlops-workflow nix pybind11 python3 whisper whisper-cpp
Last synced: 29 Dec 2024
https://github.com/azkadev/general_ai
GENERAL Ai Library For DART & Flutter
ai artificial-intelligence azkadev dart deep-learning flutter ggml library machine-learning ml piper stable-diffusion whisper
Last synced: 23 Dec 2024
https://github.com/lablab-ai/whisper-transcription_and_diarization-speaker-identification-
How to use OpenAIs Whisper to transcribe and diarize audio files
Last synced: 24 Dec 2024
https://github.com/shashikg/WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
asr deep-learning speech-recognition speech-to-text tensorrt tensorrt-llm vad voice-activity-detection whisper
Last synced: 14 Nov 2024
https://github.com/yohasebe/openai-chat-api-workflow
🎩 An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT-4o 🤖💬 It also allows image generation 🖼️, image understanding 👀, speech-to-text conversion 🎤, and text-to-speech synthesis 🔈
ai alfred chatbot dall-e gpt image-generation image-understanding openai speech-to-text text-to-speech whisper workflow
Last synced: 24 Dec 2024
https://github.com/andraxdev/speak-gpt
Your personal voice assistant based on OpenAI ChatGPT.
android assistant assistant-chat-bots chatbot chatgpt chatgpt-client dall-e gemini google-assistant gpt gpt-4o gpt-vision kotlin kotlin-android llama mobile openai openai-api voice-assistant whisper
Last synced: 22 Dec 2024
https://github.com/etienneab3d/whisperhallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
asr audio-processing noise-removal sound-processing text-to-speech vad vocals whisper
Last synced: 25 Dec 2024
https://github.com/carleslc/audiototext
Transcribe and translate audio to text using Whisper and DeepL.
audio audio-processing captions colab-notebook deepl ffmpeg google-colab jupyter-notebook language openai-whisper python speech-to-text subtitles text transcribe transcription translate translation whisper whisper-api
Last synced: 24 Dec 2024
https://github.com/developersdigest/ai-devices
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
function-calling gpt-4-vision groq langchain langsmith llama3 llava llm openai serper tts whisper
Last synced: 22 Dec 2024
https://github.com/bbc-esq/vectordb-plugin-for-lm-studio
Plugin that lets you ask questions about your documents including audio and video files.
bark database-management embedding-models embedding-vectors embeddings gtts koboldai koboldcpp python rag retrieval-augmented-generation retrieval-chatbot tiledb vector-data-management vector-database vector-search vision whisper whispers2t whisperspeech
Last synced: 22 Dec 2024
https://github.com/uruworks/terosubtitler
Tero Subtitler is an open source, cross-platform, and free subtitle editing software.
ai audio-to-text blu-ray captions editor ffmpeg free linux macos mpv open-source smpte subtitle-editor subtitler subtitles tero transcription whisper windows yt-dlp
Last synced: 24 Dec 2024
https://github.com/kabanosk/whisper-website
Simple web application, which can be used to convert audio to subtitles by OpenAI's Whisper model
audio-to-text fastapi hacktoberfest open-source openai python3 speech-to-text subtitles subtitles-generator uvicorn website whisper
Last synced: 23 Dec 2024
https://github.com/Carleslc/AudioToText
Transcribe and translate audio to text using Whisper and DeepL.
audio audio-processing captions colab-notebook deepl ffmpeg google-colab jupyter-notebook language openai-whisper python speech-to-text subtitles text transcribe transcription translate translation whisper whisper-api
Last synced: 07 Nov 2024
https://github.com/gtreshchev/runtimespeechrecognizer
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
audio-processing openai speech-detection speech-processing speech-recognition speech-to-text ue4 ue4-plugin ue5 ue5-plugin unreal-engine unreal-engine-4 unreal-engine-5 voice-recognition whis whisper whisper-ai whisper-cpp
Last synced: 25 Dec 2024
https://github.com/gtreshchev/RuntimeSpeechRecognizer
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
audio-processing openai speech-detection speech-processing speech-recognition speech-to-text ue4 ue4-plugin ue5 ue5-plugin unreal-engine unreal-engine-4 unreal-engine-5 voice-recognition whis whisper whisper-ai whisper-cpp
Last synced: 06 Nov 2024
https://github.com/stage-whisper/stage-whisper
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
ai-transcription audio-transcription electron-app hacktoberfest journalism openai openai-whisper whisper
Last synced: 24 Dec 2024
https://github.com/ioanmo226/chatgpt-web-application
A web application that allows users to interact with various OpenAI's models through a simple and user-friendly interface.
ai audio-text chatgpt chatgpt-clone dalle dalle2 davinci-003 express gpt3 highlight-js image-generation markdown-to-html openai whisper
Last synced: 23 Dec 2024
https://github.com/Stage-Whisper/Stage-Whisper
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
ai-transcription audio-transcription electron-app hacktoberfest journalism openai openai-whisper whisper
Last synced: 25 Nov 2024
https://github.com/URUWorks/TeroSubtitler
Tero Subtitler is an open source, cross-platform, and free subtitle editing software.
ai audio-to-text blu-ray captions editor ffmpeg free linux macos mpv open-source smpte subtitle-editor subtitler subtitles tero transcription whisper windows yt-dlp
Last synced: 05 Nov 2024
https://github.com/Kabanosk/whisper-website
Simple web application, which can be used to convert audio to subtitles by OpenAI's Whisper model
audio-to-text fastapi hacktoberfest open-source openai python3 speech-to-text subtitles subtitles-generator uvicorn website whisper
Last synced: 05 Nov 2024
https://github.com/ariym/whisper-node
Node.js bindings for OpenAI's Whisper. (C++ CPU version by ggerganov)
ai cpp ffmpeg ml nodejs openai typescript whisper
Last synced: 28 Dec 2024
https://github.com/matteofasulo/whisper-tiktok
From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read aloud and OpenAI Whisper model
edge-tts ffmpeg mkdocs-material python text-to-speech tiktok whisper
Last synced: 25 Dec 2024
https://github.com/bhattbhavesh91/voice-assistant-whisper-chatgpt
This repository will guide you to create your own Smart Virtual Assistant like Google Assistant using Open AI's ChatGPT, Whisper. The entire solution is created using Python & Gradio.
chatgpt chatgpt-api google-assistant gpt-3 gradio huggingface language-model language-models openapi virtual-assistant voice-assistant whisper
Last synced: 25 Dec 2024
https://github.com/promptslab/llmtuner
FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)
fine-tuning fine-tuning-llm finetune finetune-gpt finetune-llama finetune-llm finetune-llms finetune-whisper finetunechatgpt finetuning finetuning-large-language-models finetuning-rl llm llm-framework llm-inference llm-training llmops llmtuner whisper whisper-finetune
Last synced: 25 Dec 2024
https://github.com/microsoft/ai-dev-gallery
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.
ai csharp developer-tools directml dotnet genai mistral npu onnx onnxruntime onnxruntime-genai phi3 qnn stable-diffusion visual-studio whisper winappsdk windows winui3 wpf
Last synced: 28 Dec 2024
https://github.com/xf00f/web3x
Ethereum TypeScript Client Library - for perfect types and tiny builds.
api ethereum javascript swarm typescript web3 web3js whisper
Last synced: 22 Dec 2024
https://github.com/nikdanilov/whisper-obsidian-plugin
Speech-to-text in Obsidian using OpenAI Whisper
obsidian openai-whisper speech-to-text stt transcribe voice whisper
Last synced: 04 Dec 2024
https://github.com/Robitx/gp.nvim
Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI]
ai chatgpt codeium copilot cursor gpt gpt-4 gpt4 llm lua neovim nvim openai plugin speech-to-text tabnine vim voice whisper
Last synced: 26 Oct 2024
https://github.com/pszemraj/vid2cleantxt
Python API & command-line tool to easily transcribe speech-based video files into clean text
audio audio-processing keyword keyword-extraction nlp python sentence sentence-boundary-detection speech speech-recognition speech-to-text spelling-correction transcription transformer video video-processing video-summarisation video-summarization wav2vec2 whisper
Last synced: 22 Dec 2024
https://github.com/jim60105/docker-whisperx
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
asr docker-image dockerfile speech speech-recognition speech-to-text whisper
Last synced: 27 Dec 2024
https://github.com/josefalbers/whisper-turbo-mlx
Blazing fast whisper turbo for ASR (speech-to-text) tasks
asr deep-learning mlx speech-recognition speech-to-text whisper whisper-turbo
Last synced: 23 Dec 2024
https://github.com/jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
asr docker-image dockerfile speech speech-recognition speech-to-text whisper
Last synced: 05 Nov 2024
https://github.com/pluja/web-whisper
OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
ai audio docker frontend go openai self-hosting speech text transcription translation web whisper
Last synced: 08 Nov 2024
https://github.com/gaborvecsei/whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
ai applied-machine-learning gradio machine-learning python speach-to-text stt whisper
Last synced: 21 Dec 2024
https://github.com/arihanv/Shush
Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app
flash-attention-2 huggingface-transformers machine-learning modal shadcn-ui transcription whisper
Last synced: 30 Nov 2024
https://github.com/supershaneski/openai-whisper
A sample web app using OpenAI Whisper to transcribe audio built on Next.js. It records audio continuously for some time interval then uploads the audio data to the server for transcribing/translating.
nextjs openai openai-whisper reactjs whisper
Last synced: 24 Oct 2024
https://github.com/zhuzilin/whisper-openvino
openvino version of openai/whisper
Last synced: 02 Nov 2024
https://github.com/locaal-ai/obs-cleanstream
CleanStream is an OBS plugin that uses AI to clean live audio streams from unwanted words and utterances
ai obs obs-plugin obs-studio obs-studio-plugin plugin profanity-blocking profanity-detection profanity-filter profanity-filtering real-time-filter real-time-transcription realtime-detection realtime-transcribe speech-to-text transcription whisper
Last synced: 29 Dec 2024
https://github.com/IgnoranceAI/hugh
A voice-powered AI built with Whisper, ChatGPT, and ElevenLabs
chatgpt elevenlabs flask whisper
Last synced: 27 Oct 2024
https://github.com/geekodour/wscribe
ez audio transcription tool with flexible processing and post-processing options
audio-processing transcription whisper
Last synced: 24 Dec 2024
https://github.com/ieasybooks/tafrigh
تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.
asr automatic-speech-recognition ctranslate2 facebook faster-whisper javascript python soundcloud srt stable-whisper subtitles twitter vtt whisper youtube
Last synced: 25 Dec 2024
https://github.com/etienneab3d/whispertimesync
Synchronize Whisper's timestamps over an existing accurate transcription
aligner asr nlp subtitles text-to-speech whisper
Last synced: 19 Nov 2024
https://github.com/Illyism/openai-whisper-api
OpenAI Whisper API based on Node.js / Bun.sh in a Docker Container + Google Cloud Run Example
chatgpt openai openai-whisper whisper
Last synced: 14 Nov 2024
https://github.com/noco-ai/spellbook-docker
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models
automatic-speech-recognition bark llama2 llm-inference mixtral musicgeneration stable-diffusion text-to-speech whisper xttsv2
Last synced: 18 Nov 2024
https://github.com/Cledev-Limited/Cledev.OpenAI
.NET 7 SDK for OpenAI with a Blazor Server playground
azureopenai blazor blazor-server chat-gpt chatgpt chatgpt-4 chatgpt-api dall-e dontnet-core dotnet gpt-3 gpt3 net7 openai openai-api sdk sdk-dotnet tokenizer whisper whisper-ai
Last synced: 18 Nov 2024
https://github.com/illyism/openai-whisper-api
OpenAI Whisper API based on Node.js / Bun.sh in a Docker Container + Google Cloud Run Example
chatgpt openai openai-whisper whisper
Last synced: 27 Oct 2024
https://github.com/bits-by-brandon/whisper-ui
A GUI interface for Open AI Whisper based on Tauri and Sveltekit
rust speech-to-text svelte tauri whisper
Last synced: 09 Nov 2024
https://github.com/johniwasz/whetstone.chatgpt
A simple light-weight library that wraps the Open AI API.
chatgpt dotnet dotnet-standard2 dotnet-standard2-1 gpt-3 gpt-35-turbo gpt-4 openai whisper whisper-ai
Last synced: 24 Dec 2024
https://github.com/aadeshkulkarni/sanchay-ai
Takes your video and generates video title, description, hashtags, transcription, subtitles and more.
generative-ai javascript object-store python rabbitmq whisper
Last synced: 14 Nov 2024
https://github.com/pinto0309/whisper-onnx-cpu
ONNX implementation of Whisper. PyTorch free.
Last synced: 24 Dec 2024
https://github.com/nalbion/whisper-server
streaming speech to text server using Whisper
Last synced: 19 Dec 2024
https://github.com/mutablelogic/go-whisper
Speech-to-Text in golang
golang speech-recognition speech-to-text whisper
Last synced: 24 Dec 2024
https://github.com/piotrkawa/deepfake-whisper-features
Implementation of the paper "Improved DeepFake Detection Using Whisper Features"
audio-deepfake-detection deep-learning deepfake-detection paper-implementations whisper
Last synced: 24 Oct 2024
https://github.com/Woolverine94/biniou
a self-hosted webui for 30+ generative ai
audiogen bark controlnet diffusers generative-ai gfpgan gradio huggingface insightface ip-adapter kandinsky llama-cpp-python musicgen photomaker pix2pix real-esrgan stable-diffusion stable-video-diffusion webui whisper
Last synced: 29 Oct 2024
https://github.com/askrella/speech-rest-api
Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)
artificial-intelligence openai python3 speech-recognition speech-to-text text-to-speech whisper whisper-ai
Last synced: 06 Nov 2024