Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/fleschutz/talk2windows
Control your Windows desktop by voice commands.
https://github.com/fleschutz/talk2windows
powershell serenade voice voice-assistant voice-commands voice-control voice-recognition
Last synced: 3 months ago
JSON representation
Control your Windows desktop by voice commands.
- Host: GitHub
- URL: https://github.com/fleschutz/talk2windows
- Owner: fleschutz
- License: cc0-1.0
- Created: 2021-12-13T17:44:30.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2024-11-07T07:19:07.000Z (3 months ago)
- Last Synced: 2024-11-07T08:19:17.933Z (3 months ago)
- Topics: powershell, serenade, voice, voice-assistant, voice-commands, voice-control, voice-recognition
- Language: PowerShell
- Homepage:
- Size: 28.1 MB
- Stars: 35
- Watchers: 3
- Forks: 6
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
Talk2Windows
============
This repo lets you control your *Windows*ยฎ desktop by voice. Using keyboard, mouse, and also voice commands is intuitive, increases your productivity, and is really awesome.Required is *Windows* 10 or 11, a headset, and the free *Serenade* voice recognition application.
๐ง Installation
----------------
1. **Download and install** the free *Serenade* app from [https://serenade.ai](https://serenade.ai/download). During installation choose *no plugin* and *no programming language*, then go to *Settings > Server* and choose the 'Local' endpoint (good for privacy and speed). Close the application when finished.
2. **Download and unzip** this repo from [https://github.com/fleschutz/talk2windows/archive/main.zip](https://github.com/fleschutz/talk2windows/archive/main.zip).
3. **Open 'Windows PowerShell (Administrator)'** and execute: `Set-ExecutionPolicy RemoteSigned` to allow the execution of PowerShell scripts.
4. **Open 'Windows PowerShell'** and execute: `./setup.ps1` in the unzipped *talk2windows* folder. It exports all voice commands with your personal wake word to *Serenade*. In the following we will use the wake word **Windows**.Usage
-----
1. Put your headset on and check that it's working.
2. Launch *Serenade* and click the slider to switch from *Paused* to *Listening* mode. Raise the mic audio level in case a *'low voice'* is shown.
3. Say: **"*Windows, open calculator*"** with a calm voice into the mic. This voice command executes the PowerShell script `scripts/open-calculator.ps1` which launches the calculator app.Most voice commands follow the generic scheme "*<wake word>, <verb> <subject>*" such as: "Windows, open calculator". Supported are the following voice commands:
๐ *"Windows, <greeting>"*
---------------------------
Say hello by replacing <greeting> with: `good afternoon`, `good evening`, `good morning`, `good night`, `hello`, `hey`, `hi`, `howdy`, `morning`, or `nice to meet you`.๐ง *"Windows, install <app>"*
-----------------------------
Install a Microsoft Store application by replacing <app> with: `7-Zip`, `CrystalDiskInfo`, `CrystalDiskMark`, `Discord`, `Disney+`, `Edge`, `Firefox`, `Google Chrome`, `IrfanView`, `Mp3tag`, `Netflix`, `Opera Browser`, `Opera GX`, `One Calendar`, `Quick Share`, `Rufus`, `Spotify`, `TikTok`, `Twitter`, `Visual Studio Code`, `VLC`, `Windows Terminal`, or `WhatsApp`. To uninstall it simply say: **"Windows, uninstall <app>"**.๐ป *"Windows, open <app>"*
----------------------------
Launch an installed application by replacing <app> with: `3D-Viewer`, `7-Zip`, `Calculator`, `Character Map`, `CrystalDiskInfo`, `CrystalDiskMark`, `Discord`, `Disney+`, `Edge`, `Firefox`, `Git Extensions`, `Google Chrome`, `Magnifier`, `Microsoft Paint`, `Microsoft Store`, `Mp3tag`, `Netflix`, `Notepad`, `OBS Studio`, `One Calendar`, `One Note`, `Outlook`, `Quick Share`, `Remote Desktop`, `Screen Clip`, `Spotify`, `System Information`, `Thunderbird`, `Visual Studio`, `VLC`, `voice recorder`, `Windows Terminal`, or `WhatsApp`. To close it simply say: **"Windows, close <app>"**.๐ *"Windows, open <name> website"*
------------------------------------
Launch the default Web browser (or a new tab) with the given website. Replace <name> with: `Amazon`, `Apple`, `Baidu`, `BBC`, `Bing`, `BitBucket`, `BitBucket status`, `CDC`, `CIA`, `CNN`, `Discord`, `DistroWatch`, `Docker`, `Docker status`, `Dropbox`, `eBay`, `Facebook`, `FBI`, `FlightRadar`, `Flipboard`, `FourSquare`, `FRITZ!Box`, `FRITZ!Repeater`, `GitHub`, `GliderTracker`, `HolidayCheck`, `HRworks`, `Instagram`, `IPFS`, `Microsoft`, `NASA`, `NBC`, `Netflix`, `Notepad`, `Outdoor Active`, `PayPal`, `Pinterest`, `Pixabay`, `Plex`, `Serenade`, `Slashdot`, `Snap Store`, `Space Weather`, `SpaceX`, `Spotify`, `Starbucks`, `Tesla`, `TikTok`, `Toggl`, `Topo Map`, `Twitter`, `UFA`, `Unsplash`, `Walmart`, `WhatsApp`, `White House`, `Windy`, `Wikipedia`, `Wolfram Alpha`, `World News`, `Yahoo`, `YouTube`, or `Zoom`. When finished say: **"Windows, close tab"**.๐ฆ *"Windows, check <item>"*
-----------------------------
Let the computer check something by replacing <item> with: `audio`, `Bitcoin rate`, `CPU temperature`, `dawn`, `day`, `DNS`, `drives`, `dusk`, `Earth` (fun), `Ether rate`, `for crash dumps`, `my downloads`, `for rain`, `for snow`, `Git version`, `headlines`, `internet connection`, `internet speed`, `moon phase`, `month`, `New Year`, `operating system`, `Outlook`, `ping latency`, `recycle bin`, `Santa`, `swap space`, `temperature`, `Tether rate`, `time zone`, `up-time`, `VPN`, `weather`, `week`, `the wind`, `year`, or `zenith`.โ *"Windows, <shortcut>"*
-------------------------
Press a keyboard shortcut to navigate in a browser window. Replace <shortcut> with: `close tab`, `last tab`, `next page`, `next tab`, `previous page`, `previous tab`, `open new tab`, `open new window`, `reload page`, `scroll down`, `scroll to bottom`, `scroll to top`, `scroll up`, `tab 1`, `tab 2`, `tab 3`, `tab 4`, `tab 5`, `tab 6`, `zoom in`, or `zoom out`.๐ *"Windows, insert <char> sign"*
---------------------------------------
Insert the given character at the current text cursor position. Just replace <char> with: `at`, `backslash`, `comma`, `copyright`, `Dollar`, `Euro`, `hashtag`, `minus`, `number`, `percent`, `pipe`, `plus`, `Pound`, `slash`, `trademark`, `underscore`, or `Yen`.๐ *"Windows, insert <text>"*
---------------------------------
Insert text at the current text cursor position. Replace <text> with: `bomb`, `bye bye`, `call?`, `checkmark`, `clapping`, `date`, `good-bye`, `good morning`, `heart`, `OK`, `shrugging`, `smiley`, or `thumbs up`.๐ *"Windows, open Google <service>"*
-------------------------------------
Launch the default Web browser (or a new tab) showing the given Googleยฎ service. Replace <service> with: `Alerts`, `Blogger`, `Books`, `Calendar`, `Charts`, `Contacts`, `Docs`, `Drawings`, `Earth`, `Flights`, `Gemini`, `Keep`, `Mail`, `Maps`, `Mars`, `News`, `Notes`, `Photos`, `Play`, `Search`, `Sheets`, `Shopping list`, `Sky`, `Slides`, `Stadia`, `Tasks`, or `Translate`. When finished say: **"Windows, close tab"**.๐ *"Windows, open <name> magazine"*
-------------------------------------
Launches the default Web browser (or a new tab) showing the given magazine website. Replace <name> with: `Cosmopolitan`, `Focus`, `Golf`, `GQ`, `Health`, `National Geographic`, `People`, `RollingStone`, `Scientific American`, `Spiegel`, `Sports Illustrated`, `SZ`, `The Oprah`, `Time`, `Vanity Fair`, `Vogue`, or `Wired`. When finished say: **"Windows, close tab"**.๐ *"Windows, open <name> manual"*
-----------------------------------
Launches the default Web browser (or a new tab) with the given manual. Replace <name> with: `Ant`, `Apple`, `Audacity`, `Azure`, `Bash`, `Blender`, `Chrome`, `Edge`, `Firefox`, `GCC`, `Git`, `Jenkins`, `MSBuild`, `OBS Studio`, `PowerShell`, `Serenade`, `SystemRescue`, `Toyota`, `Vim`, `Voice` *(yes, this page!)*, `Volkswagen`, or `Windows`. When finished say: **"Windows, close tab"**.๐ฆ *"Windows, show <name> city"*
---------------------------------
Launches the default Web browser (or a new tab) showing the given city in *Google Maps*. Replace <name> with the name of the city. When finished say: **"Windows, close tab"**.๐ *"Windows, show <name> rate"*
----------------------------------
Launches the default Web browser (or a new tab) showing the given exchange rate. Replace <name> with: `Bitcoin`, `Cardano`, `Ether`, `Euro`, `NFT collections`, `Solana`, `Tether`, or `US dollar`. When finished say: **"Windows, close tab"**.๐ต *"Windows, play <genre> music"*
-----------------------------------
Launches the default Web browser (or a new tab) and plays a music genre. Replace <genre> with: `70s`, `80s`, `90s`, `Alternative Rock`, `Blues`, `Chillout`, `Christmas`, `Classical`, `Country`, `Dance`, `Folk`, `Hip Hop`, `House`, `Indie`, `Jazz`, `Metal`, `Pop`,`Reggae`, `Relax`, `RnB`, `Rock`, or `Workout`. When finished say: **"Windows, close tab"**.๐ *"Windows, play <name> sound"*
---------------------------------
Starts the playback of the given sound, just replace <name> with: `alarm`, `bee`, `beep`, `Big Ben`, `car`, `cat`, `cow`, `dog`, `donkey`, `elephant`, `elk`, `frog`, `goat`, `gorilla`, `horse`, `lion`, `parrot`, `pig`, `rattlesnake`, `vulture`, or `wolf`.๐ฎ *"Windows, let's play <game>"*
---------------------------------
Launches the default Web browser (or a new tab) and lets you play a free browser game. Replace <game> with: `2048`, `Agar`, `Catan Universe`, `Chess`, `Cube`, `ISS Docking Simulator`, `Pacman`, `Santa Tracker`, `Slither`, `Tetris`, `TicTacToe`, `Tower`, `Vanis`, `War Brokers`, or `Wordle`. When finished say: **"Windows, close tab"**.๐จ *"Windows, next <category> wallpaper"*
------------------------------------------
Replaces the desktop background by a random photo from Unsplash.com. Replace <category> with: `Animal`, `Beach`, `Car`, `Christmas`, `City`, `Desert`, `Fantasy`, `Military`, `Movie`, `Music`, `Nature`, `Plane`, `Random`, `Space`, `Sports`, or `Winter`.โฐ *"Windows, remind me <time>"*
---------------------------------
Sets a reminder, just replace <time> with: `at # AM`, `at # PM`, `at midnight`, `at noon`, `at sunrise`, `at sunset`, `at tea time`, `in # minutes`, or `in # hours`.๐ฝ *"Windows, open <letter> drive"*
------------------------------------
Launches the *File Explorer* showing the given drive. Replace <letter> with: `C:`, `D:`, `E:`, `F:`, `G:`, `H:` or `M:`. When finished say: **"Windows, close file explorer"**.๐ *"Windows, open <name> folder"*
-----------------------------------
Launches the File Explorer with the given folder. Replace <name> with: `apps`, `applications`, `autostart`, `crash dumps`, `desktop`, `documents`, `downloads`, `Dropbox`, `home`, `music`, `OneDrive`, `pictures`, `recycle bin`, `repos`, `SSH`, `Talk2Windows`, `temporary`, `videos`, or `Windows`. When finished say: **"Windows, close file explorer"**.โ๏ธ *"Windows, open <category> settings"*
-------------------------------------
Open the system settings with the given category - just replace <category> with: `activation`, `apps`, `background`, `backup`, `bluetooth`, `color`, `date`, `default apps`, `developer`, `display`, `ethernet`, `lockscreen`, `maps`, `personal`, `printer`, `proxy`, `recovery`, `speech`, `start`, `system`, `taskbar`, `themes`, `time`, `update`, `USB`, `VPN`, `Wifi`, or `Windows`. When finished say: **"Windows, close system settings"**.โ *"Windows, list <table>"*
---------------------------
Shows the given table, just replace <table> with: `bluetooth devices`, `city weather`, `earthquakes`, `emojis`, `environment variables`, `headlines`, `installed apps`, `installed software`, `installed voices`, `network adapters`, `printers`, `running processes`, `services`, `system info`, `tasks`, `time zones`, or `user groups`.๐ง *"Windows, <action> computer"*
---------------------------------
Performs the given action to the computer, replace <action> with: `hibernate`, `lock`, `reboot`, `shut down`, or `suspend`.โ *"Windows, <question>"*
--------------------------
Asks Windows a question, just replace <question> with: `are you a computer?`, `are you here?`, `are you listening?`, `are you ready?`, `are you there?`, `are you with me?`, `can you talk?`, `do you listen?`, `how are you?`, `how do you do?`, `how much daylight today?`, `how old are you?`, `what about ...?`, `what is [abbreviation]`, `what's the date today?`, `what's the temperature?`, `what's the time?`, `what time is it?`, `when is Christmas?`, `when is easter?`, `when is midnight?`, `when is noon?`, `when is President's day?`, `when is sunrise?`, `when is sunset?`, `when is tea time?`, `where am I?`, `where are you?`, `where is the ISS`, `who is your father?`, or `you can talk?`.๐ญ *"Windows, <command>"*
-------------------------
Lets Windows execute a command, replace <command> with: `calculate [expression]`, `connect VPN`, `count down from [number]`, `empty recycle bin`, `enable god mode`, `I need help`, `locate my phone`, `minimize all windows`, `ping [host]`, `repeat last reply`, `roll a dice`, `save screenshot`, `set volume to [number] percent`, `shut up`, `sorry`, `spell [text]`, `tell me a joke`, `tell me a quote`, `thank you`, `thanks`, `translate [text] to [language]`, `turn volume off`, `turn volume on`, `turn volume up`, `turn volume fully up`, `turn volume down`, `update repository`, or `wakeup [hostname]`.๐ *"Windows, <farewell>"*
---------------------------
Say good-bye to your computer by replacing <farewell> with: `bye`, `bye-bye`, `ciao`, `good-bye`, `I'll be back`, `see you`, or `see you later`.๐ง Feedback
------------
Send your email feedback to: markus.fleschutz [at] gmail.com๐ค License, Copyright, and Trademarks
--------------------------------------
This open source project is licensed under the CC0 license. All trademarks are the property of their respective owners.