https://github.com/jamditis/audiobash
Voice-controlled terminal for developers. Speak commands, execute instantly.
https://github.com/jamditis/audiobash
ai audio bash claude-code cli commandline commands gemini speech terminal tts voice-to-text vscode vtt
Last synced: about 1 month ago
JSON representation
Voice-controlled terminal for developers. Speak commands, execute instantly.
- Host: GitHub
- URL: https://github.com/jamditis/audiobash
- Owner: jamditis
- License: mit
- Created: 2025-12-11T04:05:56.000Z (2 months ago)
- Default Branch: master
- Last Pushed: 2026-01-09T00:35:56.000Z (about 1 month ago)
- Last Synced: 2026-01-09T03:34:41.777Z (about 1 month ago)
- Topics: ai, audio, bash, claude-code, cli, commandline, commands, gemini, speech, terminal, tts, voice-to-text, vscode, vtt
- Language: TypeScript
- Homepage: https://jamditis.github.io/audiobash/
- Size: 18.7 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Security: SECURITY_REVIEW.md
Awesome Lists containing this project
README
# AudioBash
[](https://opensource.org/licenses/MIT)
[](https://github.com/jamditis/audiobash/releases)
[](https://github.com/jamditis/audiobash/stargazers)
[](https://github.com/jamditis/audiobash/actions/workflows/build.yml)
[](https://github.com/jamditis/audiobash/releases)
[](https://github.com/jamditis/audiobash/releases)
[](https://audiobash.app)

A voice-controlled terminal for developers. Speak commands, execute them instantly.
**Website:** [audiobash.app](https://audiobash.app) | **Remote control:** [remote.audiobash.app](https://remote.audiobash.app)
## Features
- **Voice-to-terminal** - Speak naturally and have your words transcribed directly into the terminal
- **Context-aware agent mode** - AI understands your environment: current directory, recent output, errors
- **Custom instructions** - Add personal instructions for transcription and agent modes
- **Custom vocabulary** - Map spoken words to correct spellings (e.g., "next js" → "Next.js")
- **CLI notifications** - Audio chime when CLI tools request input/approval
- **Split view** - View up to 4 terminals simultaneously with 5 layout modes (single, horizontal, vertical, 2x2 grid, 1+2)
- **Resizable panes** - Drag dividers to resize terminal panes in split view
- **Focus indicator** - Voice badge shows which terminal receives voice commands in split mode
- **Tab rename** - Double-click tab names to customize them
- **Multi-tab support** - Run up to 4 terminal sessions simultaneously
- **Quick directory navigation** - Jump to recent or favorite folders with one click
- **Multiple AI providers** - Gemini 2.0/2.5 Flash, OpenAI Whisper, Claude, or ElevenLabs
- **Auto-copy** - Selected text is automatically copied to clipboard
- **Always-on-top mode** - Pin the voice panel while you work
- **System tray** - Runs quietly in background, accessible via global shortcuts
- **Extensive keyboard shortcuts** - 16 customizable shortcuts for power users
- **Preview pane** - Embedded web preview for localhost dev servers, HTML, images, and markdown
- **Screenshot capture** - Take screenshots of the preview pane (saves to current working directory)
- **Auto-refresh** - File watcher automatically refreshes preview when source files change
## Installation
### Windows
Download the latest `.exe` installer from [Releases](https://github.com/jamditis/audiobash/releases).
### macOS
Download the `.dmg` from [Releases](https://github.com/jamditis/audiobash/releases), or build locally:
```bash
git clone https://github.com/jamditis/audiobash.git
cd audiobash
npm install
npm run electron:build:mac:arm64 # Apple Silicon (M1/M2/M3/M4)
npm run electron:build:mac:x64 # Intel Macs
```
**Note:** The macOS build is unsigned. On first launch, right-click the app and select "Open" to bypass Gatekeeper. See [macOS build guide](docs/MACOS_BUILD.md) for details.
### Build from source (any platform)
```bash
git clone https://github.com/jamditis/audiobash.git
cd audiobash
npm install
npm run electron:build
```
## Usage
1. **Launch AudioBash** - The app starts with your default shell (PowerShell on Windows, zsh on macOS)
2. **Press Alt+S** (or Option+S on Mac) - Opens the voice panel and starts recording
3. **Speak your command** - e.g., "list all files in the current directory"
4. **Press Alt+S again** - Transcription is sent to the terminal
### Keyboard shortcuts
#### Voice
| Shortcut | Action |
|----------|--------|
| `Alt+S` | Start/stop voice recording |
| `Alt+A` | Cancel recording (abort without sending) |
| `Alt+M` | Toggle raw/agent mode |
| `Alt+R` | Resend last transcription |
#### Window
| Shortcut | Action |
|----------|--------|
| `Alt+H` | Show/hide window |
| `Alt+L` | Cycle through layouts |
#### Terminal
| Shortcut | Action |
|----------|--------|
| `Alt+C` | Clear terminal |
| `Alt+→` | Focus next pane |
| `Alt+←` | Focus previous pane |
| `Alt+B` | Bookmark current directory |
#### Tabs
| Shortcut | Action |
|----------|--------|
| `Alt+1-4` | Switch to tab 1-4 |
#### Preview
| Shortcut | Action |
|----------|--------|
| `Alt+P` | Toggle preview pane |
| `Alt+Shift+P` | Capture screenshot |
### Voice panel modes
- **Auto-send** - Automatically execute transcribed commands (toggle in settings)
- **Pin mode** - Keep voice panel open while working
## Configuration
Open Settings (gear icon in title bar) to configure:
- **API Keys** - Add keys for Gemini, OpenAI, Anthropic, or ElevenLabs
- **Transcription model** - Choose between cloud or local transcription
- **Default shell** - PowerShell, CMD, or Bash
- **Visual theme** - Select from multiple color schemes
- **Scanlines** - Enable retro CRT effect
### Getting API keys
- **Gemini** (recommended): [Google AI Studio](https://aistudio.google.com/app/apikey)
- **OpenAI**: [OpenAI Platform](https://platform.openai.com/api-keys)
- **Anthropic**: [Anthropic Console](https://console.anthropic.com/settings/keys)
## Tech stack
- **Electron** - Desktop application framework
- **React 19** - UI framework
- **TypeScript** - Type-safe JavaScript
- **xterm.js** - Terminal emulator
- **node-pty** - Pseudoterminal bindings
- **Tailwind CSS** - Styling
- **Vite** - Build tool
## Development
```bash
# Install dependencies
npm install
# Run in development mode
npm run electron:dev
# Build for production
npm run electron:build
```
## License
MIT
## Author
Joe Amditis ([@jamditis](https://github.com/jamditis))