An open API service indexing awesome lists of open source software.

https://github.com/dortanes/gemini-desktop

Desktop application for Gemini AI. Based on Electron
https://github.com/dortanes/gemini-desktop

ai desktop-app electron gemini gemini-ai

Last synced: 11 months ago
JSON representation

Desktop application for Gemini AI. Based on Electron

Awesome Lists containing this project

README

          

# ๐Ÿค– Gemini AI Desktop

**Desktop application for Google's Gemini AI**

[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT) [![GitHub release](https://img.shields.io/github/release/dortanes/gemini-desktop.svg)](https://github.com/dortanes/gemini-desktop/releases) [![Downloads](https://img.shields.io/github/downloads/dortanes/gemini-desktop/total.svg)](https://github.com/dortanes/gemini-desktop/releases)

---

*Transform your Gemini AI experience with a desktop application that brings Google's powerful AI assistant directly to your desktop with enhanced features and seamless integration.*

## โš ๏ธ Disclaimer

**This project is not affiliated with, endorsed by, or sponsored by Google LLC or the Gemini AI team.** This is an independent, open-source desktop application that provides a wrapper interface for accessing Google's Gemini AI service through their official website.

- **Gemini AI** is a trademark of Google LLC
- This application simply loads the official Gemini website (gemini.google.com) in a desktop wrapper
- All AI functionality is provided by Google's servers
- No data is collected or stored by this application
- Users must comply with Google's Terms of Service when using Gemini AI

## โœจ Features

- **๐Ÿ–ฅ๏ธ Native Desktop Experience** - Full-featured Electron app with native OS integration
- **๐Ÿ”„ Dual Window Modes** - Switch between full window and drawer
- **โŒจ๏ธ Global Hotkeys** - Customizable keyboard shortcuts for instant access
- **๐ŸŽจ System Tray Integration** - Quick access from your system tray
- **๐ŸŽค Voice Mode** - Hands-free voice interaction with automatic speech-to-text and text-to-speech

## ๐Ÿ“ฅ Installation

### ๐Ÿ“ฆ Pre-built Releases (Recommended)

Download the latest release for your platform:

| Platform | Download | Notes |
|----------|----------|-------|
| ๐ŸชŸ **Windows** | [Download .exe](https://github.com/dortanes/gemini-desktop/releases/latest) | NSIS installer with auto-updater |
| ๐ŸŽ **macOS** | ๐Ÿšง Coming Soon | Universal binary (Intel + Apple Silicon) |
| ๐Ÿง **Linux** | ๐Ÿšง Coming Soon | AppImage, .deb, and .rpm packages |

> **Note:** Currently, only Windows builds are available. macOS and Linux builds are coming soon! ๐Ÿš€

### ๐Ÿ”ง Installation Instructions

#### Windows
1. Download the installer
2. Run the installer
3. Follow the setup wizard
4. Launch from Start Menu or Desktop shortcut

#### macOS & Linux
These platforms are currently under development. You can still run the app from source code (see [Development](#-development) section below) or wait for the official releases coming soon!

## ๐Ÿš€ Quick Start

1. **Launch the Application**
- Use your desktop shortcut or application menu
- Or use the system tray icon for quick access

2. **Set Up Hotkeys** (Optional)
- Open Settings from the tray menu
- Configure global hotkeys for drawer and main window
- Customize voice mode hotkey (default: `Ctrl+Shift+V`)
- Save your preferences

3. **Start Using Gemini AI**
- The app loads Google's Gemini official website (gemini.google.com)
- Use drawer for quick queries
- Switch to full mode for extended conversations
- You can also open it in full screen mode (click F11)

4. **Voice Mode** ๐ŸŽค
- Press your voice mode hotkey (default: `Ctrl+Shift+V`) to activate hands-free mode
- Speak your question - the app will automatically detect when you're done
- Your speech will be converted to text and sent to Gemini
- The response will be read aloud using text-to-speech
- Perfect for hands-free interaction while working on other tasks

## ๐ŸŽค Voice Mode

The Voice Mode feature provides a completely hands-free way to interact with Gemini AI using speech-to-text and text-to-speech capabilities.

> **FYI**: This feature uses Gemini's original built-in speech-to-text (STT) and text-to-speech (TTS) functionality - no additional services or APIs required!

### How It Works

1. **Activation**: Press the voice mode hotkey (default: `Ctrl+Shift+V`)
2. **Voice Recording**: The app automatically starts recording your voice
3. **Smart Detection**: Automatically detects when you've finished speaking (2-second pause)
4. **Auto-Send**: Converts your speech to text and sends it to Gemini
5. **Audio Response**: Gemini's response is automatically read aloud using text-to-speech

## ๐Ÿ› ๏ธ Development

### ๐Ÿ“‹ Prerequisites

- **Node.js** 20+ ([Download](https://nodejs.org/))
- **yarn**
- **Git**

### ๐Ÿ”ง Setup

```bash
# Clone the repository
git clone https://github.com/dortanes/gemini-desktop.git
cd gemini-desktop

# Install dependencies
yarn install

# Start development server
yarn start
```

### ๐Ÿ—๏ธ Building

```bash
# Build for Windows specifically
yarn build-win

# Build for development testing
yarn dist
```

> **Note:** macOS and Linux build scripts are temporarily disabled but will be re-enabled soon. For now, focus is on Windows builds for better testing and stability.

### ๐Ÿงช Testing

```bash
# Run in development mode
yarn start

# Build and test locally
yarn dist
```

## โš™๏ธ Configuration

### ๐ŸŽ›๏ธ Application Settings

The app stores settings in your system's user data directory:
- **Windows**: `%APPDATA%/gemini-desktop/`
- **macOS**: `~/Library/Application Support/gemini-desktop/`
- **Linux**: `~/.config/gemini-desktop/`

## ๐Ÿ“„ License

This project is licensed under the **MIT License** - see the [LICENSE](LICENSE) file for details.

---

**Made with โค๏ธ by [dortanes](https://github.com/dortanes)**

โญ **Star this repo if you find it useful!** โญ